Java - StreamTokenizer whitespaceChars(int low, int hi) method



Description

The Java StreamTokenizer whitespaceChars(int low, int hi) method specifies that all characters c in the range low <= c <= high are white space characters. White space characters serve only to separate tokens in the input stream. Any other attribute settings for the characters in the specified range are cleared.

Declaration

Following is the declaration for java.io.StreamTokenizer.whitespaceChars(int low, int hi) method.

 public void whitespaceChars(int low, int hi) 

Parameters

  • low − The low end of the range.

  • high − The high end of the range.

Return Value

This method does not return a value.

Exception

NA

Example - Usage of StreamTokenizer whitespaceChars(int low, int hi) method

The following example shows the usage of StreamTokenizer whitespaceChars(int low, int hi) method.

StreamTokenizerDemo.java

 package com.tutorialspoint; import java.io.BufferedReader; import java.io.FileInputStream; import java.io.FileOutputStream; import java.io.InputStreamReader; import java.io.ObjectInputStream; import java.io.ObjectOutputStream; import java.io.Reader; import java.io.StreamTokenizer; public class StreamTokenizerDemo { public static void main(String[] args) { String text = "Hello. This is a text \n that will be split " + "into tokens. 1 + 1 = 2"; try { // create a new file with an ObjectOutputStream FileOutputStream out = new FileOutputStream("test.txt"); ObjectOutputStream oout = new ObjectOutputStream(out); // write something in the file oout.writeUTF(text); oout.flush(); // create an ObjectInputStream for the file we created before ObjectInputStream ois = new ObjectInputStream(new FileInputStream("test.txt")); // create a new tokenizer Reader r = new BufferedReader(new InputStreamReader(ois)); StreamTokenizer st = new StreamTokenizer(r); // set letters o- t as white space chars st.whitespaceChars('o', 't'); // print the stream tokens boolean eof = false; do { int token = st.nextToken(); switch (token) { case StreamTokenizer.TT_EOF: System.out.println("End of File encountered."); eof = true; break; case StreamTokenizer.TT_EOL: System.out.println("End of Line encountered."); break; case StreamTokenizer.TT_WORD: System.out.println("Word: " + st.sval); break; case StreamTokenizer.TT_NUMBER: System.out.println("Number: " + st.nval); break; default: System.out.println((char) token + " encountered."); if (token == '!') { eof = true; } } } while (!eof); } catch (Exception ex) { ex.printStackTrace(); } } } 

Output

Let us compile and run the above program, this will produce the following result −

 Word: AHell Number: 0.0 Word: Thi Word: i Word: a Word: ex Word: ha Word: will Word: be Word: li Word: in Word: ken Number: 0.0 Number: 1.0 + encountered. Number: 1.0 = encountered. Number: 2.0 End of File encountered. 

Example - Treat comma (,) and space as whitespace

The following example shows the usage of StreamTokenizer whitespaceChars(int low, int hi) method.

StreamTokenizerDemo.java

 package com.tutorialspoint; import java.io.IOException; import java.io.Reader; import java.io.StreamTokenizer; import java.io.StringReader; public class StreamTokenizerDemo { public static void main(String[] args) throws IOException { String input = "Java,Python C++"; Reader reader = new StringReader(input); StreamTokenizer tokenizer = new StreamTokenizer(reader); // Treat space and comma as whitespace tokenizer.whitespaceChars(32, 32); // space tokenizer.whitespaceChars(',', ','); // comma tokenizer.wordChars('a', 'z'); tokenizer.wordChars('A', 'Z'); tokenizer.wordChars('+', '+'); // to include '+' in C++ System.out.println("Tokens:"); while (tokenizer.nextToken() != StreamTokenizer.TT_EOF) { if (tokenizer.ttype == StreamTokenizer.TT_WORD) { System.out.println("Word: " + tokenizer.sval); } } } } 

Output

Let us compile and run the above program, this will produce the following result−

 Tokens: Word: Java Word: Python Word: C++ 

Explanation

  • Both comma and space are whitespace, so tokens are split by them.

  • They don't appear in the output.

Example - Treat digits 0-9 as whitespace (stripping them out)

The following example shows the usage of StreamTokenizer whitespaceChars(int low, int hi) method.

StreamTokenizerDemo.java

 package com.tutorialspoint; import java.io.IOException; import java.io.Reader; import java.io.StreamTokenizer; import java.io.StringReader; public class StreamTokenizerDemo { public static void main(String[] args) throws IOException { String input = "a1b2c3"; Reader reader = new StringReader(input); StreamTokenizer tokenizer = new StreamTokenizer(reader); tokenizer.whitespaceChars('0', '9'); // treat all digits as whitespace tokenizer.wordChars('a', 'z'); tokenizer.wordChars('A', 'Z'); System.out.println("Tokens:"); while (tokenizer.nextToken() != StreamTokenizer.TT_EOF) { if (tokenizer.ttype == StreamTokenizer.TT_WORD) { System.out.println("Word: " + tokenizer.sval); } } } } 

Output

Let us compile and run the above program, this will produce the following result−

 Tokens: Word: a Word: b Word: c 

Explanation

  • The digits 1, 2, and 3 are treated as whitespace.

  • They are used to split the words but do not appear as tokens.

java_io_streamtokenizer.htm
Advertisements