560 likes | 576 Views
Learn about string manipulation, character access, substring extraction, and utility functions in Java. Understand the difference between String and StringBuffer classes.
E N D
http://www.csie.nctu.edu.tw/~tsaiwn/oop/ Programming in Java String and String Parser java.util.* -- Vector, Stack, HashTable… 蔡文能 交通大學資訊工程學系 tsaiwn@csie.nctu.edu.tw
Agenda • Character, String class and the StringBuffer class • Character in a String -- charAt(int i) • SubString in a String –substring(int p, int n) • More String/StringBuffer methods • Type Wrappers again • Utilities -- java.util.* • StringTokenizer • Vector, Stack, HashTable
Consider how to read a number? • In C Language: • scanf with %d format (%f for float, %lf for double) • May cause program down due to strange input • May forget to pass address of variable to scanf • Better way • Read it into program as a string ( how ? ) • gets(buf); /* not good enough */ • fgets(buf, sizeof(buf), stdin); /* buf is a char array */ • Then, use sscanf to scan that string; or use the following utility functions: • atol, atof, strtol, strtod, strtok, strsep • What about in Java ?
String in Java is not char array • String is not a char array in Java • String is different from StringBuffer • Elements in String/StringBuffer can be accessed via using the charAt( position ) method. • The content of a String can not be modified • substring ? • capacity of a StringBuffer ? Size of a String Buffer ? • size <= capacity
Characters • In Java single characters are represented using the data type char. ( do NOT confuse with the wrapper Character class) • char is one of the 8 primitive data types. • Character constants are written as symbols enclosed in single quotes, for example, 'a', 'X', and '5'. ( ' : apostrophe, " : quote ) • To represent characters in computer, U. S. computer manufacturers devised several coding schemes. • One coding scheme widely used today is ASCII (American Standard Code for Information Interchange). • To accommodate the character symbols of non-English languages, the Unicode Consortium established the Unicode Worldwide Character Standard, commonly known as Unicode => http://www.unicode.org/
Not String Strings • Features: • Strings are objects and java.lang.String is the class. • Use 16 bit UNICODE • All objects (java.lang.Object) have methods toString(); • Special treatment of the + operator "Object" + obj + "will call toString( ) function" new StringBuffer("Object").append(obj.toString()).append("will call toString( ) function").toString() • Example: System.out.println("Hello World"); String hey = "Hello"; String you = "World"; System.out.println(hey + "" + you); int len = hey.length(); • Notice the performance.
StringBuffer • For the StringBuffer Class • StringBuffer provides length and charAt accessor methods, as well as the String class. • capacity method • Returning the amount of space currently allocated for the StringBuffer, rather than the amount of space used. • For example, in reverseIt method (next page) : • dest.capacity() will never change. • dest.length() will increase for each iteration. • Initializing a StringBuffer's capacity to a reasonable first guess. • Minimizing the number of times memory must be allocated for it.
String vs. StringBuffer • Use the String class when you are working with strings that cannot change. • StringBuffer is used when you want to manipulate the contents of the string on the fly. public class ReverseString { public static String reverseIt(String source) { int i, len = source.length(); StringBuffer dest = new StringBuffer(len); for (i = (len - 1); i >= 0; i--) dest.append(source.charAt(i)); return dest.toString(); } }
Creating Strings and StringBuffers • Creating a String • String a = "Gobbledy gook.”; • new String("Gobbledy gook."); • Creating a StringBuffer • StringBuffer dest = new StringBuffer(len); • StringBuffer's default constructor may leaves the buffer's length undetermined. • It's more efficient to specify the length of the buffer.
String name1; name1 = new String( "Latte" ); String name1; name1 = "Latte"; These two statements are equivalent. String and StringBuffer are Objects • String is a class in the java.lang package. • Because String is a class, we need to create an instance of String in Java for string processing. Like any other objects, we need a declaration and object creation for the instances of the String class. For example, But we normally use a shorthand, instead, treating String objects much like primitive data. For example,
Accessing char Elements in a String • Individual characters in a String accessed with the charAt method. • char charAt(intindex) • Returns the character at the specified index. • An index ranges from 0 to length( ) - 1. • The first character of the sequence is at index 0, the next at index 1, and so on, as for array indexing. • 不可以直接使用 s[n]而要改用 s.charAt(n)
0 1 2 3 4 5 6 E x a m p l e String name = "Example"; name name.charAt( 3 ) This variable refers to the whole string. The method returns the character at position # 3. Accessing char Elements in a String (cont.)
substring in a String/StringBuffer • substring in a String accessed with the substring method. • Stringsubstring(intbeginIndex) • Returns a new string that is a substring of this string. The substring begins with the character at the specified beginIndex and extends to the end of this string. • Examples: "unhappy". • substring(2) returns "happy" • substring(3) returns “appy" • Stringsubstring(intbeginIndex, intendIndex) • Returns a new string that is a substring of this string. The substring begins at the specified beginIndex and extends to the character at index endIndex - 1. Thus the length of the substring is endIndex-beginIndex. • Examples: "hamburger". • substring(4, 8) returns "urge" • "smiles". • substring(1, 5) returns "mile"
7 3 0 Error! name.length( ); str1.length( ); str2.length( ); str3.length( ); String name = “Sumatra”, str1 = “one”, str2 = “”, str3; Determining the String Size • We determine the number of characters in a String with the length method. Error because no object is created for str3, so it is a null.
Example using length( ) , charAt(int i) • Reverse a string public class ReverseString { public static String reverseIt(String source) { int i, len = source.length(); StringBuffer dest = new StringBuffer(len); for (i = (len - 1); i >= 0; i--) dest.append(source.charAt(i)); return dest.toString(); } }
More String/StringBuffer Methods • intindexOf(int character) • int lastIndexOf(int character) • Return the index of the first (last) occurrence of the specified character. • int indexOf(int character, int from) • int lastIndexOf(int character, int from) • Return the index of the first (last) occurrence of the specified character, searching forward (backward) from the specified index. • voidgetChars(intsrcBegin, intsrcEnd, char[]dst, intdstBegin) • Copies characters from this string into the destination character array. • booleanequals(ObjectanObject) • Compares this string to the specified object. • Objectclone() throws CloneNotSupportedException • Creates and returns a copy of this object.
String Concatenation • Concatenation and the + Operator • In Java, you can use + to concatenate String: String cat = "cat"; System.out.println("con" + cat + "enation"); • But the compiler uses StringBuffer to implement concatenation behind the scenes. String cat = "cat"; System.out.println(new StringBuffer().append("con"). append(cat).append("enation").toString());
Converting Objects to Strings • The toString Method public class ReverseString { public static String reverseIt(String source) { int i, len = source.length(); StringBuffer dest = new StringBuffer(len); for (i = (len - 1); i >= 0; i--) dest.append(source.charAt(i)); return dest.toString(); } } • All classes inherit toString from the Object class. • Many classes in the java.lang package override this method to provide an implementation that is meaningful to that class. • The valueOf Method • You can use valueOf to convert variables of different types to String. System.out.println(String.valueOf(Math.PI));
Object Converting Strings to Numbers • The "type wrapper" classes (Integer, Long, Float, and Double) provide a class method named valueOf that converts a String to an object of that type. String piStr = "3.14159"; Float pi = Float.valueOf(piStr); /*物件*/ float pi2 = Float.parseFLOAT(piStr); /*值*/ class method = static method
TYPE Wrapper Long (1/2) • As we mentioned before, every Primitive data type has correspondingtype wrapper. • Long • Field Summary staticlong MAX_VALUE = The largest value of type long. staticlong MIN_VALUE = The smallest value of type long. staticClassTYPE = The Class object representing the primitive type long. • Constructor Summary Long(longvalue) : Constructs a newly allocated Long object that represents the primitive long argument. Long(Strings) : Constructs a newly allocated Long object that represents the value represented by the string in decimal form.
TYPE Wrapper Long (2/2) • some methods of the class Long • staticStringtoHexString(longi) Creates a string representation of the long argument as an unsigned integer in base 16. • staticStringtoOctalString(longi) Creates a string representation of the long argument as an unsigned integer in base 8. • staticlongparseLong(Strings) Parses the string argument as a signed decimal long. • staticlongparseLong(Strings, intradix) Parses the string argument as a signed long in the radix specified by the second argument. • staticlongvalueOf(Strings) Returns a new long object initialized to the value of the specified String.
Strings and the Java Compiler • Literal Strings • You can use literal strings anywhere you would use a String object. System.out.println("Hello World!"); • You can also use String methods directly from a literal string. int len = "Goodbye Cruel World".length(); • These two constructs are equivalent. String s = "Hala Hala"; String s = new String("Hala Hala"); • But first one which is more efficient than the second. • The second end up creating two String objects instead of one.
More about I/O : C • Read an Integer • C static char buf[999]; fgets(buf, sizeof(buf), stdin); /* read one line first */ static char buf[999]; file * fp; int haha; fp = fopen("myinput.dat", "rt"); fgets(buf, sizeof(buf), fp); /* read one line first */ haha = atol(buf);
More about I/O : C++ • Read an Integer • C++ static char buf[999]; cin.getline(buf, sizeof(buf) ); /* read one line first */ static char buf[999]; fstream fs; int haha; fs.open( "myinput.dat", ios::in); fs.getline(buf, sizeof(buf) ); /* read one line first */ int haha = atol(buf);
More about I/O : Java • Read an Integer • Java • Use BufferedReader • Stream 要先轉為 Reader: InputStreamReader • 把Reader 丟給BufferedReader 包起來 • Use readLine( ) in BufferedReader • Use ???.parse??? ( Double.parseDouble( yourString ) ); • i.e., methods in class wrapper • Use StringTokenizer • To parse string contains several tokens
Example: using BufferedReader InputStreamReader isr = new InputStreamReader(System.in); /* 鍵盤是以 byte 為單位的 Stream */ BufferedReader br = new BufferedReader(isr); /* BufferedReader 的 constructor 只接受 Reader */ /* 所以不能直接把 System.in 丟給它 */ String stmp = br.readLine( ); /* 讀入一整列 */ long x = Long.parseLong(stmp); /* if it is a long */ /* 使用 class wrapper */ double x = Double.parseDouble(stmp); /* if double */ 一列有多項輸入則可使用 StringTokenizer
Utilities ( java.util.* ) • In java.util.*, • StringTokenizer • Vector • Hashtable • Properties • ... • Vector is a growable array of objects. • In Hashtable, an entry has two parts: a name and a value. • Pairs of name/value are objects. • Use hashing to find each pair. • Also known as associative array, or called “map”. • In Properties, inherits from Hashtable. • Pairs of name/value are Strings.
StringTokenizer (2/3) import java.io.*; import java.util.*; public class Test { InputStreamReader isr= new InputStreamReader(System.in); BufferedReader b = new BufferedReader(isr); public static void main ( String p[ ] ) throws Exception { new Test( ); } Test( ) throws Exception { System.out.print("In: "); String x = b.readLine(); System.out.println("Got "+ x); StringTokenizer s = new StringTokenizer(x, " ,\t" ); while(s.hasMoreTokens( ) ){ System.out.println(s.nextToken( )); } System.out.println("======"); } }
StringTokenizer (3/3) System.out.print("\r\nYour 三圍, 空白或逗號隔開: "); String tmps = b.readLine(); StringTokenizer s = new StringTokenizer(tmps, " ,\t"); int kk=0; round[2] = round[1] = round[0] =0; try{ while(s.hasMoreTokens () ) { round[ kk++ ] = Double.parseDouble( s.nextToken() ); if(kk > 3 ) break; } } catch (Exception e) { System.out.println("Missing "+ (4-kk) + " round"); kk=3;} if(kk < 3 ) { System.out.println("Missing "+ (3-kk) + " round");} System.out.println("===Thank you!==="); PrintStream myoo = System.out; myoo.println("\r\n=== Your Name is " + name); myoo.println("Height = " + height + " \t" + "Weight = " + wet); myoo.print("Three circle(三圍)== "); DecimalFormat dd = new DecimalFormat("####0.00"); myoo.print( dd.format( round[0] ) ); myoo.print( " , " + dd.format( round[1] ) +" , "+ dd.format(round[2]) ); myoo.println("\r\n====== ====== ======");
Vectors (1/5) • This class is in the java.util package • You make it available to your program with import statement “import java.util.Vector” or “import java.util.*” at the top of your program. • Intended for dynamic sizes of elements. • Lets you add arbitrary objects to a list of objects • E.g., addElement(x); • Reallocate the internal array when grows doubly (or a constant factor).
Vectors (2/5) • Performance: flexible and efficient • Access time complexity: O(1) • Time complexity for addElement(x): • Amortized for access: O(1) • Times of calling malloc(): O((log n)/n) • Memory complexity: No more than 2 or constant times more. • But, • Not efficient for removeElement(). • Not so efficient because it uses synchronized. • Sun internally uses a class named FastVector, without synchronized. • Similarly, for StringBuffer and Hashtable. • Note: for String, don’t concatenate strings in a loop!
Vectors (5/5) java.util.Vector java.lang.Object | + - - java.util.AbstractCollection | + - - java.util.AbstractList | + - - java.util.Vector Direct Known Subclasses: Stack
Interface Queue • java.util InterfaceQueue <E> • Type Parameters: • E - the type of elements held in this collection • Since: • JDK 1.5
Hashtable (1/3) • Implements a hashtable in Java • Hashtables have keys and values Key Value “Joe” employee object “Fred” employee object “Heather” employee object
Hashtable (3/3) • A hashtable can be used to quickly look up an object associated with a key. • The key uses the hashcode() method for fast access • get() and put() are used to add and remove objects from the table • elements() returns an object that lets you cycle through the hashtable Also take a look at HashMap (The HashMap class is roughly equivalent to Hashtable, except that it is unsynchronized and permits nulls.)
Properties • The Properties class itself provides methods for the following: • Loading key/value pairs into a Property object from a stream • Retrieving a value from its key • Listing the keys and their values • Enumerating over the keys • Saving the properties to a stream • Properties extends the Hashtable class: • Testing to see if a particular key or value is in the Properties object. • Getting the current number of key/value pairs. • Removing a key and its value. • Adding a key/value pair to the Properties list. • Enumerating over the values or the keys. • Retrieving a value by its key. • Finding out if the Properties object is empty. • Important to be used as something like configuration files or *.ini files.
Loading from FileInputStream • Setting Up Your Properties Object • loading the default properties and loading the remembered properties: . . . // create and load default properties Properties defaultProps = new Properties(); FileInputStream in = new FileInputStream("defaultProperties"); defaultProps.load(in); in.close(); // create program properties with default Properties applicationProps = new Properties(defaultProps); // now load properties from last invocation in = new FileInputStream("appProperties"); applicationProps.load(in); in.close(); . . .