Skip to Main Content

Java Programming

Announcement

For appeals, questions and feedback about Oracle Forums, please email oracle-forums-moderators_us@oracle.com. Technical questions should be asked in the appropriate category. Thank you!

Chinese (BIG5) encoding

807580Sep 19 2009 — edited Sep 19 2009
String tag = "稀釋";
byte[] big5Tag = tag.getBytes("Big5");
System.out.println(Arrays.toString(big5Tag));

Output: [63, 63, 63, 63]

There are TWO different Chinese characters in the string. But the encoding printouts are the same. I am not familiar about this. Can anybody help? Thx

(I need to parse a Chinese HTML table, eventually. What is the best HTML parser which support Chinese language?)
Comments
Locked Post
New comments cannot be posted to this locked post.
Post Details
Locked on Oct 17 2009
Added on Sep 19 2009
42 comments
783 views