Chinese (BIG5) encoding
807580Sep 19 2009 — edited Sep 19 2009String tag = "稀釋";
byte[] big5Tag = tag.getBytes("Big5");
System.out.println(Arrays.toString(big5Tag));
Output: [63, 63, 63, 63]
There are TWO different Chinese characters in the string. But the encoding printouts are the same. I am not familiar about this. Can anybody help? Thx
(I need to parse a Chinese HTML table, eventually. What is the best HTML parser which support Chinese language?)