v***@gmail.com
2005-07-02 01:16:13 UTC
Hi,
I am a little confused of the definition of Unicode.
My understanding is that Unicode defines the set of characters (which
is a supset of most character reportires), but it doesn't define the
encoding scheme.
UTF-8, UTF-16 and UTF-32 are possible ones for unicode. But I keep
hearing that unicode will use 2 bytes per character. That's not always
true is it? B/c UTF-8 is smart enough to use 1 byte for Latin
characters I thought.
Please help.
Victor
I am a little confused of the definition of Unicode.
My understanding is that Unicode defines the set of characters (which
is a supset of most character reportires), but it doesn't define the
encoding scheme.
UTF-8, UTF-16 and UTF-32 are possible ones for unicode. But I keep
hearing that unicode will use 2 bytes per character. That's not always
true is it? B/c UTF-8 is smart enough to use 1 byte for Latin
characters I thought.
Please help.
Victor