[Mono-list] unicode trouble
Mon, 9 Feb 2004 07:32:14 -0800
> No, Gabor is not confused. Unicode has grown. It is now 20 bits, not 16.
> See for example <http://www.terena.nl/library/multiling/unicode/utf16.html>
> (which I just found by googling; it looks a bit out-of-date).
I had absolutely no clue about this ;) I've been using unicode for years and I
had never heard about 20 bits!
thanks for correcting me - I have some code to change in my C++ classes now.
> Unfortunately Windows, Java, and .NET all use 16-bit characters.
> That means that they must either (a) use UCS-2 encoding, i.e.
> don't support the new unicode characters such as "OLD ITALIC LETTER A";
> or (b) use UTF-16 encoding, which means that these characters which
> don't fit in 16 bits get represented as a pair of 16-bit codes.