[Mono-list] unicode trouble

max aranym@adelphia.net
Mon, 9 Feb 2004 07:32:14 -0800


>
> No, Gabor is not confused.  Unicode has grown.  It is now 20 bits, not 16.
> See for example <http://www.terena.nl/library/multiling/unicode/utf16.html>
> (which I just found by googling; it looks a bit out-of-date).

I had absolutely no clue about this ;) I've been using unicode for years and I 
had never heard about 20 bits!

thanks for correcting me - I have some code to change in my C++ classes now.

max

> Unfortunately Windows, Java, and .NET all use 16-bit characters.
> That means that they must either (a) use UCS-2 encoding, i.e.
> don't support the new unicode characters such as "OLD ITALIC LETTER A";
> or (b) use UTF-16 encoding, which means that these characters which
> don't fit in 16 bits get represented as a pair of 16-bit codes.