[Mono-list] string encoding

Miguel de Icaza miguel@ximian.com
22 Jun 2003 17:37:06 -0400


Hello,

> So is GTK# broken, if not why not, if yes how do I do it properly?
> Basically, how is string encoding handled?
> 
> The clean solution to me seems to be that CharSet would contain UTF-8
> as a value and CharSet=Auto would imply UTF-8 on UNIX, but I imagine
> this would be an unacceptable extension of standard APIs.

Today the mappings Mono uses are as follows:

	Ansi -> UCS-2 to UTF-8 conversion.
	Unicode -> UCS-2 to UCS-2
	Auto -> Same as ANSI.

We have already requested to allocate a couple of extra bits to gain
more precise control at ECMA.  The proposal is on its way to be
accepted, but Microsoft wanted to double check with its team about this.

Will post more information as it becomes available.

Miguel.