[Mono-dev] Re: [Mono-devel-list] mcs patch for default encoding
Atsushi Eno
atsushi at ximian.com
Mon Aug 22 06:21:56 EDT 2005
Hi again,
> Agreed. In fact, I was also fixing bug #75065, maybe duplicate.
> I have a fix for UTF8Encoding, but it uncovered another mcs bug
> which does not handle files with BOM with specific encoding.
> To summarize the situation:
>
> - Currently driver.cs does not process source files with
> default encoding.
> - UTF8Encoding.cs does not handle U+FEFF correctly.
> - When we fix UTF8Encoding.cs to handle U+FEFF, it starts
> to reject some source files which have a BOM.
> (CS8025: Parsing error)
> - Even if we fix driver.cs to let StreamReader consider BOM
> (currently we disable it), there are still some files
> borking.
>
> Am digging into this bug in depth. Hopefully I'll post a set of
> fixes later.
... and now I have finished the fixes; they are in the attached patch:
- driver.cs :
a) use Encoding.Default as the default input encoding.
b) always pass true for BOM detection.
- support.cs : handle preamble_size precisely.
- UTF8Encoding.cs : it should not skip U+FEFF. This fixes
bug #73086 and #75065.
They should be applied together, except for a).
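
To illustrate the intended division of labour (this is not the patch itself, just a rough sketch in Python with hypothetical names such as read_source and KNOWN_PREAMBLES): the reader detects a BOM and skips exactly the preamble bytes, it falls back to a default encoding when no BOM is present, and the decoder itself never drops U+FEFF silently. The "latin-1" default below is only a stand-in for whatever Encoding.Default would be on a given system.

```python
import codecs

# Hypothetical table of preambles this sketch recognizes.
# UTF-32 is deliberately left out to keep the example short.
KNOWN_PREAMBLES = [
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def read_source(data: bytes, default_encoding: str = "latin-1") -> str:
    """Decode source bytes, honouring a BOM if one is present.

    The reader, not the decoder, is responsible for the preamble:
    it skips exactly len(bom) bytes and decodes the rest.
    """
    for bom, name in KNOWN_PREAMBLES:
        if data.startswith(bom):
            preamble_size = len(bom)
            return data[preamble_size:].decode(name)
    # No BOM: use the default encoding (assumed stand-in for Encoding.Default).
    return data.decode(default_encoding)


# The plain decoder keeps U+FEFF in its output; it does not skip it.
raw = codecs.BOM_UTF8 + b"class A {}"
decoded_without_reader = raw.decode("utf-8")   # starts with "\ufeff"
decoded_with_reader = read_source(raw)         # BOM bytes skipped by the reader
```

If the decoder skipped U+FEFF on its own while the reader also accounted for the preamble, the two layers could disagree about how many bytes have been consumed, which is roughly the kind of mismatch the support.cs change is meant to avoid.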
Atsushi Eno
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: mcs-encoding-fix.patch
Url: http://lists.ximian.com/pipermail/mono-devel-list/attachments/20050822/524ee492/attachment.pl