[Mono-bugs] [Bug 362740] New: Support supplementary chars in C# \U form

bugzilla_noreply at novell.com bugzilla_noreply at novell.com
Mon Feb 18 10:20:20 EST 2008


           Summary: Support supplementary chars in C# \U form
           Product: Mono: Compilers
           Version: 1.9.0
          Platform: Other
        OS/Version: Other
            Status: NEW
          Severity: Enhancement
          Priority: P5 - None
         Component: C#
        AssignedTo: mono-bugs at lists.ximian.com
        ReportedBy: andyhume32 at yahoo.co.uk
         QAContact: mono-bugs at lists.ximian.com
          Found By: ---

Created an attachment (id=195451)
 --> (https://bugzilla.novell.com/attachment.cgi?id=195451)
Test cases

In C# the \Uxxxxxxxx escape sequence can be used to enter supplementary
codepoints (those in the range U+10000 to U+10FFFF).  In the UTF-16 encoding,
as used by .NET, codepoints in that range are represented in as a pair of chars
(codeunits) called a "surrogate pair".  The conversion is a simple arithmetic
conversion, see http://www.unicode.org/faq/utf_bom.html#UTF16  I thought I
would refer here to the code in the UTF32Encoding class, but it appears to no
support such codepoints either.  I will open a separate bug for that issue.

Mono appears not to support such usage, for instance in the first unit-test
attached the string contains a single char \x0041 rather that the surrogate
pair \xD800\xDC41.

Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
You are the assignee for the bug.

More information about the mono-bugs mailing list