[Mono-bugs] [Bug 480152] New: string.Normalize() frequently produces incorrect output
bugzilla_noreply at novell.com
Thu Feb 26 18:25:54 EST 2009
https://bugzilla.novell.com/show_bug.cgi?id=480152
Summary: string.Normalize() frequently produces incorrect
output
Classification: Mono
Product: Mono: Class Libraries
Version: SVN
Platform: Macintosh
OS/Version: Mac OS X 10.5
Status: NEW
Severity: Normal
Priority: P5 - None
Component: CORLIB
AssignedTo: mono-bugs at lists.ximian.com
ReportedBy: dmitchell at logos.com
QAContact: mono-bugs at lists.ximian.com
Found By: ---
User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10_5_6; en-us)
AppleWebKit/525.27.1 (KHTML, like Gecko) Version/3.2.1 Safari/525.27.1
A number of examples could be provided. Here are a couple, taken from
http://unicode.org/reports/tr15/
"\u212B" -- Form D is "\u0041\u030A", but Mono provides "\u00C5" (Taken from
Figure 3. Singletons)
"\u1E69" -- Form D is "\u0073\u0323\u0307" but Mono provides "\u1E63\u0307"
(Taken from Figure 5. Multiple Combining Marks)
Additionally, I believe:
Form D of "\u1e4e" should be "\u004f\u0303\u0308", but Mono provides
"\u00d5\u0308"--incidentally, you get "\u004f\u0303\u0308" if you try to
normalize "\u00d5\u0308".
Form D of "\u1e2f" should be "\u0069\u0308\u0301", but Mono provides
"\u00ef\u0301"--this also comes out correctly if doubly-normalized.
Form C of "\u0041\u030a\u0061\u030a" should be "\u00c5\u00e5" but Mono provides
"\u212B\u00e5"--however, Mono correctly reports form C of "\u0041\u030a" as
"\u00c5".
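The expected values above can be cross-checked against an independent
implementation. The following is a minimal sketch using Python's standard
unicodedata module, not Mono; the helper names (hexes, decompose_once,
check_cases) are made up for illustration. It also shows that applying each
character's canonical mapping only once, without recursing, yields the same
strings Mono returns for the Form D cases. That is only an observation about
the pattern of wrong outputs, not a diagnosis of Mono's code.

```python
import unicodedata


def hexes(s):
    """Render a string as space-separated U+XXXX code points."""
    return " ".join(f"U+{ord(ch):04X}" for ch in s)


def decompose_once(s):
    """Apply each character's canonical mapping a single time (no recursion).

    Hypothetical helper: deliberately incomplete, to compare with the
    outputs quoted in the report.
    """
    out = []
    for ch in s:
        mapping = unicodedata.decomposition(ch)
        # Compatibility mappings start with a "<tag>"; canonical ones do not.
        if mapping and not mapping.startswith("<"):
            out.extend(chr(int(cp, 16)) for cp in mapping.split())
        else:
            out.append(ch)
    return "".join(out)


# (input, normalization form, expected output), taken from the report.
CASES = [
    ("\u212B", "NFD", "\u0041\u030A"),
    ("\u1E69", "NFD", "\u0073\u0323\u0307"),
    ("\u1E4E", "NFD", "\u004F\u0303\u0308"),
    ("\u1E2F", "NFD", "\u0069\u0308\u0301"),
    ("\u0041\u030A\u0061\u030A", "NFC", "\u00C5\u00E5"),
]


def check_cases():
    """Return one boolean per case: does the reference match the report?"""
    return [
        unicodedata.normalize(form, src) == expected
        for src, form, expected in CASES
    ]


if __name__ == "__main__":
    for src, form, expected in CASES:
        got = unicodedata.normalize(form, src)
        print(f"{form}({hexes(src)}) = {hexes(got)}; expected {hexes(expected)}")
    # A single non-recursive pass over the Form D inputs, for comparison.
    for src in ("\u212B", "\u1E69", "\u1E4E", "\u1E2F"):
        print(f"one pass: {hexes(src)} -> {hexes(decompose_once(src))}")
```

Running decompose_once a second time on its own output gives the fully
decomposed string for these inputs, which matches the remark above that
doubly-normalizing produces the correct result.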
Reproducible: Always
Steps to Reproduce:
1. Compile and run code that calls string.Normalize() on any of the example
strings above.
Actual Results:
Normalize() frequently returns strings that do not match the Form C and Form D
results given in the examples above.
Expected Results:
Normalize() should return the normalization forms defined at
http://unicode.org/reports/tr15/.