[Mono-bugs] [Bug 480178] System.Globalization.CharUnicodeInfo.GetUnicodeCategory() does not handle surrogate characters appropriately.
bugzilla_noreply at novell.com
bugzilla_noreply at novell.com
Fri May 14 16:54:57 EDT 2010
http://bugzilla.novell.com/show_bug.cgi?id=480178
http://bugzilla.novell.com/show_bug.cgi?id=480178#c29
Damien Diederen <dd at crosstwine.com> changed:
What |Removed |Added
----------------------------------------------------------------------------
Attachment #359239|0 |1
is obsolete| |
--- Comment #29 from Damien Diederen <dd at crosstwine.com> 2010-05-14 20:54:56 UTC ---
Created an attachment (id=362382)
--> (http://bugzilla.novell.com/attachment.cgi?id=362382)
create-category-table: Utility to generate reasonably-packed Unicode tables
This program generates bi-level category tables encoding the contents
of multiple variants of the Unicode category database.
Mono embeds a linear table with category codes for the Unicode BMP
(first 65536 codepoints), and lacks information about characters in
the astral planes--leading to requests such as bug 480178. Extending
the linear table to cover the full codespace is not an ideal solution
as that would expand the embedded "blob" by a factor of 17.
The new tables generated by this program can be used to support the
full 0000-10FFFF range of characters. Despite covering a much larger
codespace, the "page" sharing mechanism enabled by the additional
level of indirection allows the data size to shrink from 64 to ~30kB
for .NET 2.0-compatible tables, and ~48kB for combined 2.0+4.0 tables.
Cf. in-file comments for usage instructions.
--
Configure bugmail: http://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
You are the assignee for the bug.
More information about the mono-bugs
mailing list