[jdev] SASL debugging
Joe Hildebrand
hildjj at gmail.com
Sat Dec 10 15:27:50 CST 2005
A better example is
Å (U+212B: ANGSTROM SIGN)
Å (U+00C5: LATIN CAPITAL LETTER A WITH RING ABOVE)
Å (U+0041: LATIN CAPITAL LETTER A and U+030A: COMBINING RING ABOVE)
These all look the same, and pretty much mean the same thing.
Luckily, we don't have to argue about what the characters "mean",
that's a job for the Unicode consortium. For example, they have
decided that:
А (U+0410: CYRILLIC CAPITAL LETTER A)
does *not* map onto U+0041. Whatever.
The important thing is that for all three of the Å's above, they all
canonicalize (in NFKC) to the UTF-8 bytes:
41 CC 8A (hex)
or
61 CC 8A (hex)
if you've got case folding turned on.
This way you can compare them together for equality.
Oh, another favorite example of mine is Ⅷ (U+2167: ROMAN NUMERAL
EIGHT). This NFKC's to viii. There are some more examples here:
http://jabberstudio.org/cgi-bin/viewcvs.cgi/cvs/jabber-net/test/
stringprep/
On Dec 10, 2005, at 1:49 PM, Yves Goergen wrote:
> On 10.12.2005 12:28 (+0100), Matthias Wimmer wrote:
>> Examples of mapped characters are:
>>
>> “℉” (U+2109, single charater!) is mapped to “°f” (two
>> characters),
>> “™” (U+2122, single character!) is mapped to “tm” (two
>> characters),
>> “ℂ” (U+2102) is mapped to “c”,
>> “ℹ” (U+2139) is mapped to “i”,
>> “№” (U+2116, single character!) is mapped to “no” (two
>> characters),
>> “²” (U+00B2) is mapped to “2”.
>
> What's the point in mapping similar-looking characters to another one?
> They are simply not the same and mapping a character from one language
> set to one of an arbitrary other language can disturb sorting things
> very much. Imagine our alphabet was A,B,D,F,G,H,...,C,E only because C
> and E were mapped to the greerillew language characters that look
> similar (or vice versa). Well anyway, I don't think I need this for
> now.
> I'll simply make sure it's Unicode-capable, plugging in a string
> converter later is still possible.
>
> --
> Yves Goergen "LonelyPixel" <nospam.list at unclassified.de>
> "Does the movement of the trees make the wind blow?"
> http://newsboard.unclassified.de - Unclassified NewsBoard Forum
>
More information about the JDev
mailing list