samba and utf-8-mac

Michael Sweet mike at easysw.com
Sun Jun 19 21:18:32 GMT 2005


Jeremy Allison wrote:
> On Sat, Jun 18, 2005 at 12:23:29PM +0400, Alexander Bokovoy wrote:
> 
> 
>>It is a UTF-8 variant which differs in that all composite characters are
>>decomposed.
>>http://developer.apple.com/technotes/tn/tn1150table.html
>>http://www.opensource.apple.com/darwinsource/10.4.1/libiconv-13/libiconv/lib/utf8mac.h
> 
> 
> Hmmm utf8 "variant"... Apple not using an international standard ! No :-). What a
> surprise.  Thinking "different" (ly) I guess :-).
> 
> I note that it's not listed on the unicode.org pages but only at apple.com...
> 
> So it's not surprising that glibc iconv doesn't support it (I believe that's
> the problem, yes ?).

Actually, no, it is just that Unicode comes in four
normalization forms: NFC (canonical composition), NFD (canonical
decomposition), NFKC (compatibility decomposition followed by
canonical composition), and NFKD (compatibility decomposition).

IIRC, Apple uses NFD (strictly, a variant of it on HFS+), while
text on UNIX/Linux is typically in NFC.
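
To make the composed/decomposed difference concrete, here is a small
Python sketch using the standard unicodedata module. The sample
characters are my own choice for illustration, and this shows plain
Unicode normalization only, not Apple's exact utf-8-mac rules.

```python
import unicodedata

# "é" as a single precomposed code point (U+00E9).
composed = "\u00e9"

# NFD splits it into "e" followed by COMBINING ACUTE ACCENT (U+0301).
decomposed = unicodedata.normalize("NFD", composed)

# Same visible character, different code points and different UTF-8 bytes,
# which is why a byte-wise filename comparison between the two forms fails.
print([hex(ord(c)) for c in composed])    # ['0xe9']
print([hex(ord(c)) for c in decomposed])  # ['0x65', '0x301']
print(composed.encode("utf-8"))           # b'\xc3\xa9'
print(decomposed.encode("utf-8"))         # b'e\xcc\x81'

# NFC recombines the decomposed sequence into the precomposed form.
recomposed = unicodedata.normalize("NFC", decomposed)
print(recomposed == composed)             # True

# The "K" forms additionally fold compatibility characters,
# e.g. the "fi" ligature (U+FB01) becomes the two letters "fi".
ligature = "\ufb01"
print(unicodedata.normalize("NFC", ligature) == ligature)  # True
print(unicodedata.normalize("NFKC", ligature))             # fi
```

So a name created on one side in NFD and looked up on the other in
NFC will not match unless something normalizes it along the way.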

-- 
______________________________________________________________________
Michael Sweet, Easy Software Products           mike at easysw dot com
Internet Printing and Publishing Software        http://www.easysw.com


More information about the samba-technical mailing list