[jcifs] Character Set discussions

Glass, Eric eric.glass at capitalone.com
Sat Feb 8 06:47:32 EST 2003


> > 
> > UTF-8 is not specified by RFC 2396, but is recommended by 
> RFC 2718.  It will
> > be necessary for compatibility with the IRI specification, 
> if and when that
> > becomes a standard.
> 
> You see what I mean, though.  The escaped string has no 
> meaning without
> context.  If I interpret it using UTF-8 I get one result 
> (character-wise)  
> and if I interpretit using UCS-2LE I'll get a different result (again,
> display character-wise).
> 
> Am I making any sense?
> 
> Chris -)-----

Yes.  The recommendation from RFC 2718 is to interpret the escapes as
characters from the UTF-8 character set.

The W3C has a good discussion of this in their "Character Model for the
World Wide Web" working draft:

http://www.w3.org/TR/charmod/#sec-URIs


Eric
> 
> -- 
> Samba Team -- http://www.samba.org/     -)-----   Christopher 
> R. Hertel
> jCIFS Team -- http://jcifs.samba.org/   -)-----   ubiqx 
> development, uninq.
> ubiqx Team -- http://www.ubiqx.org/     -)-----   crh at ubiqx.mn.org
> OnLineBook -- http://ubiqx.org/cifs/    -)-----   crh at ubiqx.org
> 
 
**************************************************************************
The information transmitted herewith is sensitive information intended only
for use by the individual or entity to which it is addressed. If the reader
of this message is not the intended recipient, you are hereby notified that
any review, retransmission, dissemination, distribution, copying or other
use of, or taking of any action in reliance upon this information is
strictly prohibited. If you have received this communication in error,
please contact the sender and delete the material from your computer.


More information about the jcifs mailing list