[jcifs] Character Set discussions

Christopher R. Hertel crh at ubiqx.mn.org
Sat Feb 8 05:13:36 EST 2003


On Fri, Feb 07, 2003 at 01:03:33PM -0500, Glass, Eric wrote:
> > > UNescape characters, however; if you are given:
> > > 
> > > smb://svr/slovak/m%C3%B4%C5%BEem/jes%C5%A5/sklo/nezran%C3%AD/ma.zip
> > > 
> > > that is a valid URL, and should work.  This would be 
> > especially important
> > 
> > I need, for my own sake, to go back over something that I 
> > think we covered
> > earlier.  In the above case, using escapes, the software that 
> > converts the
> > escaped string into internal format needs to know which 
> > encoding to use. 
> > 
> > If I recall correctly, the standard is (or is planned to be) 
> > UTF-8, yes?
> > 
> 
> UTF-8 is not specified by RFC 2396, but is recommended by RFC 2718.  It will
> be necessary for compatibility with the IRI specification, if and when that
> becomes a standard.

You see what I mean, though.  The escaped string has no meaning without
context.  If I interpret it using UTF-8 I get one result (character-wise)  
and if I interpretit using UCS-2LE I'll get a different result (again,
display character-wise).

Am I making any sense?

Chris -)-----

-- 
Samba Team -- http://www.samba.org/     -)-----   Christopher R. Hertel
jCIFS Team -- http://jcifs.samba.org/   -)-----   ubiqx development, uninq.
ubiqx Team -- http://www.ubiqx.org/     -)-----   crh at ubiqx.mn.org
OnLineBook -- http://ubiqx.org/cifs/    -)-----   crh at ubiqx.org


More information about the jcifs mailing list