[jcifs] Character Set discussions
Christopher R. Hertel
crh at ubiqx.mn.org
Sat Feb 8 05:13:36 EST 2003
On Fri, Feb 07, 2003 at 01:03:33PM -0500, Glass, Eric wrote:
> > > UNescape characters, however; if you are given:
> > >
> > > smb://svr/slovak/m%C3%B4%C5%BEem/jes%C5%A5/sklo/nezran%C3%AD/ma.zip
> > >
> > > that is a valid URL, and should work. This would be
> > especially important
> >
> > I need, for my own sake, to go back over something that I
> > think we covered
> > earlier. In the above case, using escapes, the software that
> > converts the
> > escaped string into internal format needs to know which
> > encoding to use.
> >
> > If I recall correctly, the standard is (or is planned to be)
> > UTF-8, yes?
> >
>
> UTF-8 is not specified by RFC 2396, but is recommended by RFC 2718. It will
> be necessary for compatibility with the IRI specification, if and when that
> becomes a standard.
You see what I mean, though. The escaped string has no meaning without
context. If I interpret it using UTF-8 I get one result (character-wise)
and if I interpretit using UCS-2LE I'll get a different result (again,
display character-wise).
Am I making any sense?
Chris -)-----
--
Samba Team -- http://www.samba.org/ -)----- Christopher R. Hertel
jCIFS Team -- http://jcifs.samba.org/ -)----- ubiqx development, uninq.
ubiqx Team -- http://www.ubiqx.org/ -)----- crh at ubiqx.mn.org
OnLineBook -- http://ubiqx.org/cifs/ -)----- crh at ubiqx.org
More information about the jcifs
mailing list