Support for UTF-8

Dmitry Borodaenko d.borodaenko at sam-solutions.net
Thu Feb 28 03:26:47 GMT 2002


On Thu, Feb 28, 2002 at 11:42:03AM +0100, Luis Bermejo wrote:
> This is my first post here. We have developed a program that is based on samba
> and we would like to offer support for unicode. There is no problem with that
> as samba offers support for Unicode but we would like to use the codepage utf-8
> and samba doesn't support it. Currently we are using samba 2.0.7

For now, you can take old patch by Igor Vergeichik, look at samba ML
archive for the message with headers:

From: Dmitry Borodaenko <angdraug at mail.ru>
Subject: UTF-8 patch for Samba 2.0.7
Date: Wed, 28 Mar 2001 14:04:33 +0300

> Is it planned to include this code page in the future?. How can we add
> this code page to samba?. I think this is not exactly unicode as
> certain characters are three bytes long in utf8.

In short: wait for 3.0, it will support Unicode (and UTF-8 in
particular) in its full glory.

Long version: try to learn more about Unicode (unicode.org). Unicode
itself is a charset, that is agreed upon ordered set of characters from
almost all languages on Earth. UTF-8 is one of the encodings, a method
to represent string of Unicode characters as a string of bytes. What you
are talking about (two-byte characters, right?) is UCS-2, another
encoding for a subset of Unicode (because all Unicode doesn't fit into
two bytes).

-- 
Dmitry Borodaenko




More information about the samba-technical mailing list