Setting up CTDB on OCFS2 and VMs ...

Wed Dec 31 10:59:16 MST 2014

On 2014-12-31 at 15:46 +0000, Rowland Penny wrote:
> 
> OK, I have been having another attempt at the ctdb cluster, I cannot get
> both nodes healthy if I use a lockfile in /etc/default/ctdb,

This can't be expected to work, since a recovery lock file needs
to be on shared storage (clustered file system, providing posix
fcntl byte range lock semantics), and /etc/default/ is not
generally such a place, unless you are building a fake cluster
with multiple ctdb instances running on one host.

> so I have commented it out, both nodes are now showing OK.

It is possible to run w/o recovery lock, but it is not
recommended in a production setup at least.

> I then moved on to trying
> to get samba to join the domain, but it always fails with this error
> message:
> 
> Could not initialise message context. Try running as root
> Failed to join domain: Access is denied
> 
> I have investigated ctdb on my system and have come to the conclusion that
> ctdb is a *MESS*, don't believe me ? then consider this:
> 
> root at cluster1:~# ls /var/ctdb
> iptables-ctdb.flock  persistent  state
> root at cluster1:~# ls /var/lib/ctdb
> iptables-ctdb.flock  persistent  state
> root at cluster1:~# ls /var/lib/lib/ctdb
> brlock.tdb.1           iptables-ctdb.flock  persistent
> smbXsrv_open_global.tdb.1     smbXsrv_version_global.tdb.1
> dbwrap_watchers.tdb.1  locking.tdb.1        printer_list.tdb.1
> smbXsrv_session_global.tdb.1  state
> g_lock.tdb.1           notify_index.tdb.1   serverid.tdb.1
> smbXsrv_tcon_global.tdb.1
> root at cluster1:~# ls /var/ctdb/persistent/
> root at cluster1:~# ls /var/ctdb/state/
> failcount  interface_modify_eth0.flock    service_state
> root at cluster1:~# ls /var/lib/ctdb/persistent/
> root at cluster1:~# ls /var/lib/ctdb/state/
> failcount  interface_modify_eth0.flock    service_state
> root at cluster1:~# ls /var/lib/lib/ctdb/persistent/
> account_policy.tdb.1  ctdb.tdb.0  ctdb.tdb.1  group_mapping.tdb.1
> passdb.tdb.1  registry.tdb.1  secrets.tdb.1    share_info.tdb.1
> root at cluster1:~# ls /var/lib/lib/ctdb/state
> failcount  interface_modify_eth0.flock    persistent_health.tdb.1
> recdb.tdb.1  service_state
> 
> Why have very similar data in 3 places ? why have the conf (which
> incidentaly isn't called a conf file) in a different place from the other
> ctdb files in /etc ?

That's essentially two places, one hierarchy under /var/ctdb
(old ctdb versions) and one hierarchy under /var/lib/ctdb (new
ctb versions) so my guess is that this stems from earlier
installs of older versions.

If you stop ctdb, remove both these directory trees, and then
restart ctdb, do both trees reappear?

> More to the point, Why, oh why doesn't it work.

Has the samba version been compiled against the used ctdb
version. One possible source of such problems is that
samba might have beenm compiled against an older version
of ctdb and then you install the latest version of ctdb.

The problem that could explain the "Could not initialize ..."
message would be that samba tries to access CTDB under the
socket file /tmp/ctdbd.socket (default in old ctdb versions)
and the new ctdbd uses /var/run/ctdb/ctdbd.socket by default.

So you could (without needing to recompile) test if things
work out more nicely if you set:

"ctdbd socket = /var/run/ctdb/ctdbd.socket"
in smb.conf

and (for the sake of explicitness):
"CTDB_SOCKET=/var/run/ctdb/ctdbd.socket"
in /etc/default/ctdb.

And then of course restart everything.
let's see if this improves anything...

Cheers - Michael

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 198 bytes
Desc: not available
URL: <http://lists.samba.org/pipermail/samba-technical/attachments/20141231/b52b513a/attachment.pgp>