tdb API issues

Howard Chu hyc at highlandsun.com
Thu Apr 2 00:20:55 GMT 2009


tridge at samba.org wrote:
> Hi Howard,
>
> The error code isn't the only place where tdb doesn't mix well with
> threads. Things like tdb_traverse() involve state in the tdb context,

Ah, I wasn't planning to use tdb_traverse() so I hadn't even looked at that. 
It would probably require splitting the traverse state out of the tdb_context 
and into its own dynamically allocated cursor struct.

> and all the tdb_lock primitives would affect the whole process, not
> just a thread.

I didn't expect this would be a problem since normally only one slapd process 
accesses a DB at a time. (With the exception that slapcat is allowed read-only 
access.) I figured using a rdwr lock to control calls into tdb should be 
sufficient. Did I miss something?

> The same goes for the transaction code. For a threaded environment,
> you'd like it if it worked so that when thread1 was in a transaction,
> that thread2 would be blocked in writes, and reads would see the
> pre-transaction data (this is what happens with two processes in tdb).

Yes, I was thinking that we'd be getting dirty reads when I looked at this 
code; I'd forgotten that it was protecting against other processes and not 
other threads. So we'd need a way to make sure only the thread that invoked 
the transaction used transaction_read(), while all the other threads still 
fell into the default tdb_read(). Sounds like we need an actual transaction 
handle as well, instead of just leaving all this in the tdb_context.

> If you can arrange it so that each thread has its own tdb context then
> life will be much simpler. That gives you a separate ecode per thread
> as well. The problem is that you'll probably need to use some other
> locking mechanism than fcntl locks, so you'll need to offer a hook
> that abstracts away the locking primitives. Trying to make this work
> so that tdb operations can also be performed safely by other processes
> on the same tdb that is being used by your threaded task will be
> interesting (I think its possible, just a bit tricky).

I'd thought of that at first, but it seems that a separate context per thread 
would also multiply the address space consumed per DB, since they would each 
have their own complete mmap of the data. Doesn't sound too practical.

Yeah, coordinating the fcntl locks across multiple tdb_context's would also be 
a pain. Pretty sure that splitting to a separate context per thread is the 
wrong level of granularity... ;)

> tdb and threads certainly isn't easy :-)
>
> Cheers, Tridge
>
-- 
   -- Howard Chu
   CTO, Symas Corp.           http://www.symas.com
   Director, Highland Sun     http://highlandsun.com/hyc/
   Chief Architect, OpenLDAP  http://www.openldap.org/project/


More information about the samba-technical mailing list