OpenLDAP backend broken by the new ldb.tdb event context work, would appreciate advice on how to fix
Andrew Bartlett
abartlet at samba.org
Thu Jul 20 00:40:41 UTC 2017
On Wed, 2017-07-19 at 17:00 +0300, Nadezhda Ivanova wrote:
> Hi team and Andrew,
> I'm trying to start using the newest Samba for my OpenLDAP backend work
> - after the long hiatus, I was using code that was 2 years old, so I
> decided to adapt to the newest master.
It is so good to see you back at this. It has been a long time!
> However, we are no longer able to provision an OpenLDAP backend. For one
> of the issues I have a workaround (adding "client ldap sasl wrapping
> = plain" to my smb.conf) until I'm able to fix it properly, but for the
> other I need some advice.
>
> While trying to create the domain naming context, provisioning hangs here:
>
>
> #0 0x00007ffff78f1673 in __epoll_wait_nocancel () at
> ../sysdeps/unix/syscall-template.S:81
> #1 0x00007ffff5b0a074 in epoll_event_loop (epoll_ev=0x1ec7360,
> tvalp=0x7fffffffc640) at ../lib/tevent/tevent_epoll.c:642
> #2 0x00007ffff5b0a98c in epoll_event_loop_once (ev=0x1ec70e0,
> location=0x7ffff64b0edd "../lib/ldb/common/ldb.c:637") at
> ../lib/tevent/tevent_epoll.c:930
> #3 0x00007ffff5b07693 in std_event_loop_once (ev=0x1ec70e0,
> location=0x7ffff64b0edd "../lib/ldb/common/ldb.c:637") at
> ../lib/tevent/tevent_standard.c:114
> #4 0x00007ffff5b0050f in _tevent_loop_once (ev=0x1ec70e0,
> location=0x7ffff64b0edd "../lib/ldb/common/ldb.c:637") at
> ../lib/tevent/tevent.c:721
> #5 0x00007ffff64a71bc in ldb_wait (handle=0x15a1fb0, type=LDB_WAIT_ALL)
> at ../lib/ldb/common/ldb.c:637
> #6 0x00007fffda0d3d7e in new_partition_set_replicated_metadata
> (ldb=0x1c52460, module=0x1b77090, last_req=0x1d5a070, data=0x15f8bd0,
> partition=0x1c75bd0)
> at ../source4/dsdb/samdb/ldb_modules/partition_init.c:601
> #7 0x00007fffda0d46da in partition_create (module=0x1b77090,
> req=0x174c980) at ../source4/dsdb/samdb/ldb_modules/partition_init.c:827
> #8 0x00007fffda0d21e2 in partition_extended (module=0x1b77090,
> req=0x174c980) at ../source4/dsdb/samdb/ldb_modules/partition.c:1172
> #9 0x00007ffff6488c42 in ldb_next_request (module=0x1b77090,
> request=0x174c980) at ../lib/ldb/common/ldb_modules.c:543
> #10 0x00007fffdaefe427 in np_part_search_callback (req=0x2162590,
> ares=0x15a47a0) at ../source4/dsdb/samdb/ldb_modules/new_partition.c:143
> #11 0x00007fffd6fd7275 in ltdb_request_done (ctx=0x1c9b4c0, error=32) at
> ../lib/ldb/ldb_tdb/ldb_tdb.c:1314
> #12 0x00007fffd6fd7556 in ltdb_callback (ev=0x1ec70e0, te=0x2303ba0,
> t=..., private_data=0x1c9b4c0) at ../lib/ldb/ldb_tdb/ldb_tdb.c:1424
> #13 0x00007ffff5b08602 in tevent_common_loop_timer_delay (ev=0x1ec70e0)
> at ../lib/tevent/tevent_timed.c:369
> #14 0x00007ffff5b0a90c in epoll_event_loop_once (ev=0x1ec70e0,
> location=0x7ffff64b0edd "../lib/ldb/common/ldb.c:637") at
> ../lib/tevent/tevent_epoll.c:915
> #15 0x00007ffff5b07693 in std_event_loop_once (ev=0x1ec70e0,
> location=0x7ffff64b0edd "../lib/ldb/common/ldb.c:637") at
> ../lib/tevent/tevent_standard.c:114
> #16 0x00007ffff5b0050f in _tevent_loop_once (ev=0x1ec70e0,
> location=0x7ffff64b0edd "../lib/ldb/common/ldb.c:637") at
> ../lib/tevent/tevent.c:721
> #17 0x00007ffff64a71bc in ldb_wait (handle=0x15af320, type=LDB_WAIT_ALL)
> at ../lib/ldb/common/ldb.c:637
> #18 0x00007ffff68c4af2 in py_ldb_add (self=0x7fffde1f1eb0,
> args=0x7fffd647d908, kwargs=0x0) at ../lib/ldb/pyldb.c:1415
> #19 0x0000000000523f6d in PyEval_EvalFrameEx ()
> #20 0x0000000000555551 in PyEval_EvalCodeEx ()
>
>
> This seems to start happening after this commit:
>
> commit e67d3568e5853387d0675ed13d51df21e6274770
> Author: Andrew Bartlett <abartlet at samba.org>
> Date: Fri May 12 02:30:01 2017 +0200
>
> ldb: Use the private event context in ldb_tdb and ldb_wait()
>
> This enables the previous commits, and ensures that ldb_tdb is safe
> from operations while locks
> are held
>
> Signed-off-by: Andrew Bartlett <abartlet at samba.org>
> Reviewed-by: Stefan Metzmacher <metze at samba.org>
> Reviewed-by: Garming Sam <garming at catalyst.net.nz>
>
>
>
> Could you advise me on why this happens, and possible steps to fix?
Look at ldb_ildap.c:ildb_connect():

    ildb->event_ctx = ldb_get_event_context(ldb);

This uses one of those evil nested event loops, which are essentially
banned in Samba and now also in ldb, for causing too much mischief.

Instead, we now create a new event context for every request, so for
every request you need to find the per-request event context and wait
on that instead.

Eliminating long-term variables that hold the event context, and
instead using ldb_handle_get_event_context(req->handle), would make a
big difference.
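
To illustrate why a cached event context leads to the hang, here is a minimal toy model in C. It is only a sketch: every toy_* type and function is a hypothetical stand-in invented for illustration, not the real tevent or ldb API. It assumes the waiter services only the request's own context, so events queued on a context cached at connect time are never dispatched.

```c
#include <assert.h>
#include <stddef.h>

/* Toy stand-ins only: these are NOT the real tevent/ldb types. */
struct toy_event_ctx {
    int pending;                /* number of events queued on this context */
};

struct toy_handle {
    struct toy_event_ctx *ev;   /* per-request event context */
};

struct toy_request {
    struct toy_handle *handle;
    int done;                   /* set once the reply has been processed */
};

/* Models a backend connection that cached a context at connect time,
 * in the style of ildb->event_ctx = ldb_get_event_context(ldb). */
struct toy_conn {
    struct toy_event_ctx *cached_ev;
};

/* Hypothetical stand-in for ldb_handle_get_event_context(req->handle). */
static struct toy_event_ctx *toy_handle_get_event_context(struct toy_handle *h)
{
    assert(h != NULL);
    return h->ev;
}

/* Run one loop iteration: returns 1 if an event was dispatched, 0 if the
 * context is empty (a real event loop would block here, i.e. hang). */
static int toy_loop_once(struct toy_event_ctx *ev, struct toy_request *req)
{
    if (ev->pending == 0) {
        return 0;
    }
    ev->pending--;
    req->done = 1;
    return 1;
}

/* Queue the backend reply, then wait on the request's own context.
 * Returns 1 if the request completes, 0 if the wait would hang. */
static int toy_send_and_wait(struct toy_conn *conn, struct toy_request *req,
                             int use_per_request_ctx)
{
    struct toy_event_ctx *backend_ev = use_per_request_ctx
        ? toy_handle_get_event_context(req->handle)
        : conn->cached_ev;

    backend_ev->pending++;      /* the backend schedules its I/O here */

    /* The waiter only ever services the per-request context. */
    return toy_loop_once(toy_handle_get_event_context(req->handle), req);
}

/* Convenience wrapper: build the toy objects and run a single request. */
static int toy_demo(int use_per_request_ctx)
{
    struct toy_event_ctx global_ev = { 0 };
    struct toy_event_ctx request_ev = { 0 };
    struct toy_handle handle = { &request_ev };
    struct toy_request req = { &handle, 0 };
    struct toy_conn conn = { &global_ev };

    return toy_send_and_wait(&conn, &req, use_per_request_ctx) && req.done;
}
```

Calling toy_demo(0) models the cached-context case, where the wait never sees the event, and toy_demo(1) models looking up the context from the request handle each time. In the real backend the equivalent change would presumably be to drop the long-lived ildb->event_ctx and fetch the context from req->handle wherever I/O is scheduled.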
Andrew Bartlett
--
Andrew Bartlett http://samba.org/~abartlet/
Authentication Developer, Samba Team http://samba.org
Samba Developer, Catalyst IT http://catalyst.net.nz/services/samba