[PATCH] GUID index for LDB
abartlet at samba.org
Fri Sep 8 09:07:58 UTC 2017
On Fri, 2017-09-08 at 10:36 +0200, Stefan Metzmacher wrote:
> Am 08.09.2017 um 05:56 schrieb Andrew Bartlett:
> > Control points for choice of index mode
> > ---------------------------------------
> > The choice of index and TDB key mode is made based (for example, from
> > Samba) on entries in the @INDEXLIST DN:
> > dn: @INDEXLIST
> > @IDXGUID: objectGUID
> > @IDX_DN_GUID: GUID
> > By default, the original DN format is used.
> So we're upgrading the database on first use with the new code?
Yes, when Samba 'fixes' the @INDEXLIST on schema load this will be
> My fear with this is that a simple package upgrade will make
> a dc with a large database unusable for quite some time.
> Can you please check the cost of an upgrade for databases with
> 1.) 5000 users, 5000 computers and 5000 groups
> 2.) 20000 users, 20000 computers and 20000 groups
> 3.) with the numbers of the largest known customer size
> I guess rewriting the whole database consumes quite some cpu
> and also memory. A server may run out of memory while doing this
> as we need more than twice the size of all sam.ldb* databases together.
I understand your fears, but to be clear, a database of 100,000 users
was only 860MB with the new code.
In terms of existing deployment scale, the indeed network that Kevin
presented at SambaXP is the largest production deployment I know of and
before Samba 4.7 significantly larger deployments are un-feasible due
to the O(n^2) handling on links at join time. (They take him ~ 30-45
* 45 domain controllers on 5 continents
* 6,252 User/Group objects ,
* 37,625 group memberships
> I think I'd prefer making the switch for existing databases an
> explicit task for the admin.
I understand the concern, and I changed the index re-write code not to
force the set of DSDB_FEATURES_SUPPORTED flag for 4.7 for this reason,
meaning only new databases support 'features'. A similar arrangement
could be made if needed.
The reverse concern I have is that if we do that, we have to maintain
and test Samba in both modes in perpetuity, particularly as we start to
structure our code to try to be GUID rather than string DN based.
Either way, it is Samba (not LDB) that controls when this is enabled,
and this and the auto-upgrade is only proposed to be enabled on a major
version upgrade, for Samba 4.8. I will naturally make any change we
make here very clear in the WHATSNEW.
Most installations (from the discussions on the list) upgrade by
joining a new DC to the domain. The few that don't just upgrade one DC
in a pair, then the other. We hear about these because folks regularly
ask if the versions need to be upgraded in sync.
Garming keeps a 40,000 user test DB around, so I'll get those numbers
from that and hope the above helps address your concerns.
Thankfully, the new code (actually everything since 1.2.2) is much
faster to re-index than the old, as I removed an O(n^2) loop.
Thanks for your continued thoughts on this!
Andrew Bartlett http://samba.org/~abartlet/
Authentication Developer, Samba Team http://samba.org
Samba Developer, Catalyst IT http://catalyst.net.nz/services/samba
More information about the samba-technical