[Samba] Samba suddenly restart and replication does not works anymore
Rowland penny
rpenny at samba.org
Sun Apr 3 08:31:48 UTC 2016
On 03/04/16 09:18, Prunk Dump wrote:
> 2016-04-01 23:36 GMT+02:00 Rowland penny <rpenny at samba.org>:
>> On 30/03/16 13:35, Prunk Dump wrote:
>>> Hello Samba team !
>>>
>>> On my network I have three Samba-4.1.17 domain controllers (Debian Jessie)
>>> :
>>> -> One PDC : pdc01
>>> -> Two "slave" DC : sdc02, sdc03
>>>
>>> I don't know why, but sometimes Samba receive the SIGTERM signal and
>>> restart even if I remove it from the logrotate configuration. On
>>> "pdc01" I see :
>>>
>>> ----------
>>> pdc01 (log.samba)
>>> ----------
>>> SIGTERM: killing children
>>> Exiting pid ... on SIGTERM
>>> ...
>>> samba version 4.1.17-Debian started.
>>> ../lib/util/become_daemon.c:136(daemon_ready)
>>> ----------
>>>
>>> After that, the replication stop working. And on the two other DCs I
>>> can see error messages like below. But nothing on the PDC's logs !
>>>
>>> ----------
>>> sdc02 or sdc03 (log.samba)
>>> ----------
>>> ../auth/gensec/gensec.c:247(gensec_update)
>>> Did not manage to negotiate mandetory feature SIGN for dcerpc auth_level 6
>>> ../source4/librpc/rpc/dcerpc_util.c:681(dcerpc_pipe_auth_recv)
>>> Failed to bind to uuid e3514235-4b06-11d1-ab04-00c04fc2dcd2 for
>>>
>>> e3514235-4b06-11d1-ab04-00c04fc2dcd2 at ncacn_ip_tcp:b339b873-f01c-4672-8984-61e1e48422ea._msdcs.mydom.fr[1024,seal,krb5]
>>> NT_STATUS_ACCESS_DENIED
>>> ...
>>> ...
>>> -----
>>>
>>> When I manually restart the two slave DCs the error messages stop. But
>>> the PDC complain that it can't connect to the slave DC (due to the
>>> samba restart) and after, the replication fail on the PDC :
>>>
>>> ----------
>>> pdc01
>>> ----------
>>> (the slave DC restart ... on the PDC I see ...)
>>> ../source4/dsdb/repl/drepl_out_helpers.c:862(dreplsrv_update_refs_done)
>>> UpdateRefs failed with NT_STATUS_END_OF_FILE
>>>
>>> (the slave is restarting, so the PDC cannot make the connection)
>>> ../source4/librpc/rpc/dcerpc_sock.c:262(continue_socket_connect)
>>> Failed to connect host 172.16.0.21 on port 1024 -
>>> NT_STATUS_CONNECTION_REFUSED
>>> ../source4/librpc/rpc/dcerpc_sock.c:425(continue_ip_open_socket)
>>> Failed to connect host 172.16.0.21
>>> (04c6b4b0-4584-4368-831e-42aa7ac08c04._msdcs.mydom.fr) on port 1024 -
>>> NT_STATUS_CONNECTION_REFUSED.
>>> ../source4/librpc/rpc/dcerpc_sock.c:262(continue_socket_connect)
>>> Failed to connect host 172.16.0.21 on port 1024 -
>>> NT_STATUS_CONNECTION_REFUSED
>>> ../source4/librpc/rpc/dcerpc_sock.c:425(continue_ip_open_socket)
>>> Failed to connect host 172.16.0.21
>>> (04c6b4b0-4584-4368-831e-42aa7ac08c04._msdcs.mydom.fr) on port 1024 -
>>> NT_STATUS_CONNECTION_REFUSED.
>>>
>>> (the slave DC is restarted, but the replication does not work )
>>> ../auth/gensec/gensec.c:247(gensec_update)
>>> Did not manage to negotiate mandetory feature SIGN for dcerpc auth_level 6
>>> ../source4/librpc/rpc/dcerpc_util.c:681(dcerpc_pipe_auth_recv)
>>> Failed to bind to uuid e3514235-4b06-11d1-ab04-00c04fc2dcd2 for
>>>
>>> e3514235-4b06-11d1-ab04-00c04fc2dcd2 at ncacn_ip_tcp:04c6b4b0-4584-4368-831e-42aa7ac08c04._msdcs.mydom.fr[1024,seal,krb5]
>>> NT_STATUS_ACCESS_DENIED
>>> ...
>>> ...
>>> (same messages when I restart the other slave DC )
>>> ----------
>>>
>>> So I need to restart the PDC to solve the problem. This very annoying
>>> because I need to check every days, on the three DCs, if the
>>> replication works !
>>>
>>> Does someone understand what's happend ? What makes samba restarting ?
>>> And why the replication stop working ?
>>>
>>> Thanks !
>>>
>>> Baptiste.
>>>
>> First lets get this straight, you do not have a PDC and two slave DCs, you
>> have 3 DCs, apart from the FSMO roles, all DCs are equal and you can share
>> the FSMO roles between your 3 DCs.
>>
>> Having said that, you need to find out what is restarting your first DC, can
>> you post your smb.conf files (or just one, if they are all the same.)
>>
>> Can you also raise the loglevel on the first DC to 10 and then see if there
>> is an obvious reason for the restart.
>>
>> Rowland
>>
>> --
>> To unsubscribe from this list go to the following URL and read the
>> instructions: https://lists.samba.org/mailman/options/samba
> Thanks for your help !
>
> I will raise the log level of the DCs to 10. But as the problem
> appears only one time by month, and as logrotate is disabled, I hope
> that the logs will not be too big. Do you understand what can make
> this series of events ?
>
> 1) pdc01 restart -> sdc02 and sdc03 say "Did not manage to negotiate
> mandatory feature SIGN"
> 2) I restart sdc02 and sdc03 manually -> pdc01 say "Did not manage to
> negotiate mandatory feature SIGN"
> 3) I restart pdc01 again and everything works fine
>
> This is like a machine password desynchronization no ? When logrotate
> was enabled on samba log files the problem appear must more often. So
> it seem related to the samba restart, manually or not.
>
> Il have checked the DC's time clock. No problem.
>
>
> Here my smb.confs.
>
> ###########
> For pdc01
> ###########
>
> [global]
> netbios aliases = sambaaccount sambaaccount.fichnet.fr
> load printers = yes
> workgroup = FICHNET
> realm = FICHNET.FR
> netbios name = FICHDC
> interfaces = lo, eth0
> bind interfaces only = Yes
> server role = active directory domain controller
> server services = s3fs, rpc, nbt, wrepl, ldap, cldap, kdc,
> drepl, winbind, ntp_signd, kcc, dnsupdate
> idmap_ldb:use rfc2307 = yes
>
> [netlogon]
> path = /var/lib/samba/sysvol/fichnet.fr/scripts
> read only = No
>
> [sysvol]
> path = /var/lib/samba/sysvol
> read only = No
>
> [profiles_local]
> path = /fichsamba/smbprofile
> read only = No
> browseable = No
>
> [profiles]
> path = /srv/dfs/profiles
> read only = No
> msdfs root = yes
>
> [homes_local]
> path = /fichsamba/smbhome
> read only = No
> browseable = No
>
> [homes]
> path = /srv/dfs/homes
> read only = No
> msdfs root = yes
>
> [printers]
> path = /var/spool/samba
> printable = yes
> printing = CUPS
>
> [print$]
> path = /srv/samba/Printer_drivers
> comment = Printer Drivers
> writeable = yes
>
> #############
> For sdc02 and sdc03 (in reality fichds01 and fichds02)
> #############
>
> [global]
> workgroup = FICHNET
> realm = net.fichnet.fr
> netbios name = FICHDS01
> server role = active directory domain controller
> server services = s3fs, rpc, nbt, wrepl, ldap, cldap, kdc,
> drepl, winbind, ntp_signd, kcc, dnsupdate
> idmap_ldb:use rfc2307 = yes
>
> [netlogon]
> path = /var/lib/samba/sysvol/fichnet.fr/scripts
> read only = No
>
> [sysvol]
> path = /var/lib/samba/sysvol
> read only = No
>
> [profiles_local]
> path = /fichsamba/smbprofile
> read only = No
> browseable = No
>
> [homes_local]
> path = /fichsamba/smbhome
> read only = No
> browseable = No
>
> Thanks again !
>
OK, I did an internet search and it would seem the problem you are
having is well known, can I suggest you upgrade your version of
Samba,especially as 4.1.x is EOL and will not get any more updates from
Samba
Rowland
More information about the samba
mailing list