[Samba] PANIC outstanding aio + key does not exist

Isaac Stone isaac.stone at som.com
Thu Nov 19 18:09:45 UTC 2020


Our samba server in ap-east-1 is running very slow today

I noticed three things in log.smb, and I don't understand any of them

----------

#1 roughly every 80 seconds what appears to be a reconnect loop

[2020/11/19 17:35:46.283315,  2]
../../source3/lib/tallocmsg.c:87(register_msg_pool_usage)
  Registered MSG_REQ_POOL_USAGE
[2020/11/19 17:35:46.287132,  1] ../../source3/lib/util.c:1694(name_to_fqdn)
  getaddrinfo: Name or service not known
[2020/11/19 17:35:46.800224,  2]
../../source3/param/loadparm.c:2862(lp_do_section)
  Processing section "[DC]"
[2020/11/19 17:35:46.802682,  2]
../../source3/auth/auth.c:329(auth_check_ntlm_password)
  check_ntlm_password:  authentication for user [srv-acct] -> [srv-acct] ->
[DEVAD\srv-acct] succeeded
[2020/11/19 17:36:06.483196,  2]
../../source3/smbd/service.c:1152(close_cnum)
  somdev-or-cdc-1 (ipv4:10.7.3.228:58542) closed connection to service DC
[2020/11/19 17:36:06.851891,  2]
../../source3/modules/vfs_acl_xattr.c:234(connect_acl_xattr)
  connect_acl_xattr: setting 'inherit acls = true' 'dos filemode = true'
and 'force unknown acl user = true' for service NY
[2020/11/19 17:36:06.852223,  2]
../../source3/smbd/service.c:877(make_connection_snum)
  devad-or-cdc-1 (ipv4:10.7.3.228:58544) connect to service DC initially as
user DEVAD\srv-acct (uid=12187, gid=10513) (pid 3881171)

notes:
- hitting the or-cdc even though `net ads lookup dc` returns the ap-cdc
- uid and gid remain the same, pid changes

google search all I can find is hostname issues. hostname is set in
/etc/hosts correctly

----------

#2 close attempt and PANIC

[2020/11/19 17:37:45.156495,  0] ../../source3/lib/util.c:830(smb_panic_s3)
  PANIC (pid 3881171): can not close with outstanding aio requests
[2020/11/19 17:37:45.156965,  0] ../../lib/util/fault.c:265(log_stack_trace)
  BACKTRACE: 34 stack frames:
   #0 /lib64/libsamba-util.so.0(log_stack_trace+0x34) [0x7f1d05f0e264]
   #1 /lib64/libsmbconf.so.0(smb_panic_s3+0x29) [0x7f1d05969639]
   #2 /lib64/libsamba-util.so.0(smb_panic+0x31) [0x7f1d05f0e361]
   #3 /usr/lib64/samba/libsmbd-base-samba4.so(+0x1d9fff) [0x7f1d05b7ffff]
   #4 /usr/lib64/samba/libsmbd-base-samba4.so(close_file+0x240)
[0x7f1d05b80b10]
   #5 /usr/lib64/samba/libsmbd-base-samba4.so(file_close_user+0x3d)
[0x7f1d05b1bacd]
   #6 /usr/lib64/samba/libsmbd-base-samba4.so(smbXsrv_session_logoff+0x51)
[0x7f1d05bca1e1]
   #7 /usr/lib64/samba/libsmbd-base-samba4.so(+0x22458a) [0x7f1d05bca58a]
   #8 /usr/lib64/samba/libdbwrap-samba4.so(+0x50c4) [0x7f1d03a8e0c4]
   #9 /usr/lib64/samba/libdbwrap-samba4.so(+0x52fe) [0x7f1d03a8e2fe]
   #10 /usr/lib64/samba/libdbwrap-samba4.so(dbwrap_traverse+0xb)
[0x7f1d03a8c85b]
   #11
/usr/lib64/samba/libsmbd-base-samba4.so(smbXsrv_session_logoff_all+0x5b)
[0x7f1d05bca74b]
   #12 /usr/lib64/samba/libsmbd-base-samba4.so(+0x229e5e) [0x7f1d05bcfe5e]
   #13 /usr/lib64/samba/libsmbd-base-samba4.so(+0x22a3c4) [0x7f1d05bd03c4]
   #14 /usr/lib64/samba/libsmbd-shim-samba4.so(exit_server_cleanly+0x18)
[0x7f1d0536ea38]
   #15
/usr/lib64/samba/libsmbd-base-samba4.so(smbd_server_connection_terminate_ex+0x162)
[0x7f1d05baad12]
   #16 /usr/lib64/samba/libsmbd-base-samba4.so(+0x208d19) [0x7f1d05baed19]
   #17 /lib64/libtevent.so.0(tevent_common_invoke_fd_handler+0x83)
[0x7f1d04cf4533]
   #18 /lib64/libtevent.so.0(+0xdacf) [0x7f1d04cfaacf]
   #19 /lib64/libtevent.so.0(+0xbadb) [0x7f1d04cf8adb]
   #20 /lib64/libtevent.so.0(_tevent_loop_once+0x95) [0x7f1d04cf3c05]
   #21 /lib64/libtevent.so.0(tevent_common_loop_wait+0x1b) [0x7f1d04cf3eab]
   #22 /lib64/libtevent.so.0(+0xba6b) [0x7f1d04cf8a6b]
   #23 /usr/lib64/samba/libsmbd-base-samba4.so(smbd_process+0x7bf)
[0x7f1d05b9d0ef]
   #24 /usr/sbin/smbd(+0xdd20) [0x55b693cd7d20]
   #25 /lib64/libtevent.so.0(tevent_common_invoke_fd_handler+0x83)
[0x7f1d04cf4533]
   #26 /lib64/libtevent.so.0(+0xdacf) [0x7f1d04cfaacf]
   #27 /lib64/libtevent.so.0(+0xbadb) [0x7f1d04cf8adb]
   #28 /lib64/libtevent.so.0(_tevent_loop_once+0x95) [0x7f1d04cf3c05]
   #29 /lib64/libtevent.so.0(tevent_common_loop_wait+0x1b) [0x7f1d04cf3eab]
   #30 /lib64/libtevent.so.0(+0xba6b) [0x7f1d04cf8a6b]
   #31 /usr/sbin/smbd(main+0x1bd2) [0x55b693cd22c2]
   #32 /lib64/libc.so.6(__libc_start_main+0xf3) [0x7f1d0472d7b3]
   #33 /usr/sbin/smbd(_start+0x2e) [0x55b693cd267e]
[2020/11/19 17:37:45.157318,  0] ../../source3/lib/dumpcore.c:318(dump_core)
  coredump is handled by helper binary specified at
/proc/sys/kernel/core_pattern
[2020/11/19 17:38:09.237006,  2]
../../source3/smbd/process.c:2888(deadtime_fn)
  Closing idle connection

then directly back into the same loop as #1

----------

#3 verify record

shortly after the PANIC above

[2020/11/19 17:38:10.206414,  2]
../../source3/auth/auth.c:329(auth_check_ntlm_password)
  check_ntlm_password:  authentication for user [srv-acct] -> [srv-acct] ->
[DEVAD\srv-acct] succeeded
[2020/11/19 17:38:10.207772,  2]
../../source3/smbd/smbXsrv_session.c:881(smbXsrv_session_global_verify_record)
  smbXsrv_session_global_verify_record: key '65EB71F1' server_id 0:3881171
does not exist.
[2020/11/19 17:38:10.207802,  2]
../../source3/smbd/smbXsrv_session.c:882(smbXsrv_session_global_verify_record)
[2020/11/19 17:38:10.207809,  1] ../../librpc/ndr/ndr.c:433(ndr_print_debug)
       &global_blob: struct smbXsrv_session_globalB
          version                  : SMBXSRV_VERSION_0 (0)
          seqnum                   : 0x00000006 (6)
          info                     : union smbXsrv_session_globalU(case 0)
          info0                    : *
              info0: struct smbXsrv_session_global0
                  db_rec                   : NULL
                  session_global_id        : 0x65eb71f1 (1709928945)
                  session_wire_id          : 0x0000000065eb71f1 (1709928945)
                  creation_time            : Thu Nov 19 05:35:46 PM 2020 UTC
                  expiration_time          : Thu Sep 14 02:48:05 AM 30828
UTC
                  auth_time                : Thu Nov 19 05:35:47 PM 2020 UTC
                  auth_session_info_seqnum : 0x00000001 (1)
                  auth_session_info        : *
                      auth_session_info: struct auth_session_info
                          security_token           : *
                              security_token: struct security_token
                                  num_sids                 : 0x00000017 (23)
                                  sids: ARRAY(23)
                                      sids                   [ ... ]
                                  privilege_mask           :
0x000000001ffffff0 (536870896)
                                         1: SEC_PRIV_MACHINE_ACCOUNT_BIT
                                         1: SEC_PRIV_PRINT_OPERATOR_BIT
                                         1: SEC_PRIV_ADD_USERS_BIT
                                         1: SEC_PRIV_DISK_OPERATOR_BIT
                                         1: SEC_PRIV_REMOTE_SHUTDOWN_BIT
                                         1: SEC_PRIV_BACKUP_BIT
                                         1: SEC_PRIV_RESTORE_BIT
                                         1: SEC_PRIV_TAKE_OWNERSHIP_BIT
                                         1: SEC_PRIV_INCREASE_QUOTA_BIT
                                         1: SEC_PRIV_SECURITY_BIT
                                         1: SEC_PRIV_LOAD_DRIVER_BIT
                                         1: SEC_PRIV_SYSTEM_PROFILE_BIT
                                         1: SEC_PRIV_SYSTEMTIME_BIT
                                         1:
SEC_PRIV_PROFILE_SINGLE_PROCESS_BIT
                                         1:
SEC_PRIV_INCREASE_BASE_PRIORITY_BIT
                                         1: SEC_PRIV_CREATE_PAGEFILE_BIT
                                         1: SEC_PRIV_SHUTDOWN_BIT
                                         1: SEC_PRIV_DEBUG_BIT
                                         1: SEC_PRIV_SYSTEM_ENVIRONMENT_BIT
                                         1: SEC_PRIV_CHANGE_NOTIFY_BIT
                                         1: SEC_PRIV_UNDOCK_BIT
                                         1: SEC_PRIV_ENABLE_DELEGATION_BIT
                                         1: SEC_PRIV_MANAGE_VOLUME_BIT
                                         1: SEC_PRIV_IMPERSONATE_BIT
                                         1: SEC_PRIV_CREATE_GLOBAL_BIT
                                  rights_mask              : 0x00000000 (0)
                                         0: LSA_POLICY_MODE_INTERACTIVE
                                         0: LSA_POLICY_MODE_NETWORK
                                         0: LSA_POLICY_MODE_BATCH
                                         0: LSA_POLICY_MODE_SERVICE
                                         0: LSA_POLICY_MODE_PROXY
                                         0: LSA_POLICY_MODE_DENY_INTERACTIVE
                                         0: LSA_POLICY_MODE_DENY_NETWORK
                                         0: LSA_POLICY_MODE_DENY_BATCH
                                         0: LSA_POLICY_MODE_DENY_SERVICE
                                         0:
LSA_POLICY_MODE_REMOTE_INTERACTIVE
                                         0:
LSA_POLICY_MODE_DENY_REMOTE_INTERACTIVE
                                      0x00: LSA_POLICY_MODE_ALL       (0)
                                      0x00: LSA_POLICY_MODE_ALL_NT4   (0)
                          unix_token               : *
                              unix_token: struct security_unix_token
                                  uid                      :
0x0000000000002f9b (12187)
                                  gid                      :
0x0000000000002911 (10513)
                                  ngroups                  : 0x0000000b (11)
                                  groups: ARRAY(11)
                                      groups                   [ ... ]
                          info                     : *
                              info: struct auth_user_info
                                  account_name             : *
                                      account_name             : 'srv-acct'
                                  user_principal_name      : NULL
                                  user_principal_constructed: 0x00 (0)
                                  domain_name              : *
                                      domain_name              : 'DEVAD'
                                  dns_domain_name          : NULL
                                  full_name                : *
                                      full_name                : ''
                                  logon_script             : *
                                      logon_script             : ''
                                  profile_path             : *
                                      profile_path             : ''
                                  home_directory           : *
                                      home_directory           : ''
                                  home_drive               : *
                                      home_drive               : ''
                                  logon_server             : *
                                      logon_server             :
'DEVAD-AP-CDC-1'
                                  last_logon               : Thu Nov 12
04:06:15 PM 2020 UTC
                                  last_logoff              : Thu Sep 14
02:48:05 AM 30828 UTC
                                  acct_expiry              : Thu Sep 14
02:48:05 AM 30828 UTC
                                  last_password_change     : Tue Sep  8
07:44:38 PM 2020 UTC
                                  allow_password_change    : Wed Sep  9
07:44:38 PM 2020 UTC
                                  force_password_change    : Thu Sep 14
02:48:05 AM 30828 UTC
                                  logon_count              : 0x0104 (260)
                                  bad_password_count       : 0x0000 (0)
                                  acct_flags               : 0x00000210
(528)
                                  authenticated            : 0x01 (1)
                          unix_info                : *
                              unix_info: struct auth_user_info_unix
                                  unix_name                : *
                                      unix_name                :
'DEVAD\srv-acct'
                                  sanitized_username       : *
                                      sanitized_username       : 'srv-acct'
                          torture                  : NULL
                          credentials              : NULL
                          unique_session_token     :
26addcdb-1953-4652-a7de-8c12952e525c
                  connection_dialect       : 0x0311 (785)
                  signing_flags            : 0x06 (6)
                         0: SMBXSRV_SIGNING_REQUIRED
                         1: SMBXSRV_PROCESSED_SIGNED_PACKET
                         1: SMBXSRV_PROCESSED_UNSIGNED_PACKET
                  encryption_flags         : 0x08 (8)
                         0: SMBXSRV_ENCRYPTION_REQUIRED
                         0: SMBXSRV_ENCRYPTION_DESIRED
                         0: SMBXSRV_PROCESSED_ENCRYPTED_PACKET
                         1: SMBXSRV_PROCESSED_UNENCRYPTED_PACKET
                  signing_key              : NULL
                  encryption_key           : NULL
                  decryption_key           : NULL
                  num_channels             : 0x00000001 (1)
                  channels: ARRAY(1)
                      channels: struct smbXsrv_channel_global0
                          server_id: struct server_id
                              pid                      : 0x00000000003b38d3
(3881171)
                              task_id                  : 0x00000000 (0)
                              vnn                      : 0x00000000 (0)
                              unique_id                : 0xb2fe48f46a0cc1c4
(-5548917476285431356)
                          local_address            : 'ipv4:10.15.22.214:445'
                          remote_address           : 'ipv4:10.7.3.228:58544'
                          remote_name              : '10.7.3.228'
                          signing_key              : NULL
                          auth_session_info_seqnum : 0x00000001 (1)
                          connection               : NULL
                          encryption_cipher        : 0x0002 (2)
[2020/11/19 17:38:10.576763,  2]
../../source3/modules/vfs_acl_xattr.c:234(connect_acl_xattr)
  connect_acl_xattr: setting 'inherit acls = true' 'dos filemode = true'
and 'force unknown acl user = true' for service DC
[2020/11/19 17:38:11.818899,  2]
../../source3/smbd/service.c:877(make_connection_snum)
  somdev-or-cdc-1 (ipv4:10.7.3.228:58549) connect to service DC initially
as user DEVAD\srv-acct (uid=12187, gid=10513) (pid 3883266)

then continuing as before.

I cannot find information about the PANIC or
smbXsrv_session_global_verify_record

--------

Samba version 4.12.3

---------
#======================= Global Settings
=====================================
[global]
netbios name = OFS-C25-USE1
realm = DEVAD.LOCAL
workgroup = DEVAD

security = ads

clustering = yes
fake oplocks = no

log level = 2

idmap config * : backend = tdb
idmap config * : range = 3000-7999
idmap config DEVAD: backend = rid
idmap config DEVAD: range = 10000-999999

dedicated keytab file = /etc/krb5.keytab
kerberos method = secrets and keytab

map acl inherit = yes

winbind refresh tickets = yes

# disables printing:

load printers = no
printing = bsd
printcap name = /dev/null
disable spoolss = yes

bind interfaces only = yes
interfaces = lo eth0

#============================ Share Definitions
==============================
[DC]
path = "/mnt/use1/DC"
writeable = yes
browsable = yes
fileid:algorithm = fsname_norootdir
vfs objects = fileid shadow_copy2 acl_xattr
acl_xattr:ignore system acls = yes
shadow:mountpoint = /mnt/use1
shadow:snapdir = /snapshots/use1
shadow:snapsharepath = DC
shadow:format = %Y-%m-%dT%H:%M:%SZ


----------

servers with the same configuration are running in four regions. I am
suspicious of some kind of timeout happening. The us-west-2 is noticeably
slow as well but still functional. ap-east-1 is taking 20 minutes just to
list the root directory from Windows Explorer, and failing more often than
not.

Possibly related, there is a very large copy job going on in us-east-1. We
are running a global filesystem using ctdb to synchronize locks. Last I
checked there were more than 200 locks held and it took nearly three
minutes to run `smbstatus -L`

Any advice is appreciated.

 - isaac


More information about the samba mailing list