[Samba] Re: Intermittent "internal error: signal 11" with 3.0.24

Joe Murphy joe.murphy at clear.net.nz
Thu Jun 14 22:13:34 GMT 2007


Hi all

Follow up to this post, as it didn't seem to go through the
first time. We have a common problem occurring with our
Samba setups. We run 3 (identical) processing environments
that each contain a Samba host sharing approx .5TB of data
to 6 Wintel machines.

Normally these hosts operate fine, though we intermittently
experience a Samba panic (intermittent = every 2-3 days) as
per below. Eventually this will kill the host and require a
restart to clear.

Our issues appear to have started when we upgraded our Samba
to 3.0.24. We begun recording panic messages in syslog
following the upgrade. Going up to 3.0.25a is an option
though we'd like to do it with a bit of an idea that it will
fix things.

We've tried a range of things to reproduce and isolate the
issue; load testing, re-processing batches that experienced
problems - so far without success. Any suggestions
appreciated.

sambaserver:~ # gdb /usr/sbin/smbd 1778
...
(gdb) bt
#0  0xffffe410 in ?? ()
#1  0x00000001 in ?? ()
#2  0x00000000 in ?? ()
#3  0xbfffcfd8 in ?? ()
#4  0x402b36e3 in __waitpid_nocancel () from
/lib/tls/libc.so.6
#5  0x4025ef58 in do_system () from /lib/tls/libc.so.6
#6  0x402268dd in system () from /lib/tls/libpthread.so.0
#7  0x0822b612 in smb_panic (why=0x0) at lib/util.c:1608
#8  0x08219b3f in fault_report (sig=-512) at lib/fault.c:47
#9  0x08219b50 in sig_fault (sig=-512) at lib/fault.c:70
#10 <signal handler called>
#11 0x40292d43 in strlen () from /lib/tls/libc.so.6
#12 0x40268242 in vfprintf () from /lib/tls/libc.so.6
#13 0x40285e76 in vsnprintf () from /lib/tls/libc.so.6
#14 0x08219956 in dbgtext (format_str=0x36373020 <Address
0x36373020 out of bounds>) at lib/debug.c:1011
#15 0x0825b360 in oplock_timeout_handler (te=0x844d838,
now=0xbfffdfc0, private_data=0x844e498) at smbd/oplock.c:351
#16 0x08242d7d in run_events () at lib/events.c:102
#17 0x080f2801 in receive_message_or_smb (buffer=0x40433008
"", buffer_len=131137, timeout=60000) at smbd/process.c:457
#18 0x080f4122 in smbd_process () at smbd/process.c:1649
#19 0x082beea9 in main (argc=909586464, argv=0xbfffe334) at
smbd/server.c:1024


This relates to the following panic message recorded in
syslog:

Jun 14 15:36:56 sambaserver smbd[1778]: [2007/06/14
15:36:56, 0] printing/print_cups.c:cups_cache_reload(85)
Jun 14 15:36:56 sambaserver smbd[1778]:   Unable to connect
to CUPS server localhost - Connection refused
Jun 14 15:36:56 sambaserver smbd[1778]: [2007/06/14
15:36:56, 0] printing/print_cups.c:cups_cache_reload(85)
Jun 14 15:36:56 sambaserver smbd[1778]:   Unable to connect
to CUPS server localhost - Connection refused
Jun 14 15:38:02 sambaserver smbd[1778]: [2007/06/14
15:38:02, 0] tdb/tdbutil.c:tdb_log(783)
Jun 14 15:38:02 sambaserver smbd[1778]:  
tdb(/var/lib/samba/locking.tdb): tdb_lock failed on list
2872 ltype=1 (Resource deadlock avoided)
Jun 14 15:38:02 sambaserver smbd[1778]: [2007/06/14
15:38:02, 0] smbd/close.c:close_remove_share_mode(164)
Jun 14 15:38:02 sambaserver smbd[1778]:  
close_remove_share_mode: Could not get share mode lock for
file Templates/TI.dmsft
Jun 14 15:38:03 sambaserver smbd[1778]: [2007/06/14
15:38:03, 0] tdb/tdbutil.c:tdb_log(783)
Jun 14 15:38:03 sambaserver smbd[1778]:  
tdb(/var/lib/samba/locking.tdb): tdb_lock failed on list
1324 ltype=1 (Resource deadlock avoided)
Jun 14 15:38:03 sambaserver smbd[1778]: [2007/06/14
15:38:03, 0] smbd/close.c:close_remove_share_mode(164)
Jun 14 15:38:03 sambaserver smbd[1778]:  
close_remove_share_mode: Could not get share mode lock for
file Templates/TI.dmsft
Jun 14 15:38:32 sambaserver smbd[1778]: [2007/06/14
15:38:32, 0] smbd/oplock.c:oplock_timeout_handler(351)
Jun 14 15:38:32 sambaserver smbd[1778]: [2007/06/14
15:38:32, 0] lib/fault.c:fault_report(41)
Jun 14 15:38:32 sambaserver smbd[1778]:  
===============================================================
Jun 14 15:38:32 sambaserver smbd[1778]: [2007/06/14
15:38:32, 0] lib/fault.c:fault_report(42)
Jun 14 15:38:32 sambaserver smbd[1778]:   INTERNAL ERROR:
Signal 11 in pid 1778 (3.0.24-SerNet-SuSE)
Jun 14 15:38:32 sambaserver smbd[1778]:   Please read the
Trouble-Shooting section of the Samba3-HOWTO
Jun 14 15:38:32 sambaserver smbd[1778]: [2007/06/14
15:38:32, 0] lib/fault.c:fault_report(44)
Jun 14 15:38:32 sambaserver smbd[1778]:
Jun 14 15:38:32 sambaserver smbd[1778]:   From:
http://www.samba.org/samba/docs/Samba3-HOWTO.pdf
Jun 14 15:38:32 sambaserver smbd[1778]: [2007/06/14
15:38:32, 0] lib/fault.c:fault_report(45)
Jun 14 15:38:32 sambaserver smbd[1778]:  
===============================================================
Jun 14 15:38:32 sambaserver smbd[1778]: [2007/06/14
15:38:32, 0] lib/util.c:smb_panic(1599)
Jun 14 15:38:32 sambaserver smbd[1778]:   PANIC (pid 1778):
internal error
Jun 14 15:38:32 sambaserver smbd[1778]: [2007/06/14
15:38:32, 0] lib/util.c:log_stack_trace(1706)
Jun 14 15:38:32 sambaserver smbd[1778]:   BACKTRACE: 14
stack frames:
Jun 14 15:38:32 sambaserver smbd[1778]:    #0
/usr/sbin/smbd(log_stack_trace+0x22) [0x822b6fb]
Jun 14 15:38:32 sambaserver smbd[1778]:    #1
/usr/sbin/smbd(smb_panic+0x6f) [0x822b59a]
Jun 14 15:38:32 sambaserver smbd[1778]:    #2 /usr/sbin/smbd
[0x8219b3f]
Jun 14 15:38:32 sambaserver smbd[1778]:    #3 /usr/sbin/smbd
[0x8219b50]
Jun 14 15:38:32 sambaserver smbd[1778]:    #4 [0xffffe420]
Jun 14 15:38:32 sambaserver smbd[1778]:    #5
/lib/tls/libc.so.6(vsnprintf+0xb6) [0x40285e76]
Jun 14 15:38:32 sambaserver smbd[1778]:    #6
/usr/sbin/smbd(dbgtext+0x2e) [0x8219956]
Jun 14 15:38:32 sambaserver smbd[1778]:    #7 /usr/sbin/smbd
[0x825b360]
Jun 14 15:38:32 sambaserver smbd[1778]:    #8
/usr/sbin/smbd(run_events+0x15f) [0x8242d7d]
Jun 14 15:38:32 sambaserver smbd[1778]:    #9 /usr/sbin/smbd
[0x80f2801]
Jun 14 15:38:32 sambaserver smbd[1778]:    #10
/usr/sbin/smbd(smbd_process+0x10e) [0x80f4122]
Jun 14 15:38:32 sambaserver smbd[1778]:    #11
/usr/sbin/smbd(main+0x946) [0x82beea9]
Jun 14 15:38:32 sambaserver smbd[1778]:    #12
/lib/tls/libc.so.6(__libc_start_main+0xd0) [0x40240210]
Jun 14 15:38:32 sambaserver smbd[1778]:    #13
/usr/sbin/smbd [0x808ceb1]
Jun 14 15:38:32 sambaserver smbd[1778]: [2007/06/14
15:38:32, 0] lib/util.c:smb_panic(1607)
Jun 14 15:38:32 sambaserver smbd[1778]:   smb_panic():
calling panic action [/bin/sleep 90000]


Versions
Distro: SLES9 (SP3)
Kernel: 2.6.5-7.97-bigsmp
Samba: Version 3.0.24-SerNet-SuSE

Samba config below (<ad_domain> changed, but orig followed
sub.dom.tld format):
----------------------------
[global]
   workgroup = <ad_domain>
   domain master = no
   local master = no
   preferred master = no
   os level = 0
   username map = /etc/samba/smbusers
   map to guest = Bad User
   logon path = \\%L\profiles\.msprofile
   logon home = \\%L\%U\.9xprofile
   logon drive = P:
   security = ads
   realm = <ad_domain>
   encrypt passwords = yes
   idmap uid = 10000-20000
   idmap gid = 10000-20000
   template primary group = "Domain Users"
   template shell = /bin/bash
   winbind separator = +
   winbind enum users = yes
   winbind enum groups = yes
   winbind use default domain = yes
   password server = prod1.<ad_domain> prod2.<ad_domain>
prod3.<ad_domain>
   log level = 3
   panic action = "/bin/sleep 90000"

[Data]
   comment = Data directory
   path = /data01/Data
   read only = No
   inherit permissions = Yes
   directory mask = 0755
   create mask = 0755

----------------------------

Much appreciated.

Joe Murphy
joe.murphy at clear.net.nz



More information about the samba mailing list