[Samba] Is samba or a kernel bug causing my FC4 server to crash?

Jason Welter jwelter at cfl.rr.com
Mon Nov 28 01:22:05 GMT 2005


I've got a fully updated Fedora Core 4 server crashing hard every week or
two.  I use Samba via smbmount and autofs to read & delete log files on 17 XP
boxs and 6 NT4SP6 boxes as well as a couple other Windows files servers
every 5 minutes.  The first indication of a problem I get is smbmount stops
working, then the server becomes unresponsive to the point where only a
power slam will fix it, and it does fix it...for a few days.

I've got  Samba 3.0.14a-2 installed and have been updating my kernel as often as
a new one is released.  Currently I'm running 2.6.14-1.1637_FC4smp.

Here are 4 seperate crash excerpts of the sytem log.  Anybody know how to tell
if samba is involved and if so, if it is responsible? 

################################################################################
Nov  8 17:15:14 poseidon automount[32023]: failed to mount /win/metal10
Nov  8 17:15:37 poseidon kernel: smb_add_request: request [efeff680, mid=36572] timed out!
Nov  8 17:15:37 poseidon kernel: smb_writepage_sync: failed write, wsize=4096, write_ret=-5
Nov  8 17:15:37 poseidon kernel: smb_add_request: request [eb962080, mid=14] timed out!
Nov  8 17:21:53 poseidon kernel: Unable to handle kernel paging request at virtual address 0600000
0
Nov  8 17:21:53 poseidon kernel:  printing eip:
Nov  8 17:21:53 poseidon kernel: f8b4b5a4
Nov  8 17:21:53 poseidon kernel: *pde = 37e1b001
Nov  8 17:21:53 poseidon kernel: Oops: 0000 [#2]
Nov  8 17:21:53 poseidon kernel: SMP
Nov  8 17:21:53 poseidon kernel: Modules linked in: nfs lockd nfs_acl smbfs radeon drm parport_pc
lp parport autofs4 i2c_dev i2c_core rfcomm l2cap bluetooth sunrpc ipv6 dm_mod video button battery
 ac uhci_hcd ehci_hcd hw_random shpchp e1000 floppy mptspi sg ext3 jbd megaraid_mbox megaraid_mm m
ptscsih mptbase sd_mod scsi_mod
Nov  8 17:21:53 poseidon kernel: CPU:    3
Nov  8 17:21:53 poseidon kernel: EIP:    0060:[<f8b4b5a4>]    Not tainted VLI
Nov  8 17:21:53 poseidon kernel: EFLAGS: 00010206   (2.6.13-1.1532_FC4smp)
Nov  8 17:21:53 poseidon kernel: EIP is at smbiod+0xef/0x184 [smbfs]
Nov  8 17:21:53 poseidon kernel: eax: 12221400   ebx: d1de9000   ecx: eceb6f98   edx: 0321cf60
Nov  8 17:21:53 poseidon kernel: esi: 06000000   edi: eceb6000   ebp: eceb6fc4   esp: eceb6fbc
Nov  8 17:21:53 poseidon kernel: ds: 007b   es: 007b   ss: 0068
Nov  8 17:21:53 poseidon kernel: Process smbiod (pid: 16251, threadinfo=eceb6000 task=ed2b8aa0)
Nov  8 17:21:53 poseidon kernel: Stack: f8b4cbd7 eceb6000 00000000 ed2b8aa0 c01347c2 eceb6fd0 eceb
6fd0 f8b4b4b5
Nov  8 17:21:53 poseidon kernel:        00000000 00000000 00000000 c0101ca1 00000000 00000000 0000
0000 00000000
Nov  8 17:21:53 poseidon kernel:        00000000
Nov  8 17:21:53 poseidon kernel: Call Trace:
Nov  8 17:21:53 poseidon kernel:  [<c01347c2>] autoremove_wake_function+0x0/0x37
Nov  8 17:21:53 poseidon kernel:  [<f8b4b4b5>] smbiod+0x0/0x184 [smbfs]
Nov  8 17:21:53 poseidon kernel:  [<c0101ca1>] kernel_thread_helper+0x5/0xb
Nov  8 17:21:53 poseidon kernel: Code: 0f 85 90 00 00 00 f0 0f ba 35 6c 48 b5 f8 01 b8 c8 25 b5 f8
 e8 0c ca 7c c7 8b 1d c0 25 b5 f8 81 fb c0 25 b5 f8 74 79 8b 33 eb 0e <8b> 06 89 f3 81 fe c0 25 b5
 f8 74 50 89 c6 8b 43 08 85 c0 75 eb
Nov  8 18:02:42 poseidon syslogd 1.4.1: restart.

################################################################################
Oct 26 09:30:11 poseidon kernel: smb_lookup: find //fabnet failed, error=-5
Oct 26 09:30:11 poseidon kernel: smb_add_request: request [e1a5e280, mid=104] timed out!
Oct 26 09:30:11 poseidon kernel: smb_lookup: find //fabnet failed, error=-5
Oct 26 09:30:11 poseidon kernel: smb_add_request: request [c5610280, mid=65] timed out!
Oct 26 09:30:11 poseidon kernel: smb_lookup: find //mdsystem failed, error=-5
Oct 26 09:30:15 poseidon ntpd[2219]: ntpd exiting on signal 15
Oct 26 09:30:15 poseidon rpc.statd[1782]: Caught signal 15, un-registering and exiting.
Oct 26 09:30:15 poseidon auditd[1796]: The audit daemon is exiting.
Oct 26 09:30:15 poseidon kernel: audit(1130333415.760:21310): audit_pid=0 old=1796 by auid=4294967
295
Oct 26 09:30:15 poseidon kernel: audit(1130333415.900:21311): SELinux:  unrecognized netlink messa
ge type=1009 for sclass=49
Oct 26 09:30:15 poseidon kernel:
Oct 26 09:30:15 poseidon kernel: audit(1130333415.900:21311): arch=40000003 syscall=102 success=ye
s exit=16 a0=b a1=bfc8d790 a2=80510f8 a3=bfc93bb8 items=0 pid=18765 auid=4294967295 uid=0 gid=0 eu
id=0 suid=0 fsuid=0 egid=0 sgid=0 fsgid=0 comm="auditctl" exe="/sbin/auditctl"
Oct 26 09:30:15 poseidon kernel: audit(1130333415.900:21311): saddr=100000000000000000000000
Oct 26 09:30:15 poseidon kernel: audit(1130333415.900:21311): nargs=6 a0=3 a1=bfc91a1c a2=10 a3=0
a4=bfc93bb8 a5=c
Oct 26 09:30:16 poseidon kernel: audit(1130333416.000:21312): SELinux:  unrecognized netlink messa
ge type=1009 for sclass=49
Oct 26 09:30:16 poseidon kernel:
Oct 26 09:30:16 poseidon kernel: audit(1130333416.000:21312): arch=40000003 syscall=102 success=ye
s exit=16 a0=b a1=bfc8d780 a2=80510f8 a3=bfc93ba8 items=0 pid=18765 auid=4294967295 uid=0 gid=0 eu
id=0 suid=0 fsuid=0 egid=0 sgid=0 fsgid=0 comm="auditctl" exe="/sbin/auditctl"
Oct 26 09:30:16 poseidon kernel: audit(1130333416.000:21312): saddr=100000000000000000000000  
Oct 26 09:30:16 poseidon kernel: audit(1130333416.000:21312): nargs=6 a0=3 a1=bfc91a0c a2=10 a3=0
a4=bfc93ba8 a5=c
Oct 26 09:30:16 poseidon kernel: Kernel logging (proc) stopped.
Oct 26 09:30:16 poseidon kernel: Kernel log daemon terminating.
Oct 26 09:30:17 poseidon exiting on signal 15
Oct 26 09:33:37 poseidon syslogd 1.4.1: restart.

################################################################################
Oct 29 12:46:10 poseidon kernel: Unable to handle kernel paging request at virtual address 4f3e353
c
Oct 29 12:46:10 poseidon kernel:  printing eip:
Oct 29 12:46:10 poseidon kernel: f8b4b5a4
Oct 29 12:46:10 poseidon kernel: *pde = 132d2001
Oct 29 12:46:10 poseidon kernel: Oops: 0000 [#1]
Oct 29 12:46:10 poseidon kernel: SMP
Oct 29 12:46:10 poseidon kernel: Modules linked in: loop nfs lockd nfs_acl smbfs radeon drm parpor
t_pc lp parport autofs4 i2c_dev i2c_core rfcomm l2cap bluetooth sunrpc ipv6 dm_mod video button ba
ttery ac uhci_hcd ehci_hcd hw_random shpchp e1000 floppy mptspi sg ext3 jbd megaraid_mbox megaraid
_mm mptscsih mptbase sd_mod scsi_mod
Oct 29 12:46:10 poseidon kernel: CPU:    2
Oct 29 12:46:10 poseidon kernel: EIP:    0060:[<f8b4b5a4>]    Not tainted VLI
Oct 29 12:46:10 poseidon kernel: EFLAGS: 00010206   (2.6.13-1.1526_FC4smp)
Oct 29 12:46:10 poseidon kernel: EIP is at smbiod+0xef/0x184 [smbfs]
Oct 29 12:46:10 poseidon kernel: eax: 32312039   ebx: f7b42800   ecx: dcd2ef98   edx: 03214f60
Oct 29 12:46:10 poseidon kernel: esi: 4f3e353c   edi: dcd2e000   ebp: dcd2efc4   esp: dcd2efbc
Oct 29 12:46:10 poseidon kernel: ds: 007b   es: 007b   ss: 0068
Oct 29 12:46:10 poseidon kernel: Process smbiod (pid: 26546, threadinfo=dcd2e000 task=f7961560)
Oct 29 12:46:10 poseidon kernel: Stack: f8b4cbd7 dcd2e000 00000000 f7961560 c01347c2 dcd2efd0 dcd2
efd0 f8b4b4b5
Oct 29 12:46:10 poseidon kernel:        00000000 00000000 00000000 c0101ca1 00000000 00000000 0000
0000 00000000
Oct 29 12:46:10 poseidon kernel:        00000000
Oct 29 12:46:10 poseidon kernel: Call Trace:
Oct 29 12:46:10 poseidon kernel:  [<c01347c2>] autoremove_wake_function+0x0/0x37
Oct 29 12:46:10 poseidon kernel:  [<f8b4b4b5>] smbiod+0x0/0x184 [smbfs]
Oct 29 12:46:10 poseidon kernel:  [<c0101ca1>] kernel_thread_helper+0x5/0xb
Oct 29 12:46:10 poseidon kernel: Code: 0f 85 90 00 00 00 f0 0f ba 35 6c 48 b5 f8 01 b8 c8 25 b5 f8
 e8 cc c8 7c c7 8b 1d c0 25 b5 f8 81 fb c0 25 b5 f8 74 79 8b 33 eb 0e <8b> 06 89 f3 81 fe c0 25 b5
 f8 74 50 89 c6 8b 43 08 85 c0 75 eb
Oct 29 12:46:10 poseidon kernel:  <5>smb_lookup: find //34 failed, error=-512
Oct 29 12:46:10 poseidon kernel: smb_lookup: find //34 failed, error=-512
Oct 29 12:46:39 poseidon last message repeated 279 times
Oct 29 12:46:39 poseidon kernel: smb_add_request: request [eee02e80, mid=12] timed out!
Oct 29 12:46:40 poseidon kernel: smb_lookup: find //34 failed, error=-512
Oct 29 12:47:09 poseidon last message repeated 288 times
Oct 29 12:47:09 poseidon kernel: smb_add_request: request [eee02e80, mid=13] timed out!
Oct 29 13:58:20 poseidon syslogd 1.4.1: restart.

################################################################################
Nov 25 15:05:34 poseidon automount[14437]: failed to mount /win/prober01
Nov 25 15:05:41 poseidon automount[14451]: >> Error connecting to xxx.xxx.xxx.xxx (No route to host)
Nov 25 15:05:41 poseidon automount[14451]: >> 14453: Connection to SAW4341 failed
Nov 25 15:05:41 poseidon automount[14451]: >> SMB connection failed
Nov 25 15:05:41 poseidon automount[14451]: mount(generic): failed to mount //SAW4341/fabdata (type
 smbfs) on /win/prober01
Nov 25 15:05:41 poseidon automount[14451]: failed to mount /win/prober01
Nov 25 15:07:55 poseidon kernel: BUG: spinlock lockup on CPU#1, smbmnt/14461, f8b7c790 (Not tainte
d)
Nov 25 15:07:55 poseidon kernel:  [<c01decc3>] __spin_lock_debug+0xac/0xcf
Nov 25 15:07:55 poseidon kernel:  [<c01ded32>] _raw_spin_lock+0x4c/0x6a
Nov 25 15:07:55 poseidon kernel:  [<f8b75251>] smbiod_register_server+0xd/0x39 [smbfs]
Nov 25 15:07:55 poseidon kernel:  [<f8b743da>] smb_fill_super+0x23b/0x3b5 [smbfs]
Nov 25 15:07:55 poseidon kernel:  [<c01d9aba>] idr_get_new_above_int+0x5e/0xe9
Nov 25 15:07:55 poseidon kernel:  [<c017de5f>] get_filesystem+0xf/0x36
Nov 25 15:07:55 poseidon kernel:  [<c0169d70>] sget+0x161/0x16d
Nov 25 15:07:55 poseidon kernel:  [<c016a420>] set_anon_super+0x0/0xa1
Nov 25 15:07:55 poseidon kernel:  [<c016a6cf>] get_sb_nodev+0x37/0x71
Nov 25 15:07:55 poseidon kernel:  [<c016a84a>] do_kern_mount+0xaf/0x14a
Nov 25 15:07:55 poseidon kernel:  [<f8b7419f>] smb_fill_super+0x0/0x3b5 [smbfs]
Nov 25 15:07:55 poseidon kernel:  [<c017f314>] do_new_mount+0x6b/0x90
Nov 25 15:07:55 poseidon kernel:  [<c017f991>] do_mount+0x18b/0x1a9
Nov 25 15:07:55 poseidon kernel:  [<c017fd62>] sys_mount+0x77/0xae
Nov 25 15:07:55 poseidon kernel:  [<c01039e1>] syscall_call+0x7/0xb
Nov 25 15:57:41 poseidon kernel: input: AT Translated Set 2 keyboard on isa0060/serio0
Nov 25 16:01:30 poseidon syslogd 1.4.1: restart.
Nov 25 16:01:30 poseidon kernel: klogd 1.4.1, log source = /proc/kmsg started.
Nov 25 16:01:30 poseidon kernel: Linux version 2.6.14-1.1637_FC4smp (bhcompile at hs20-bc1-4.build.re
dhat.com) (gcc version 4.0.1 20050727 (Red Hat 4.0.1-5)) #1 SMP Wed Nov 9 18:34:11 EST 2005




More information about the samba mailing list