[Samba] Is samba or a kernel bug causing my FC4 server to crash?

Craig White craigwhite at azapple.com
Mon Nov 28 01:36:09 GMT 2005


On Sun, 2005-11-27 at 20:22 -0500, Jason Welter wrote:
> I've got a fully updated Fedora Core 4 server crashing hard every week or
> two.  I use Samba via smbmount and autofs to read & delete log files on 17 XP
> boxs and 6 NT4SP6 boxes as well as a couple other Windows files servers
> every 5 minutes.  The first indication of a problem I get is smbmount stops
> working, then the server becomes unresponsive to the point where only a
> power slam will fix it, and it does fix it...for a few days.
> 
> I've got  Samba 3.0.14a-2 installed and have been updating my kernel as often as
> a new one is released.  Currently I'm running 2.6.14-1.1637_FC4smp.
> 
> Here are 4 seperate crash excerpts of the sytem log.  Anybody know how to tell
> if samba is involved and if so, if it is responsible? 
> 
> ################################################################################
> Nov  8 17:15:14 poseidon automount[32023]: failed to mount /win/metal10
> Nov  8 17:15:37 poseidon kernel: smb_add_request: request [efeff680, mid=36572] timed out!
> Nov  8 17:15:37 poseidon kernel: smb_writepage_sync: failed write, wsize=4096, write_ret=-5
> Nov  8 17:15:37 poseidon kernel: smb_add_request: request [eb962080, mid=14] timed out!
> Nov  8 17:21:53 poseidon kernel: Unable to handle kernel paging request at virtual address 0600000
> 0
> Nov  8 17:21:53 poseidon kernel:  printing eip:
> Nov  8 17:21:53 poseidon kernel: f8b4b5a4
> Nov  8 17:21:53 poseidon kernel: *pde = 37e1b001
> Nov  8 17:21:53 poseidon kernel: Oops: 0000 [#2]
> Nov  8 17:21:53 poseidon kernel: SMP
> Nov  8 17:21:53 poseidon kernel: Modules linked in: nfs lockd nfs_acl smbfs radeon drm parport_pc
> lp parport autofs4 i2c_dev i2c_core rfcomm l2cap bluetooth sunrpc ipv6 dm_mod video button battery
>  ac uhci_hcd ehci_hcd hw_random shpchp e1000 floppy mptspi sg ext3 jbd megaraid_mbox megaraid_mm m
> ptscsih mptbase sd_mod scsi_mod
> Nov  8 17:21:53 poseidon kernel: CPU:    3
> Nov  8 17:21:53 poseidon kernel: EIP:    0060:[<f8b4b5a4>]    Not tainted VLI
> Nov  8 17:21:53 poseidon kernel: EFLAGS: 00010206   (2.6.13-1.1532_FC4smp)
> Nov  8 17:21:53 poseidon kernel: EIP is at smbiod+0xef/0x184 [smbfs]
> Nov  8 17:21:53 poseidon kernel: eax: 12221400   ebx: d1de9000   ecx: eceb6f98   edx: 0321cf60
> Nov  8 17:21:53 poseidon kernel: esi: 06000000   edi: eceb6000   ebp: eceb6fc4   esp: eceb6fbc
> Nov  8 17:21:53 poseidon kernel: ds: 007b   es: 007b   ss: 0068
> Nov  8 17:21:53 poseidon kernel: Process smbiod (pid: 16251, threadinfo=eceb6000 task=ed2b8aa0)
> Nov  8 17:21:53 poseidon kernel: Stack: f8b4cbd7 eceb6000 00000000 ed2b8aa0 c01347c2 eceb6fd0 eceb
> 6fd0 f8b4b4b5
> Nov  8 17:21:53 poseidon kernel:        00000000 00000000 00000000 c0101ca1 00000000 00000000 0000
> 0000 00000000
> Nov  8 17:21:53 poseidon kernel:        00000000
> Nov  8 17:21:53 poseidon kernel: Call Trace:
> Nov  8 17:21:53 poseidon kernel:  [<c01347c2>] autoremove_wake_function+0x0/0x37
> Nov  8 17:21:53 poseidon kernel:  [<f8b4b4b5>] smbiod+0x0/0x184 [smbfs]
> Nov  8 17:21:53 poseidon kernel:  [<c0101ca1>] kernel_thread_helper+0x5/0xb
> Nov  8 17:21:53 poseidon kernel: Code: 0f 85 90 00 00 00 f0 0f ba 35 6c 48 b5 f8 01 b8 c8 25 b5 f8
>  e8 0c ca 7c c7 8b 1d c0 25 b5 f8 81 fb c0 25 b5 f8 74 79 8b 33 eb 0e <8b> 06 89 f3 81 fe c0 25 b5
>  f8 74 50 89 c6 8b 43 08 85 c0 75 eb
> Nov  8 18:02:42 poseidon syslogd 1.4.1: restart.
> 
> ################################################################################
> Oct 26 09:30:11 poseidon kernel: smb_lookup: find //fabnet failed, error=-5
> Oct 26 09:30:11 poseidon kernel: smb_add_request: request [e1a5e280, mid=104] timed out!
> Oct 26 09:30:11 poseidon kernel: smb_lookup: find //fabnet failed, error=-5
> Oct 26 09:30:11 poseidon kernel: smb_add_request: request [c5610280, mid=65] timed out!
> Oct 26 09:30:11 poseidon kernel: smb_lookup: find //mdsystem failed, error=-5
> Oct 26 09:30:15 poseidon ntpd[2219]: ntpd exiting on signal 15
> Oct 26 09:30:15 poseidon rpc.statd[1782]: Caught signal 15, un-registering and exiting.
> Oct 26 09:30:15 poseidon auditd[1796]: The audit daemon is exiting.
> Oct 26 09:30:15 poseidon kernel: audit(1130333415.760:21310): audit_pid=0 old=1796 by auid=4294967
> 295
> Oct 26 09:30:15 poseidon kernel: audit(1130333415.900:21311): SELinux:  unrecognized netlink messa
> ge type=1009 for sclass=49
> Oct 26 09:30:15 poseidon kernel:
> Oct 26 09:30:15 poseidon kernel: audit(1130333415.900:21311): arch=40000003 syscall=102 success=ye
> s exit=16 a0=b a1=bfc8d790 a2=80510f8 a3=bfc93bb8 items=0 pid=18765 auid=4294967295 uid=0 gid=0 eu
> id=0 suid=0 fsuid=0 egid=0 sgid=0 fsgid=0 comm="auditctl" exe="/sbin/auditctl"
> Oct 26 09:30:15 poseidon kernel: audit(1130333415.900:21311): saddr=100000000000000000000000
> Oct 26 09:30:15 poseidon kernel: audit(1130333415.900:21311): nargs=6 a0=3 a1=bfc91a1c a2=10 a3=0
> a4=bfc93bb8 a5=c
> Oct 26 09:30:16 poseidon kernel: audit(1130333416.000:21312): SELinux:  unrecognized netlink messa
> ge type=1009 for sclass=49
> Oct 26 09:30:16 poseidon kernel:
> Oct 26 09:30:16 poseidon kernel: audit(1130333416.000:21312): arch=40000003 syscall=102 success=ye
> s exit=16 a0=b a1=bfc8d780 a2=80510f8 a3=bfc93ba8 items=0 pid=18765 auid=4294967295 uid=0 gid=0 eu
> id=0 suid=0 fsuid=0 egid=0 sgid=0 fsgid=0 comm="auditctl" exe="/sbin/auditctl"
> Oct 26 09:30:16 poseidon kernel: audit(1130333416.000:21312): saddr=100000000000000000000000  
> Oct 26 09:30:16 poseidon kernel: audit(1130333416.000:21312): nargs=6 a0=3 a1=bfc91a0c a2=10 a3=0
> a4=bfc93ba8 a5=c
> Oct 26 09:30:16 poseidon kernel: Kernel logging (proc) stopped.
> Oct 26 09:30:16 poseidon kernel: Kernel log daemon terminating.
> Oct 26 09:30:17 poseidon exiting on signal 15
> Oct 26 09:33:37 poseidon syslogd 1.4.1: restart.
> 
> ################################################################################
> Oct 29 12:46:10 poseidon kernel: Unable to handle kernel paging request at virtual address 4f3e353
> c
> Oct 29 12:46:10 poseidon kernel:  printing eip:
> Oct 29 12:46:10 poseidon kernel: f8b4b5a4
> Oct 29 12:46:10 poseidon kernel: *pde = 132d2001
> Oct 29 12:46:10 poseidon kernel: Oops: 0000 [#1]
> Oct 29 12:46:10 poseidon kernel: SMP
> Oct 29 12:46:10 poseidon kernel: Modules linked in: loop nfs lockd nfs_acl smbfs radeon drm parpor
> t_pc lp parport autofs4 i2c_dev i2c_core rfcomm l2cap bluetooth sunrpc ipv6 dm_mod video button ba
> ttery ac uhci_hcd ehci_hcd hw_random shpchp e1000 floppy mptspi sg ext3 jbd megaraid_mbox megaraid
> _mm mptscsih mptbase sd_mod scsi_mod
> Oct 29 12:46:10 poseidon kernel: CPU:    2
> Oct 29 12:46:10 poseidon kernel: EIP:    0060:[<f8b4b5a4>]    Not tainted VLI
> Oct 29 12:46:10 poseidon kernel: EFLAGS: 00010206   (2.6.13-1.1526_FC4smp)
> Oct 29 12:46:10 poseidon kernel: EIP is at smbiod+0xef/0x184 [smbfs]
> Oct 29 12:46:10 poseidon kernel: eax: 32312039   ebx: f7b42800   ecx: dcd2ef98   edx: 03214f60
> Oct 29 12:46:10 poseidon kernel: esi: 4f3e353c   edi: dcd2e000   ebp: dcd2efc4   esp: dcd2efbc
> Oct 29 12:46:10 poseidon kernel: ds: 007b   es: 007b   ss: 0068
> Oct 29 12:46:10 poseidon kernel: Process smbiod (pid: 26546, threadinfo=dcd2e000 task=f7961560)
> Oct 29 12:46:10 poseidon kernel: Stack: f8b4cbd7 dcd2e000 00000000 f7961560 c01347c2 dcd2efd0 dcd2
> efd0 f8b4b4b5
> Oct 29 12:46:10 poseidon kernel:        00000000 00000000 00000000 c0101ca1 00000000 00000000 0000
> 0000 00000000
> Oct 29 12:46:10 poseidon kernel:        00000000
> Oct 29 12:46:10 poseidon kernel: Call Trace:
> Oct 29 12:46:10 poseidon kernel:  [<c01347c2>] autoremove_wake_function+0x0/0x37
> Oct 29 12:46:10 poseidon kernel:  [<f8b4b4b5>] smbiod+0x0/0x184 [smbfs]
> Oct 29 12:46:10 poseidon kernel:  [<c0101ca1>] kernel_thread_helper+0x5/0xb
> Oct 29 12:46:10 poseidon kernel: Code: 0f 85 90 00 00 00 f0 0f ba 35 6c 48 b5 f8 01 b8 c8 25 b5 f8
>  e8 cc c8 7c c7 8b 1d c0 25 b5 f8 81 fb c0 25 b5 f8 74 79 8b 33 eb 0e <8b> 06 89 f3 81 fe c0 25 b5
>  f8 74 50 89 c6 8b 43 08 85 c0 75 eb
> Oct 29 12:46:10 poseidon kernel:  <5>smb_lookup: find //34 failed, error=-512
> Oct 29 12:46:10 poseidon kernel: smb_lookup: find //34 failed, error=-512
> Oct 29 12:46:39 poseidon last message repeated 279 times
> Oct 29 12:46:39 poseidon kernel: smb_add_request: request [eee02e80, mid=12] timed out!
> Oct 29 12:46:40 poseidon kernel: smb_lookup: find //34 failed, error=-512
> Oct 29 12:47:09 poseidon last message repeated 288 times
> Oct 29 12:47:09 poseidon kernel: smb_add_request: request [eee02e80, mid=13] timed out!
> Oct 29 13:58:20 poseidon syslogd 1.4.1: restart.
> 
> ################################################################################
> Nov 25 15:05:34 poseidon automount[14437]: failed to mount /win/prober01
> Nov 25 15:05:41 poseidon automount[14451]: >> Error connecting to xxx.xxx.xxx.xxx (No route to host)
> Nov 25 15:05:41 poseidon automount[14451]: >> 14453: Connection to SAW4341 failed
> Nov 25 15:05:41 poseidon automount[14451]: >> SMB connection failed
> Nov 25 15:05:41 poseidon automount[14451]: mount(generic): failed to mount //SAW4341/fabdata (type
>  smbfs) on /win/prober01
> Nov 25 15:05:41 poseidon automount[14451]: failed to mount /win/prober01
> Nov 25 15:07:55 poseidon kernel: BUG: spinlock lockup on CPU#1, smbmnt/14461, f8b7c790 (Not tainte
> d)
> Nov 25 15:07:55 poseidon kernel:  [<c01decc3>] __spin_lock_debug+0xac/0xcf
> Nov 25 15:07:55 poseidon kernel:  [<c01ded32>] _raw_spin_lock+0x4c/0x6a
> Nov 25 15:07:55 poseidon kernel:  [<f8b75251>] smbiod_register_server+0xd/0x39 [smbfs]
> Nov 25 15:07:55 poseidon kernel:  [<f8b743da>] smb_fill_super+0x23b/0x3b5 [smbfs]
> Nov 25 15:07:55 poseidon kernel:  [<c01d9aba>] idr_get_new_above_int+0x5e/0xe9
> Nov 25 15:07:55 poseidon kernel:  [<c017de5f>] get_filesystem+0xf/0x36
> Nov 25 15:07:55 poseidon kernel:  [<c0169d70>] sget+0x161/0x16d
> Nov 25 15:07:55 poseidon kernel:  [<c016a420>] set_anon_super+0x0/0xa1
> Nov 25 15:07:55 poseidon kernel:  [<c016a6cf>] get_sb_nodev+0x37/0x71
> Nov 25 15:07:55 poseidon kernel:  [<c016a84a>] do_kern_mount+0xaf/0x14a
> Nov 25 15:07:55 poseidon kernel:  [<f8b7419f>] smb_fill_super+0x0/0x3b5 [smbfs]
> Nov 25 15:07:55 poseidon kernel:  [<c017f314>] do_new_mount+0x6b/0x90
> Nov 25 15:07:55 poseidon kernel:  [<c017f991>] do_mount+0x18b/0x1a9
> Nov 25 15:07:55 poseidon kernel:  [<c017fd62>] sys_mount+0x77/0xae
> Nov 25 15:07:55 poseidon kernel:  [<c01039e1>] syscall_call+0x7/0xb
> Nov 25 15:57:41 poseidon kernel: input: AT Translated Set 2 keyboard on isa0060/serio0
> Nov 25 16:01:30 poseidon syslogd 1.4.1: restart.
> Nov 25 16:01:30 poseidon kernel: klogd 1.4.1, log source = /proc/kmsg started.
> Nov 25 16:01:30 poseidon kernel: Linux version 2.6.14-1.1637_FC4smp (bhcompile at hs20-bc1-4.build.re
> dhat.com) (gcc version 4.0.1 20050727 (Red Hat 4.0.1-5)) #1 SMP Wed Nov 9 18:34:11 EST 2005
----
smbfs is a kernel module and has nothing to do with samba. You might
want to create a bugzilla entry at http://bugzilla.redhat.com/bugzilla
where it will be looked at.

Craig


-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.



More information about the samba mailing list