[Samba] Tall tale of woe....

Ross McInnes (Systems) sysrm at stvincent.ac.uk
Mon Dec 15 12:18:23 GMT 2003


Jerry...

> It logs to stdout.

Ah ok so redirect to another file will be in order.

> 
> I think the key will be figuring out which tdb the
> runaway smbd is reading.
> 
> Probably.  Does ifconfig show an abnormal amount of errors?
> If not, then you are probably ok wrt duplex settings, et. al.
> 
> And to clarify, when the smbd starts sucking up CPU, check to
> which client it is connected to and look at the traffic
> pattern from that client to see if the smbd process is doing
> real work on behalf of the client.
> 

no its fine, so thats one less thing to worry about.. or not.

eth0      Link encap:Ethernet  HWaddr 00:06:5B:F2:89:25
          inet addr:172.16.128.254  Bcast:172.16.255.255  Mask:255.255.0.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:1106496 errors:0 dropped:0 overruns:0 frame:0
          TX packets:1078245 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:100
          RX bytes:228665930 (218.0 Mb)  TX bytes:768785456 (733.1 Mb)
          Interrupt:28 Base address:0xdce0 Memory:fe8e0000-fe900000

half way through writing this reply the server just panic'd and halted.

on the screen was (or there abouts)

smbd process PID 19579, stackpage = f300f000

calltrace [<c013e86b>] __kmem_cache_alloc

followed by

e1000_alloc_rx_buffers
e1000_alloc_rx_irq

might put some light onto it.
dont suppose you know where RH writes panics to? i cant seem to find it.

when i look at the samba.log there is nothing untoward

[2003/12/15 11:29:06, 1] smbd/service.c:make_connection(636)
  m6-1 (172.16.175.10) connect to service dmn01 as user dmn01 (uid=1269, 
gid=102) (pid 18746)
[2003/12/15 11:29:07, 0] lib/util_sock.c:read_data(436)
  read_data: read failure for 4. Error = Connection reset by peer
[2003/12/15 11:29:07, 1] smbd/service.c:close_cnum(677)
  m5-3 (172.16.142.30) closed connection to service exams
[2003/12/15 11:29:07, 1] smbd/service.c:close_cnum(677)
  m5-3 (172.16.142.30) closed connection to service shared
[2003/12/15 11:29:07, 1] smbd/service.c:close_cnum(677)
  m5-3 (172.16.142.30) closed connection to service intranet
[2003/12/15 11:29:07, 1] smbd/service.c:close_cnum(677)
  m5-3 (172.16.142.30) closed connection to service winfiles
[2003/12/15 11:29:07, 1] smbd/service.c:close_cnum(677)
  m5-3 (172.16.142.30) closed connection to service netlogon
[2003/12/15 11:29:07, 1] smbd/service.c:close_cnum(677)
  m5-3 (172.16.142.30) closed connection to service ab02
[2003/12/15 11:46:24, 1] smbd/service.c:make_connection(636)
  premises (172.16.180.10) connect to service rsmith as user rsmith 
(uid=1029, gid=101) (pid 890)

m6-8 (172.16.175.80) connect to service pn02 as user pn02 
(uid=2906, gid=102) (pid 19579)
[2003/12/15 11:27:49, 1] smbd/service.c:make_connection(636)

is the offending user/pid nothing untoward in his account or network 
traffic to or from his computer at the time.

unfortunatly i was unaware of the slowdown/problems so i was unable to 
perform strace on the pid.

im guessing it panics when the offending pid is left alone, and not kill 
-9 'd like i normally do.

Many thanks

A perturbed Ross McInnes



More information about the samba mailing list