[ctdb]CTDB CRASH

guoleia at 139.com guoleia at 139.com
Thu Jan 19 01:25:29 MST 2012


Hi folks,

I'm using ctdb for a samba cluster.
ctdb version:1.0.114-1
smbd version 3.5.6
Linux server:suse10

After add a new node to the cluster , finding the abnoral situation
1、There're running cluster(including node_A,node_B,node_C), the master primary node is node_A.
2、Adding a new node(node_D), and append the ip of node_D to the "/etc/ctdb/nodes" file in node_A and node_B and node_C,
make the file which's in the catalog "/etc/ctdb/nodes" of node_D consistent with the others.
3、Run the command "ctdb reloadnodes" on node_A, meanwhile running the command "service ctdb start" on node_D , finding that the operation will cause ctdb process crash in some nodes.

***************************************************************
Program terminated with signal 6, Aborted.
#0  0x00002b5d9ce0dbb5 in raise () from /lib64/libc.so.6
(gdb) bt
#0  0x00002b5d9ce0dbb5 in raise () from /lib64/libc.so.6
#1  0x00002b5d9ce0efb0 in abort () from /lib64/libc.so.6
#2  0x0000000000452975 in smb_panic (why=0x480d7c "internal error") at lib/util/fault.c:163
#3  0x0000000000452a88 in fault_report (sig=11) at lib/util/fault.c:180
#4  0x0000000000452abd in sig_fault (sig=11) at lib/util/fault.c:195
#5  <signal handler called>
#6  0x0000000000445b62 in ctdb_canonicalize_ip (ip=0x7348a00000002, cip=0x7fff0deeab90) at common/ctdb_util.c:504
#7  0x0000000000445c3f in ctdb_same_ip (tip1=0x7348a00000002, tip2=0x2b5d9d4a7fc0) at common/ctdb_util.c:522
#8  0x0000000000445d47 in ctdb_same_sockaddr (ip1=0x7348a00000002, ip2=0x2b5d9d4a7fc0) at common/ctdb_util.c:549
#9  0x000000000042eca6 in ctdb_tcp_find (array=0x2b5d9d477fc0, tcp=0x2b5d9d4a7fc0) at server/ctdb_takeover.c:1684
#10 0x000000000042f3a8 in ctdb_remove_tcp_connection (ctdb=0x2b5d9d04bc10, conn=0x2b5d9d4a7fc0) at server/ctdb_takeover.c:1795
#11 0x000000000042f68c in ctdb_takeover_client_destructor_hook (client=0x2b5d9d4effc0) at server/ctdb_takeover.c:1848
#12 0x0000000000404d34 in ctdb_client_destructor (client=0x2b5d9d4effc0) at server/ctdb_daemon.c:185
#13 0x0000000000454056 in _talloc_free (ptr=0x2b5d9d4effc0) at ./lib/talloc/talloc.c:372
#14 0x00000000004549ff in talloc_free (ptr=0x2b5d9d4effc0) at ./lib/talloc/talloc.c:752
#15 0x0000000000405b1b in ctdb_daemon_read_cb (data=0x0, cnt=0, args=0x2b5d9d4effc0) at server/ctdb_daemon.c:518
#16 0x0000000000443c1c in queue_io_read (queue=0x2b5d9d45dfa0) at common/ctdb_io.c:167
#17 0x000000000044405c in queue_io_handler (ev=0x2b5d9d10efd0, fde=0x2b5d9d489fc0, flags=1, private_data=0x2b5d9d45dfa0) at common/ctdb_io.c:233
#18 0x0000000000465b0d in epoll_event_loop (std_ev=0x2b5d9d110fe0, tvalp=0x7fff0deeaf20) at lib/events/events_standard.c:309
#19 0x0000000000466356 in std_event_loop_once (ev=0x2b5d9d10efd0) at lib/events/events_standard.c:550
#20 0x0000000000463811 in event_loop_once (ev=0x2b5d9d10efd0) at lib/events/events.c:294
#21 0x000000000042929e in ctdb_event_script_args (ctdb=0x2b5d9d04bc10, call=CTDB_EVENT_RELEASE_IP, fmt=0x477b2e "%s %s %u") at server/eventscript.c:802
#22 0x000000000042f737 in ctdb_release_all_ips (ctdb=0x2b5d9d04bc10) at server/ctdb_takeover.c:1868
#23 0x000000000041e3c0 in ctdb_control_dispatch (ctdb=0x2b5d9d04bc10, c=0x2b5d9d6d8fc0, indata={dptr = 0x2b5d9d6d8ffc "", dsize = 0}, outdata=0x2b5d9d6deff0, srcnode=0, errormsg=0x7fff0deeb1c0, async_reply=0x7fff0deeb1cf)
    at server/ctdb_control.c:285
#24 0x0000000000420115 in ctdb_request_control (ctdb=0x2b5d9d04bc10, hdr=0x2b5d9d6d8fc0) at server/ctdb_control.c:643
#25 0x000000000041c0ee in ctdb_input_pkt (ctdb=0x2b5d9d04bc10, hdr=0x2b5d9d6d8fc0) at server/ctdb_server.c:401
#26 0x000000000041c4fb in queue_next_trigger (ev=0x2b5d9d10efd0, te=0x2b5d9d6dafc0, t={tv_sec = 0, tv_usec = 0}, private_data=0x2b5d9d6ceff0) at server/ctdb_server.c:484
#27 0x00000000004652b8 in common_event_loop_timer_delay (ev=0x2b5d9d10efd0) at lib/events/events_timed.c:152
#28 0x0000000000466312 in std_event_loop_once (ev=0x2b5d9d10efd0) at lib/events/events_standard.c:543
#29 0x00000000004663b3 in std_event_loop_wait (ev=0x2b5d9d10efd0) at lib/events/events_standard.c:569
#30 0x0000000000463830 in event_loop_wait (ev=0x2b5d9d10efd0) at lib/events/events.c:302
#31 0x0000000000406960 in ctdb_start_daemon (ctdb=0x2b5d9d04bc10, do_fork=true, use_syslog=false) at server/ctdb_daemon.c:854
#32 0x00000000004047a9 in main (argc=4, argv=0x7fff0deebb28) at server/ctdbd.c:337
***************************************************************


More information about the samba-technical mailing list