[distcc] Re: upgrading from 2.16 to 2.17: compiler crashes

Dimitri Papadopoulos-Orfanos papadopo at www.NOSPAM.fr
Thu Aug 26 07:24:13 GMT 2004


Hi,

This thread is about a distcc 2.17 bug that causes the client to crash
under heavy load, when many machines in the farm are shut down.

Unfortunately, distcc starts crashing only after many other instances
have been started, so it's not easy to run the client in a debugger and
reproduce the crash.

I do have core files, but they're not very informative: the stack seems
to be corrupted.

Any ideas on how to debug this? Any clue where the error might be in the 
new timeout code?
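
For anyone not familiar with the pattern, a connect timeout is usually
built from a non-blocking connect() followed by a wait for writability.
The sketch below is generic and is NOT distcc's code; the function name
connect_with_timeout and its parameters are made up for illustration,
and it handles IPv4 literals only.

```c
#include <arpa/inet.h>
#include <assert.h>
#include <errno.h>
#include <fcntl.h>
#include <netinet/in.h>
#include <poll.h>
#include <string.h>
#include <sys/socket.h>
#include <unistd.h>

/* Hypothetical helper (not from distcc): connect to a dotted-quad IPv4
 * address, giving up after timeout_ms milliseconds.
 * Returns a connected fd, or -1 on any failure including timeout. */
int connect_with_timeout(const char *ip, int port, int timeout_ms)
{
    struct sockaddr_in sa;
    struct pollfd pfd;
    int fd, flags, rc, err = 0;
    socklen_t len = sizeof err;

    assert(ip != NULL);

    memset(&sa, 0, sizeof sa);
    sa.sin_family = AF_INET;
    sa.sin_port = htons((unsigned short) port);
    if (inet_pton(AF_INET, ip, &sa.sin_addr) != 1)
        return -1;                      /* not a valid IPv4 literal */

    fd = socket(AF_INET, SOCK_STREAM, 0);
    if (fd < 0)
        return -1;

    /* Non-blocking mode, so connect() returns immediately. */
    flags = fcntl(fd, F_GETFL, 0);
    if (flags < 0 || fcntl(fd, F_SETFL, flags | O_NONBLOCK) < 0)
        goto fail;

    rc = connect(fd, (struct sockaddr *) &sa, sizeof sa);
    if (rc < 0 && errno != EINPROGRESS)
        goto fail;

    if (rc < 0) {
        /* Wait until the socket is writable or the timeout expires.
         * poll() is used here because, unlike select(), it has no
         * FD_SETSIZE limit on the descriptor value.
         * Simplification: an EINTR retry restarts the full timeout. */
        pfd.fd = fd;
        pfd.events = POLLOUT;
        pfd.revents = 0;
        do {
            rc = poll(&pfd, 1, timeout_ms);
        } while (rc < 0 && errno == EINTR);
        if (rc <= 0)
            goto fail;                  /* rc == 0 means timeout */

        /* Writable does not mean connected: fetch the real result. */
        if (getsockopt(fd, SOL_SOCKET, SO_ERROR, &err, &len) < 0 || err != 0)
            goto fail;
    }
    return fd;

fail:
    close(fd);
    return -1;
}
```

The points worth checking in any such code are the error paths: every
failure after socket() must close the descriptor exactly once, and the
SO_ERROR result must be read before the socket is treated as connected.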


I wrote:
> 
> I've updated distcc from 2.16 to 2.17 and I'm experiencing crashes.
> [...]
> distcc[20711] ERROR: Connect timeout
> distcc[20723] ERROR: Connect timeout
> distcc[20724] ERROR: Connect timeout
> distcc[20731] ERROR: Connect timeout
> distcc[20727] ERROR: Connect timeout
> distcc[20736] ERROR: Connect timeout
> distcc[20730] ERROR: Connect timeout
> distcc[20728] ERROR: Connect timeout
> distcc[20729] ERROR: Connect timeout
> distcc[20740] ERROR: Connect timeout
> make[3]: *** [mesh/mesher.o] Segmentation fault (core dumped)
> make[3]: *** Waiting for unfinished jobs....
> make[3]: *** [mesh/reducedNeigh.o] Segmentation fault (core dumped)
> make[3]: *** [mesh/vertices.o] Segmentation fault (core dumped)
> make[3]: *** [mesh/splitting.o] Segmentation fault (core dumped)
> make[3]: *** [mesh/surface.o] Segmentation fault (core dumped)
> make[3]: *** [mesh/surf2facet.o] Segmentation fault (core dumped)
> make[3]: *** [mesh/triangles.o] Segmentation fault (core dumped)
> make[3]: *** [mesh/voxel2facet.o] Segmentation fault (core dumped)
> make[3]: *** [mesh/meshMerge.o] Segmentation fault (core dumped)
> make[3]: *** [mesh/smoothing.o] Segmentation fault (core dumped)
> make[2]: *** [all] Error 2
> make[2]: Leaving directory 
> [...]
> $ gdb /usr/local/distcc/bin/distcc 
> aimsalgo-linux-default/src/aimsalgo/core.20711
> [...]
> Core was generated by `g++ -c -Wall -W -D_REENTRANT 
> -DCARTO_DEBUGMODE="default" -DAIMS -I/home/usernam'.
> Program terminated with signal 11, Segmentation fault.
> [...]
> #0  0xbffff0c0 in ?? ()
> (gdb) bt
> #0  0xbffff0c0 in ?? ()
> Cannot access memory at address 0x50ea
> (gdb)
> $


Regards,
Dimitri


