[distcc] Aborting jobs in status "Connect"

Martin Pool mbp at samba.org
Mon Nov 17 00:42:17 GMT 2003


On  1 Oct 2003, Christian Breimann <distccGroupEmail.20.cbreimann at spamgourmet.com> wrote:
> Hello,
> 
> I am using distcc to distribute jobs on several computers in a university 
> network. Some of them usually have problems like a very high load caused by 
> unsuccessfully closed X-sessions or strange cups-daemon-processes. Due to 
> problems like this, some machines allow normal users like me to connect, 
> but not to log-in, i.e., resulting in "hanging" ssh- or rsh-sessions. Only 
> root is able to log-in and terminate such high-load-processes or reboot the 
> machine. I cannot ask my system administrator to do this several times a 
> day.
> 
> However, in these cases, distcc seems to have a similar problem. The 
> graphical monitor shows that distcc is in "Connect"-Status for several 
> seconds or even minutes without anything happening for that job. All other 
> machines get their jobs, finish them and get new jobs, only this one 
> machine hangs. After everything has been completed, I can terminate the 
> make-run using CTRL-C and start it again, so that the last job gets 
> finished this time on another machine.
> 
> So I wonder whether distcc can do the following for me:
> If a distributed job remains in "Connect"-status for a certain amount of 
> time, perhaps a user-defined number of seconds or a default of 10 seconds, 
> distcc should kill this job, mark the machine as not available and 
> redistribute the job in the same way as if the machine is not reachable at 
> all.
> If "Send"-status is reached before this time limit everything should be 
> processed as before.
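
[Illustration of the behaviour requested above. This is a minimal sketch
in Python, not distcc code: the function names, the host list format and
the 10-second default are assumptions taken from the request, and it only
bounds the TCP connect step, so it would not help if the hang occurs
after the connection has been accepted.]

```python
import socket


def connect_with_timeout(host, port, timeout=10.0):
    """Return a connected socket, or None if the connect fails or times out."""
    try:
        return socket.create_connection((host, port), timeout=timeout)
    except OSError:
        # Covers refused connections, unreachable hosts and timeouts alike.
        return None


def pick_host(hosts, timeout=10.0, unavailable=None):
    """Try each (host, port) in order, skipping hosts already marked bad.

    Hosts that fail to connect within `timeout` seconds are added to
    `unavailable`, so later jobs can skip them. Returns a pair
    ((host, port), socket), or (None, None) if no host could be reached.
    """
    if unavailable is None:
        unavailable = set()
    for host, port in hosts:
        if (host, port) in unavailable:
            continue
        sock = connect_with_timeout(host, port, timeout)
        if sock is not None:
            return (host, port), sock
        # Mark the machine as not available and move on to the next one.
        unavailable.add((host, port))
    return None, None
```

[A caller would keep one `unavailable` set for the whole build and pass
it to every `pick_host` call, so a machine that timed out once is not
tried again for later jobs.]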

Hi,

Can you please turn on verbose client logging and show me at what
stage the client is hung?

-- 
Martin 
                               linux.conf.au -- Adelaide, January 2004


