timeout and process cleanup bugs 2.4.7pre1+

tim.conway at philips.com tim.conway at philips.com
Sat Nov 10 04:29:14 EST 2001


I have written an application in Perl to run multiple rsyncs, retrying 
those which fail.
I have started getting this error on very large modules since I moved to 
the latest CVS version.
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
ToolSyncProgress Succeeded Getting big/tools/vlsi_local/etc from willy at 
1004767211:2001:11:3:6:0:11.
ToolSyncProgress Getting big1/cadappl1/hpux from willy at 
1004767211:2001:11:3:6:0:11.
rsync: connection unexpectedly closed (50466734 bytes read so far)
rsync error: error in rsync protocol data stream (code 12) at io.c(139)
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Ordinarily, the script checks the exit status and retries.
However, when I get THIS error, the rsync process seems to be killing the 
Perl process.  I think it's a bug in the part of the code that cleans up 
the spawned processes.
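
To make the retry logic concrete, here is a minimal sketch of that kind of wrapper. It is in Python rather than Perl, it is not the actual script, and the name run_with_retries and the max_attempts parameter are made up for illustration. It also assumes, as a guess only, that the parent dies because the child signals its whole process group during cleanup; under that assumption, starting each child in its own session keeps such a signal away from the caller.

```python
import subprocess


def run_with_retries(cmd, max_attempts=3):
    """Run cmd until it exits 0 or attempts run out.

    Returns the list of return codes seen, one per attempt.
    A negative code means the child was killed by that signal number.
    (Hypothetical helper, not taken from the original script.)
    """
    codes = []
    for _ in range(max_attempts):
        # start_new_session=True makes the child call setsid(), so it gets
        # its own process group. A signal the child sends to "its group"
        # then cannot reach this parent process.
        proc = subprocess.Popen(cmd, start_new_session=True)
        codes.append(proc.wait())
        if codes[-1] == 0:
            break
    return codes
```

A call such as run_with_retries(["rsync", "-a", "src/", "dst/"]) would retry up to three times; the rsync arguments here are placeholders only.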
Here's another example, which shows that the timeout is not acting as a 
total transfer time but is still active.  It also ends by killing its 
calling process.  The error looks different, though; this one is a local 
NFS-to-NFS transfer instead of rsyncd-to-NFS.
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to 
chi-tools2 at 1004913115:2001:11:4:22:31:55.
io timeout after 3600 seconds - exiting
rsync error: timeout in data send/receive (code 30) at io.c(75)
ToolSyncProgress chinetsvr failed Replicating big1/cadappl1/hpux from 
/wan/chi-tools1/chi-tools1 to /wan/chi-tools2/chi-tools2 at 
1004925444:2001:11:5:1:57:24 with 7680, retrying.
ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to 
chi-tools2 at 1004925444:2001:11:5:1:57:24.
io timeout after 3600 seconds - exiting
rsync error: timeout in data send/receive (code 30) at io.c(75)
ToolSyncProgress chinetsvr failed Replicating big1/cadappl1/hpux from 
/wan/chi-tools1/chi-tools1 to /wan/chi-tools2/chi-tools2 at 
1004937647:2001:11:5:5:20:47 with 7680, retrying.
ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to 
chi-tools2 at 1004937647:2001:11:5:5:20:47.
io timeout after 3600 seconds - exiting
rsync error: timeout in data send/receive (code 30) at io.c(75)
ToolSyncProgress ToolSyncProgress Replicating io timeout after 3600 
seconds - exiting
rsync error: timeout in data send/receive (code 30) at io.c(75)
ToolSyncProgress ToolSyncProgress Replicating rsync error: received 
SIGUSR1 or SIGINT (code 20) at main.c(741)
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
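
Two numbers in the log above can be checked with a little arithmetic. The sketch below is Python, for illustration only, and rests on two assumptions: that "7680" is a raw wait status as Perl reports it in $? (exit code in the high byte, signal number in the low 7 bits), and that the epoch timestamp on each "failed" line marks roughly when that attempt ended.

```python
def decode_wait_status(status):
    """Split a traditional 16-bit wait status into (exit_code, signal_number)."""
    return status >> 8, status & 0x7F


# Assumption: 7680 is the raw status printed by the script.
exit_code, signal_number = decode_wait_status(7680)

# (start, end) epoch seconds for the first two attempts, copied from the log.
attempts = [(1004913115, 1004925444), (1004925444, 1004937647)]
durations = [end - start for start, end in attempts]
```

Under those assumptions, exit_code is 30, which matches the "code 30" in the rsync timeout message, and both durations are well over the 3600-second timeout value.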
Any ideas?  I can try to track down the killing and timeout logic, if 
there's not already a fix out there that was just accidentally dropped.

Tim Conway
tim.conway at philips.com
303.682.4917
Philips Semiconductor - Longmont TC
1880 Industrial Circle, Suite D
Longmont, CO 80501
Available via SameTime Connect within Philips, n9hmg on AIM
perl -e 'print pack(nnnnnnnnnnnn, 
19061,29556,8289,28271,29800,25970,8304,25970,27680,26721,25451,25970), 
".\n" '
"There are some who call me.... Tim?"



