timeout and process cleanup bugs 2.4.7pre1+

Dave Dykstra dwd at bell-labs.com
Tue Nov 13 03:55:06 EST 2001


On Fri, Nov 09, 2001 at 10:29:14AM -0700, tim.conway at philips.com wrote:
> I have written an application in perl, to run multiple rsyncs, retrying 
> those which fail.
> I have started getting this error on very large modules since i moved to 
> the latest CVS version.
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> ToolSyncProgress Succeeded Getting big/tools/vlsi_local/etc from willy at 
> 1004767211:2001:11:3:6:0:11.
> ToolSyncProgress Getting big1/cadappl1/hpux from willy at 
> 1004767211:2001:11:3:6:0:11.
> rsync: connection unexpectedly closed (50466734 bytes read so far)
> rsync error: error in rsync protocol data stream (code 12) at io.c(139)
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> The script checks exit status and retries, ordinarily.
> However, when i get THIS error, the rsync process seems to be killing the 
> perl process.  I think it's a bug in the part of the code that cleans up 
> the spawned processes.
> Here's another example, which shows that the timeout as total transfer 
> time but is still active.  It also ends by killing its calling process. 
> The error looks different, though, but it's on a local nfs-to-nfs, instead 
> of rsyncd-to-nfs.
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to 
> chi-tools2 at 1004913115:2001:11:4:22:31:55.
> io timeout after 3600 seconds - exiting
> rsync error: timeout in data send/receive (code 30) at io.c(75)
> ToolSyncProgress chinetsvr failed Replicating big1/cadappl1/hpux from 
> /wan/chi-tools1/chi-tools1 to /wan/chi-tools2/chi-tools2 at 
> 1004925444:2001:11:5:1:57:24 with 7680, retrying.
> ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to 
> chi-tools2 at 1004925444:2001:11:5:1:57:24.
> io timeout after 3600 seconds - exiting
> rsync error: timeout in data send/receive (code 30) at io.c(75)
> ToolSyncProgress chinetsvr failed Replicating big1/cadappl1/hpux from 
> /wan/chi-tools1/chi-tools1 to /wan/chi-tools2/chi-tools2 at 
> 1004937647:2001:11:5:5:20:47 with 7680, retrying.
> ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to 
> chi-tools2 at 1004937647:2001:11:5:5:20:47.
> io timeout after 3600 seconds - exiting
> rsync error: timeout in data send/receive (code 30) at io.c(75)
> ToolSyncProgress ToolSyncProgress Replicating io timeout after 3600 
> seconds - exiting
> rsync error: timeout in data send/receive (code 30) at io.c(75)
> ToolSyncProgress ToolSyncProgress Replicating rsync error: received 
> SIGUSR1 or SIGINT (code 20) at main.c(741)
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Any ideas?  I can try to track the killing and timeout logic, if there's 
> not already a fix out there that was just accidentally dropped.


I don't recall anybody else reporting a similar problem, so please try
to track and debug it further.

- Dave Dykstra




More information about the rsync mailing list