timeout and process cleanup bugs 2.4.7pre1+
Dave Dykstra
dwd at bell-labs.com
Tue Nov 13 03:55:06 EST 2001
On Fri, Nov 09, 2001 at 10:29:14AM -0700, tim.conway at philips.com wrote:
> I have written an application in perl, to run multiple rsyncs, retrying
> those which fail.
> I have started getting this error on very large modules since i moved to
> the latest CVS version.
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> ToolSyncProgress Succeeded Getting big/tools/vlsi_local/etc from willy at
> 1004767211:2001:11:3:6:0:11.
> ToolSyncProgress Getting big1/cadappl1/hpux from willy at
> 1004767211:2001:11:3:6:0:11.
> rsync: connection unexpectedly closed (50466734 bytes read so far)
> rsync error: error in rsync protocol data stream (code 12) at io.c(139)
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> The script checks exit status and retries, ordinarily.
> However, when i get THIS error, the rsync process seems to be killing the
> perl process. I think it's a bug in the part of the code that cleans up
> the spawned processes.
> Here's another example, which shows that the timeout as total transfer
> time but is still active. It also ends by killing its calling process.
> The error looks different, though, but it's on a local nfs-to-nfs, instead
> of rsyncd-to-nfs.
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to
> chi-tools2 at 1004913115:2001:11:4:22:31:55.
> io timeout after 3600 seconds - exiting
> rsync error: timeout in data send/receive (code 30) at io.c(75)
> ToolSyncProgress chinetsvr failed Replicating big1/cadappl1/hpux from
> /wan/chi-tools1/chi-tools1 to /wan/chi-tools2/chi-tools2 at
> 1004925444:2001:11:5:1:57:24 with 7680, retrying.
> ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to
> chi-tools2 at 1004925444:2001:11:5:1:57:24.
> io timeout after 3600 seconds - exiting
> rsync error: timeout in data send/receive (code 30) at io.c(75)
> ToolSyncProgress chinetsvr failed Replicating big1/cadappl1/hpux from
> /wan/chi-tools1/chi-tools1 to /wan/chi-tools2/chi-tools2 at
> 1004937647:2001:11:5:5:20:47 with 7680, retrying.
> ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to
> chi-tools2 at 1004937647:2001:11:5:5:20:47.
> io timeout after 3600 seconds - exiting
> rsync error: timeout in data send/receive (code 30) at io.c(75)
> ToolSyncProgress ToolSyncProgress Replicating io timeout after 3600
> seconds - exiting
> rsync error: timeout in data send/receive (code 30) at io.c(75)
> ToolSyncProgress ToolSyncProgress Replicating rsync error: received
> SIGUSR1 or SIGINT (code 20) at main.c(741)
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Any ideas? I can try to track the killing and timeout logic, if there's
> not already a fix out there that was just accidentally dropped.
I don't recall anybody else reporting a similar problem, so please try
to track and debug it further.
- Dave Dykstra
More information about the rsync
mailing list