Same hang with 3.0.7

Whit Blauvelt whit at transpect.com
Fri Feb 12 07:06:55 MST 2010


The end of the strace for the initiating system, to which the directory tree
is being updated, follows. This is while the load is rapidly rising, and the
rsync display on the console has simply hung. I'll put the whole strace file
at http://transpect.com/rsync-11184.out.bz2 for anyone who wants more.

...
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\3\210\3\213\3\216\0032872504/2872504/287250332"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\210\3\213\3\216\0032870448/2870448/2870449375"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\3\204\3\205\3\205\3\205\3\205\3\210\3\213\0032852636/"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\5@`K\5D{\207"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\3\201\3\201\3\201\3\201\3\201\3\201\3\201\3\201\3\201"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\3\207\3\207\3\207\3\207\3\207\3\207\3\207\3\207\3\207"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\265\234\0\0\0\0"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\201\3\202\3\202\3\202\3\202\3\202\3\202\3\202\3\202\3"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\265"..., 4092) = 4092
[pid 11187] <... select resumed> )      = 1 (out [5])
[pid 11188] select(6, [5], [], NULL, {60, 0} <unfinished ...>
[pid 11187] write(5, "\0\0\0\0\0\0\360?\0\0\0\374\17\0\7\0\0\0\0\0\0\0\0\0\0"..., 2097152 <unfinished ...>
[pid 11188] <... select resumed> )      = 1 (in [5], left {60, 0})
[pid 11187] <... write resumed> )       = 112000
[pid 11188] read(5,  <unfinished ...>
[pid 11187] select(7, [3 4], [3 5], NULL, NULL <unfinished ...>
[pid 11188] <... read resumed> "\374\17\0\7", 4) = 4
[pid 11187] <... select resumed> )      = 1 (out [3])
[pid 11188] select(6, [5], [], NULL, {60, 0} <unfinished ...>
[pid 11187] write(3, "\n\372\212\312\356\365\262\316\235\223u?p5\240\374M\221"..., 48 <unfinished ...>
[pid 11188] <... select resumed> )      = 1 (in [5], left {60, 0})
[pid 11187] <... write resumed> )       = 48
[pid 11188] read(5,  <unfinished ...>
[pid 11187] select(7, [3 4], [5], NULL, NULL <unfinished ...>
[pid 11188] <... read resumed> "\3a\3o\3\177\3\204\3\205\3\205\3\205\3\206\3\207\3\207"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\3Y\3^\3a\3a\3a\3o\3\177\3\204\3\205\3\205\3\205\3\206"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "Y\3Y\3Y\3Y\3^\3a\3a\3a\3o\3\177\3\204\3\205\3\205\3\205"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\374\17\0\7", 4)   = 4
[pid 11188] select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0})
[pid 11188] read(5, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 4092) = 4092
[pid 11188] select(6, [5], [], NULL, {60, 0} <unfinished ...>
[pid 11187] <... select resumed> )      = 1 (in [3])
[pid 11188] <... select resumed> )      = 1 (in [5], left {60, 0})

ctrl-C hit here.

Whit


On Thu, Feb 11, 2010 at 09:45:14PM -0500, Matt McCutchen wrote:
> On Thu, 2010-02-11 at 20:38 -0500, Whit Blauvelt wrote: 
> > I've had 2.6.9 running to sync a directory tree between two Ubuntu 8.04 LTS
> > systems for well over a year, every hour, without trouble until today.
> > Today, it runs through the whole tree, deletes, moves files over, then hangs
> > at the very end - sending the system load, which normally is minimal, up
> > over 10 for an extended time until killed.
> 
> On Thu, 2010-02-11 at 21:30 -0500, Whit Blauvelt wrote:
> > After also updating the remote system from which the files are being fetched
> > to 3.0.7 - both are 3.0.7 now - still the same hang in the same place.
> 
> Please run all three rsync processes under strace and provide the last
> part of the strace output for each process.  There are some tips on how
> to do this at http://rsync.samba.org/issues.html .
> 
> -- 
> Matt


More information about the rsync mailing list