Very surprising behaviour with --files-from

Robin Lee Powell rlpowell at digitalkingdom.org
Fri Dec 10 10:45:54 MST 2010


On Fri, Dec 10, 2010 at 09:11:39AM -0800, Robin Lee Powell wrote:
> 
> $ wc -l /tmp/list
> 1000 /tmp/list
> 
> $ rsync -i -aPv --ignore-existing --files-from=/tmp/list /backups/ ut00-s00010:/backups/
> building file list ...
> 3937 files to consider
> 
> I am totally baffled.
> 
> That's not such a big deal, but the list I'm *actually* using has
> twenty *million* files in it.  At a couple hundred files a second,
> if it's going to check 4 times the number of files, that's a *huge*
> time waste.  What's going on?
> 
> Here's what the list looks like:
> 
> $ head /tmp/list
> cpool/b/c/5/bc5ea7a79a4824c6729645c66b562e6b
> cpool/7/7/8/77865de94585b4581f07e54065c7b1e3
> cpool/2/5/0/250f326bfa69c9da011f809a8b46cea7
> cpool/3/3/8/3382672447e7f9a00ea755cee7ad5187
> cpool/1/0/e/10eec0876f979ca8773f63e697be0adf
> cpool/0/e/b/0ebf2a81c863702baa4eb38ec3cef655
> cpool/3/6/c/36c915e781561292d9ae73e127504d0d
> cpool/b/5/0/b50dcb17dac0808c4b5de1a9a3b747af
> cpool/8/5/f/85fb8dc29ed1597c3fd0725ff91da279
> cpool/9/0/8/90829abb5879fcbe39c2f55c4211b3c5
> 
> They are all like that, and they are all files, not directories.
> 
> I thought it could be rsync checking the directories that have those
> files in them, but there are only 4300 directories,

I'm trying it with 

$ wc -l /tmp/list
1000000 /tmp/list

and currently it's up to:

 2198200 files...

So again, that's a *huge* amount of wasted time.  Why?

And why isn't it transferring incrementally?  It's rsync 3.0 on both
ends.

-Robin

-- 
http://singinst.org/ :  Our last, best hope for a fantastic future.
Lojban (http://www.lojban.org/): The language in which "this parrot
is dead" is "ti poi spitaki cu morsi", but "this sentence is false"
is "na nei".   My personal page: http://www.digitalkingdom.org/rlp/


More information about the rsync mailing list