single rsync of large directory structure versus several smaller rsync's

jw schultz jw at pegasys.ws
Sun Mar 16 20:40:54 EST 2003


On Sun, Mar 16, 2003 at 09:05:45AM +0000, Pete Mocat wrote:
> Hello,
> 
> I am working on a backup script using rsync and have a question.
> 
> Rather than doing a single rsync of /home/ I have set my script up
> to loop over /etc/passwd and do a seperate rsync of each user home dir 
> (/home/user1, /home/user2) etc.
> 
> In the end, I should have the exact same backup either way. However I feel 
> safest doing it as multiple rsyncs (one for each user). I am doing this 
> because I want to avoid a case in which some fatal error on a particular 
> dir kills off the whole rsync. For example, given:
> 
> /home/adoe
> /home/jdoe
> /home/zdoe
> 
> If I do:
> 
> rsync -vaze ssh /home/
> 
> I am afraid that if /home/adoe had a fatel error (one that kills rsync) all 
> dirs past it would not be backed up and given an over-worked sysadmin 
> (never!) we might miss the error until the day we need the backups.
> 
> If I do a seperate:
> 
> rsync -vaze ssh /home/adoe
> rsync -vaze ssh /home/jdoe
> rsync -vaze ssh /home/zdoe
> 
> etc.
> 
> I should still have a backup of each dir even if some fail.
> 
> Is this being too paranoid?

How paranoid is too paranoid?

> Is there something built into rsync to avoid this?

If an error looks to be isolated to a single file rsync
continues onward and merely logs the error.  Your script
should examine the exit status and/or logs and notify if
there is an error.

> Does anyone have any better ideas for me? I want to make the best use of 
> rsync possible.
> 
> Thanks for your advice and comments!

I don't think you are being paranoid enough.  Unless your
backups are subsequently backed up again you only have
one backup here.  What is going to happen if a file gets
trashed and it isn't noticed for a few days?

I would suggest you look into Mike Rubel's snapshot
system, Dirvish or another system that has already dealt
with these issues and can maintain and manage multiple
backup images.  I cannot speak for the others but i know
that Dirvish is in production use at multiple sites.

-- 
________________________________________________________________
	J.W. Schultz            Pegasystems Technologies
	email address:		jw at pegasys.ws

		Remember Cernan and Schmitt


More information about the rsync mailing list