[clug] using rsync for backups
Kim Holburn
kim.holburn at gmail.com
Sun May 25 18:13:28 GMT 2008
Can't you run an rsync daemon on the backup device? Then hopefully
the filelist would be generated when the process started up.
On 2008/May/23, at 4:21 AM, Michael James wrote:
> From my experience backing up 3/4 of a terrabyte using rsync,
> I've had some thoughts for possible improvements.
>
>
>
> Using prepared filelists:
>
> A normal rsync job sets up a sender and a reciever process
> which each generate filelists. Then the lists are compared,
> a task list is created, and the 2 processes get to work
> transferring new and changed files.
>
> When the reciever is a backup, no changes happen between rsyncs.
> So the filelist is (theoretically at least) the same as it was last
> time.
>
> Can rsync (or could it be modified to) use the pre-existing filelist?
> So the sender process could load the filelist from last time,
> prepare the task list and save the reciever all that preparation.
> For my 3/4 of a terrabyte system that's ~3 hours of work.
> Occasionally the reciever could re-scan the backup,
> just to make sure the filelist reflects reality.
>
>
>
> Coping with dormant branches by tarring, zipping and archiving:
>
> A lot of the space on my disks is taken up by files
> that mustn't be lost, but probably won't be needed again.
> As long as they can be easily recovered,
> making people wait a day to get them back wouldn't be a problem.
> So they are candidates for shipping to a tape-based data silo.
>
> So how about having a cron job that "find"s unread branches,
> tars and zips them up, puts the .tgz file into a separate archive
> area,
> and replaces the head directory with a special link.
> The archive area is rsync-ed to the silo without --delete
> so once the .tgz file has been transferred
> it can be deleted from the source.
>
> I'd prefer to use tar as this would cut our use of inodes in the silo,
> which has also been an issue.
>
>
>
>
> So much for the backing up, any fool can backup, but what about
> recovery?
>
> The source archive area is mounted using some fuse magic
> so users can "ls -l" it and even see the rsync sizes, dates
> and checksums. Once they ask for any contents,
> the fuse magic sets about recovering the tgz file
> and unpacking it back into its original location.
> I don't know what happens to the requesting process
> during the wait. It hangs? It returns a warning? An error?
>
>
>
> Any comments?
>
> --
> There is no perl one line hack
> that a page of java won't do more elegantly.
> --
> linux mailing list
> linux at lists.samba.org
> https://lists.samba.org/mailman/listinfo/linux
--
Kim Holburn
IT Network & Security Consultant
Ph: +39 06 855 4294 M: +39 3494957443
mailto:kim at holburn.net aim://kimholburn
skype://kholburn - PGP Public Key on request
Democracy imposed from without is the severest form of tyranny.
-- Lloyd Biggle, Jr. Analog, Apr 1961
More information about the linux
mailing list