[Bug 10244] New: link-by-hash patch: speed enhancement by hash calculation on source side

samba-bugs at samba.org samba-bugs at samba.org
Sun Nov 3 12:21:51 MST 2013


https://bugzilla.samba.org/show_bug.cgi?id=10244

           Summary: link-by-hash patch: speed enhancement by hash
                    calculation on source side
           Product: rsync
           Version: 3.1.0
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: P5
         Component: core
        AssignedTo: wayned at samba.org
        ReportedBy: M_Leipold at gmx.net
         QAContact: rsync-qa at samba.org


The link-by-hash patch actually is working perfectly in reducing needed storage
on the destination. But to do so changed/non existing/renamed/moved files are
first transfered from source to destination and only then the hash of the file
(the one for link-by-hash) is generated, the hash dir is checked and in case
the file is already existing the file is replaced by a hard link to the hash
dir.

In a setup of synchronizing two PCs/Servers via network (especially Internet) a
lot of network capacity and time could be saved if the hash (for link-by-hash)
would already be generated by the source side instance of rsync. This hash then
could be send to the destination rsync to check if the file is already existing
in the hash dir. In case the file existed only the hard link needs to be
generated but no file transfer would be necessary.

If possible this would not only speed up the "file transfer" but also solve the
problem of renamed and moved files (at least on a setup with link-by-hash)

Could you please check if the described setup could be possible.

Thanks in advance.

-- 
Configure bugmail: https://bugzilla.samba.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.


More information about the rsync mailing list