[Bug 10353] New: link-by-hash collision detection

samba-bugs at samba.org samba-bugs at samba.org
Mon Dec 30 10:25:32 MST 2013


https://bugzilla.samba.org/show_bug.cgi?id=10353

           Summary: link-by-hash collision detection
           Product: rsync
           Version: 3.1.1
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P5
         Component: core
        AssignedTo: wayned at samba.org
        ReportedBy: jimklimov at gmail.com
         QAContact: rsync-qa at samba.org


The link-by-hash should include a mode to verify that the original file content
is indeed identical to the content of the file into which it might be
hardlinked per the hash value.

If the hash algorithm happens to be weak (allowing two files of the same size
with same hash and different content - i.e. a hash collision), the
hash-filenames should include a unique suffix (i.e. 123abcd.1024.0;1 and
123abcd.1024.0;2 to differentiate two files with different contents), and if
such filename patterns exist - all copies should be considered for link-by-hash
deduplication.

-- 
Configure bugmail: https://bugzilla.samba.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.


More information about the rsync mailing list