dedup-{format,sift}.c: Find ways to save space by making hardlinks. mdw/dedup
authorMark Wooding <mwooding@good.com>
Fri, 6 Mar 2015 18:12:54 +0000 (18:12 +0000)
committerMark Wooding <mwooding@good.com>
Fri, 6 Mar 2015 18:12:54 +0000 (18:12 +0000)
commitcb8be37013a6aefa145183fa3948e3062317203c
treea9e15b4ece30461506f2e2c6390da3d7f76852e7
parente0de3610917976a1cade67f4c4a9975144bdf628
dedup-{format,sift}.c: Find ways to save space by making hardlinks.

Currently a bit work-in-progress.

  * dedup-format.c reads fshash files and reformats them a little, most
    importantly by adding the volume label from the fshash file to each
    line.

  * dedup-sift.c reads a sorted stream of dedup-formatted records and
    (currently) writes a report about which additional hardlinks can be
    made.  The tool is very careful not to corrupt the existing hardlink
    structure in a single volume, and not to hardlink files which are
    actually different, even if the fshash records are misleading.

Documentation is needed before these are ready for prime time.
Makefile.am
dedup-format.c [new file with mode: 0644]
dedup-sift.c [new file with mode: 0644]