I was reading that it is possible for 2 different files to produce the same MD5 hash.
As part of my exiftool musings I am making a thing that looks across all the drives on a computer and catalogs the images. Part of that process is dealing with the duplicates. I was thinking MD5 hash and number of file bytes should be a good indicator of uniqueness - also there may be exiftool data that assures uniqueness.
Any thoughts on MD5 and file uniqueness? I am trying to not trust stuff like file data and time - unless it comes from exiftool data not file system data.
As part of my exiftool musings I am making a thing that looks across all the drives on a computer and catalogs the images. Part of that process is dealing with the duplicates. I was thinking MD5 hash and number of file bytes should be a good indicator of uniqueness - also there may be exiftool data that assures uniqueness.
Any thoughts on MD5 and file uniqueness? I am trying to not trust stuff like file data and time - unless it comes from exiftool data not file system data.
Comment