Popular recipes tagged "meta:requires=md5" and "files"http://code.activestate.com/recipes/tags/meta:requires=md5+files/2008-01-24T10:24:31-08:00ActiveState Code RecipesShelf alternative using hashed keys as filenames and pickled values (Python)
2008-01-24T10:24:31-08:00Brian Bushhttp://code.activestate.com/recipes/users/4119104/http://code.activestate.com/recipes/543263-shelf-alternative-using-hashed-keys-as-filenames-a/
<p style="color: grey">
Python
recipe 543263
by <a href="/recipes/users/4119104/">Brian Bush</a>
(<a href="/recipes/tags/files/">files</a>).
</p>
<p>Shelving dictionaries are quick and easy, until they grow too large and access is slowed to a crawl. This recipe is a directory cache with the filename an md5 of the key and the value is the file contents (as a pickled dump).</p>
Dupinator -- detect and delete duplicate files (Python)
2005-01-09T12:31:21-08:00Bill Bumgarnerhttp://code.activestate.com/recipes/users/2250923/http://code.activestate.com/recipes/362459-dupinator-detect-and-delete-duplicate-files/
<p style="color: grey">
Python
recipe 362459
by <a href="/recipes/users/2250923/">Bill Bumgarner</a>
(<a href="/recipes/tags/files/">files</a>).
</p>
<p>Point this script at a folder or several folders and it will find and delete all duplicate files within the folders, leaving behind the first file found of any set of duplicates. It is designed to handle hundreds of thousands of files of any size at a time and to do so quickly. It was written to eliminate duplicates across several photo libraries that had been shared between users. As the script was a one-off to solve a very particular problem, there are no options nor is it refactoring into any kind of modules or reusable functions.</p>
Simple md5 sum utility (Python)
2004-01-30T07:38:46-08:00Nick Vargishhttp://code.activestate.com/recipes/users/1603464/http://code.activestate.com/recipes/266486-simple-md5-sum-utility/
<p style="color: grey">
Python
recipe 266486
by <a href="/recipes/users/1603464/">Nick Vargish</a>
(<a href="/recipes/tags/files/">files</a>).
</p>
<p>I'm used to having the md5sum utility on Linux systems, so I was surprised that Mac OS-X doesn't seem to have it. Rather than finding and compiling the C code, I took advantage of the fact that 10.3 includes Python, and rolled my own.</p>