Fast, secure, efficient backup program
🆔 A Python library for accurate and scalable fuzzy matching, record deduplication and entity resolution.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
A powerful and modular toolkit for record linkage and duplicate detection in Python
Remove duplicates from MASSIVE wordlist, without sorting it (for dictionary-based password cracking)
Make CSS easier and more maintainable by using JavaScript
🆔 Command line tool for deduplicating CSV files
🆔 Examples for using the dedupe library
Identifying and removing near-duplicate images using perceptual hashing.
Fast block-level out-of-band Btrfs deduplication tool.
Check your files for data corruption and run quick file deduplication
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
A simple command line interface to the datamade/dedupe library.
Self-contained C# library for data deduplication using SQLite