GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

dedupe

Website
Wikipedia
https://static.github-zh.com/github_avatars/restic?size=40
restic / restic

Fast, secure, efficient backup program

GoresticbackupEntity resolutiondedupesecure-by-default
Go 30.01 k
2 天前
https://static.github-zh.com/github_avatars/dedupeio?size=40
dedupeio / dedupe

🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

deduperecord-linkagePythonpython-libraryEntity resolutionclustering
Python 4.37 k
2 个月前
https://static.github-zh.com/github_avatars/scinos?size=40
scinos / yarn-deduplicate

Deduplication tool for yarn.lock files

Yarnduplicatesdedupe
TypeScript 1.39 k
9 天前
zinggAI/zingg
https://static.github-zh.com/github_avatars/zinggAI?size=40
zinggAI / zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

fuzzymatchfuzzy-matchingEntity resolutiondedupemasterdatadataengineering数据科学Apache Spark机器学习dataqualityanalyticsdatalakemaster-data-managementcustomer-data-platformdatabrickssnowflakecdpmdm
Java 1.09 k
13 天前
J535D165/recordlinkage
https://static.github-zh.com/github_avatars/J535D165?size=40
J535D165 / recordlinkage

#计算机科学#A powerful and modular toolkit for record linkage and duplicate detection in Python

record-linkageEntity resolutiondedupe机器学习隐私Pythonpython-librarysimilarity
Python 1.02 k
2 年前
https://static.github-zh.com/github_avatars/nil0x42?size=40
nil0x42 / duplicut

Remove duplicates from MASSIVE wordlist, without sorting it (for dictionary-based password cracking)

hashcatpasswordhashescrackingCwordlistwordlistsuniquniquededupewordlist-generatorduplicate-detectionpassword-crackingdictionary
C++ 951
4 个月前
https://static.github-zh.com/github_avatars/Zygo?size=40
Zygo / bees

Best-Effort Extent-Same, a btrfs dedupe agent

btrfsdedupe
C++ 833
2 个月前
https://static.github-zh.com/github_avatars/blakeembrey?size=40
blakeembrey / free-style

Make CSS easier and more maintainable by using JavaScript

CSSJavaScriptdedupehashTypeScriptcss-in-jscss-stringminification
TypeScript 707
2 年前
https://static.github-zh.com/github_avatars/dedupeio?size=40
dedupeio / csvdedupe

🆔 Command line tool for deduplicating CSV files

dedupe命令行界面record-linkageEntity resolutioncsv-files
Python 428
5 年前
https://static.github-zh.com/github_avatars/dedupeio?size=40
dedupeio / dedupe-examples

🆔 Examples for using the dedupe library

deduperecord-linkageEntity resolutionPython
Python 413
1 年前
https://static.github-zh.com/github_avatars/knjcode?size=40
knjcode / imgdupes

Identifying and removing near-duplicate images using perceptual hashing.

ImagededupePerceptual hashingperceptual-hashes
Python 377
5 个月前
https://static.github-zh.com/github_avatars/kornelski?size=40
kornelski / dupe-krill

A fast file deduplicator

deduperust-librarymacOS
Rust 198
1 个月前
https://static.github-zh.com/github_avatars/Lakshmipathi?size=40
Lakshmipathi / dduper

Fast block-level out-of-band BTRFS deduplication tool.

btrfsEntity resolutiondedupe
Python 181
10 个月前
https://static.github-zh.com/github_avatars/kdeldycke?size=40
kdeldycke / mail-deduplicate

📧 CLI to deduplicate mails from mail boxes

Pythonmaildedupe命令行界面maildirmailboxmboxEntity resolutioncleanupemail
Python 181
11 天前
https://static.github-zh.com/github_avatars/laktak?size=40
laktak / chkbit

Check your files for data corruption and run quick file deduplication

backupcloud-backupdedupeEntity resolutionbtrfs
Go 163
12 天前
https://static.github-zh.com/github_avatars/jason89521?size=40
jason89521 / daxus

Daxus is a server state management library for React that provides full control over data, leading to a better user experience.

Reactdatahookcachededupe用户体验(UX)
TypeScript 96
1 年前
https://static.github-zh.com/github_avatars/jRimbault?size=40
jRimbault / yadf

Yet Another Dupes Finder

duplicate-detectionEntity resolutionfdupesdedupe
Rust 63
13 天前
https://static.github-zh.com/github_avatars/Senzing?size=40
Senzing / awesome

Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.

Entity resolutionentityresolutionrecord-linkageHackathon-KitAwesome Listsdedupefuzzy-matchingfuzzymatchidentityentities
Python 62
3 天前
https://static.github-zh.com/github_avatars/zayne-labs?size=40
zayne-labs / callapi

A lightweight fetching library packed with essential features - retries, interceptors, request deduplication and much more, all while still retaining a similar API surface with regular Fetch.

retriesfetchparams插件Query (disambiguation)dedupeschematypesafevalidation
TypeScript 55
3 天前
https://static.github-zh.com/github_avatars/dssg?size=40
dssg / pgdedupe

A simple command line interface to the datamade/dedupe library.

Entity resolutiondedupePythondata-cleaningrecord-linkagePostgreSQL数据库
Jupyter Notebook 42
3 年前
loading...