GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

dedupe

Website
Wikipedia
https://static.github-zh.com/github_avatars/restic?size=40
restic / restic

Fast, secure, efficient backup program

GoresticbackupEntity resolutiondedupesecure-by-default
Go 28.98 k
13 天前
https://static.github-zh.com/github_avatars/dedupeio?size=40
dedupeio / dedupe

🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

deduperecord-linkagePythonpython-libraryEntity resolutionclustering
Python 4.31 k
7 个月前
https://static.github-zh.com/github_avatars/scinos?size=40
scinos / yarn-deduplicate

Deduplication tool for yarn.lock files

Yarnduplicatesdedupe
TypeScript 1.39 k
16 天前
zinggAI/zingg
https://static.github-zh.com/github_avatars/zinggAI?size=40
zinggAI / zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

fuzzymatchfuzzy-matchingEntity resolutiondedupemasterdatadataengineering数据科学Apache Spark机器学习dataqualityanalyticsdatalakemaster-data-managementcustomer-data-platformdatabrickssnowflakecdpmdm
Java 1.04 k
3 天前
J535D165/recordlinkage
https://static.github-zh.com/github_avatars/J535D165?size=40
J535D165 / recordlinkage

#计算机科学#A powerful and modular toolkit for record linkage and duplicate detection in Python

record-linkageEntity resolutiondedupe机器学习隐私Pythonpython-librarysimilarity
Python 1.01 k
1 年前
https://static.github-zh.com/github_avatars/nil0x42?size=40
nil0x42 / duplicut

Remove duplicates from MASSIVE wordlist, without sorting it (for dictionary-based password cracking)

hashcatpasswordhashescrackingCwordlistwordlistsuniquniquededupewordlist-generatorduplicate-detectionpassword-crackingdictionary
C++ 938
1 个月前
https://static.github-zh.com/github_avatars/Zygo?size=40
Zygo / bees

Best-Effort Extent-Same, a btrfs dedupe agent

btrfsdedupe
C++ 795
3 个月前
https://static.github-zh.com/github_avatars/blakeembrey?size=40
blakeembrey / free-style

Make CSS easier and more maintainable by using JavaScript

CSSJavaScriptdedupehashTypeScriptcss-in-jscss-stringminification
TypeScript 708
1 年前
https://static.github-zh.com/github_avatars/dedupeio?size=40
dedupeio / csvdedupe

🆔 Command line tool for deduplicating CSV files

dedupe命令行界面record-linkageEntity resolutioncsv-files
Python 422
5 年前
https://static.github-zh.com/github_avatars/dedupeio?size=40
dedupeio / dedupe-examples

🆔 Examples for using the dedupe library

deduperecord-linkageEntity resolutionPython
Python 413
10 个月前
https://static.github-zh.com/github_avatars/knjcode?size=40
knjcode / imgdupes

Identifying and removing near-duplicate images using perceptual hashing.

ImagededupePerceptual hashingperceptual-hashes
Python 363
2 个月前
https://static.github-zh.com/github_avatars/kornelski?size=40
kornelski / dupe-krill

A fast file deduplicator

deduperust-librarymacOS
Rust 194
2 年前
https://static.github-zh.com/github_avatars/kdeldycke?size=40
kdeldycke / mail-deduplicate

📧 CLI to deduplicate mails from mail boxes

Pythonmaildedupe命令行界面maildirmailboxmboxEntity resolutioncleanupemail
Python 180
6 天前
https://static.github-zh.com/github_avatars/Lakshmipathi?size=40
Lakshmipathi / dduper

Fast block-level out-of-band BTRFS deduplication tool.

btrfsEntity resolutiondedupe
Python 178
7 个月前
https://static.github-zh.com/github_avatars/laktak?size=40
laktak / chkbit

Check your files for data corruption and run quick file deduplication

backupcloud-backupdedupeEntity resolutionbtrfs
Go 144
12 天前
https://static.github-zh.com/github_avatars/jason89521?size=40
jason89521 / daxus

Daxus is a server state management library for React that provides full control over data, leading to a better user experience.

Reactdatahookcachededupe用户体验(UX)
TypeScript 96
8 个月前
https://static.github-zh.com/github_avatars/jRimbault?size=40
jRimbault / yadf

Yet Another Dupes Finder

duplicate-detectionEntity resolutionfdupesdedupe
Rust 61
2 个月前
https://static.github-zh.com/github_avatars/Senzing?size=40
Senzing / awesome

Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.

Entity resolutionentityresolutionrecord-linkageHackathon-KitAwesome Listsdedupefuzzy-matchingfuzzymatchidentityentities
Python 59
2 个月前
https://static.github-zh.com/github_avatars/dssg?size=40
dssg / pgdedupe

A simple command line interface to the datamade/dedupe library.

Entity resolutiondedupePythondata-cleaningrecord-linkagePostgreSQL数据库
Jupyter Notebook 42
2 年前
https://static.github-zh.com/github_avatars/jchristn?size=40
jchristn / WatsonDedupe

Self-contained C# library for data deduplication using Sqlite

compresscompressionEntity resolutionstoragededupeNuGet
C# 36
2 年前
loading...