Entity resolution

Entity resolution (also known as data matching, data linkage, or record linkage, among other terms) is the task of finding records in a data set that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). It is needed when joining data sets on entities that may not share a common identifier (e.g., database key, URI, or national identification number), typically because of differences in record shape, storage location, or curator style or preference.
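
As a rough illustration of joining records that lack a shared identifier, the sketch below links names from two sources by normalized string similarity, using only Python's standard-library `difflib`. The function names, the sample records, and the 0.85 threshold are assumptions made for this example; it is not the method used by any of the projects listed on this page, which generally rely on more robust probabilistic or learned matching.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in name.lower())
    return " ".join(cleaned.split())


def similarity(a: str, b: str) -> float:
    """Similarity in [0, 1] between two names after normalization."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def link_records(left: dict, right: dict, threshold: float = 0.85) -> list:
    """For each left record, keep its best-scoring right record if the
    score reaches the (hypothetical) threshold."""
    links = []
    for left_id, left_name in left.items():
        best_id, best_score = None, 0.0
        for right_id, right_name in right.items():
            score = similarity(left_name, right_name)
            if score > best_score:
                best_id, best_score = right_id, score
        if best_id is not None and best_score >= threshold:
            links.append((left_id, best_id, round(best_score, 2)))
    return links


# Hypothetical records from two sources with unrelated keys.
crm = {"c1": "Acme Corp.", "c2": "Jane Smith"}
billing = {"b7": "ACME corp", "b9": "John Smyth"}

print(link_records(crm, billing))
```

Comparing every pair of records is quadratic in the number of records, so this approach only suits small inputs; larger data sets usually need a blocking or indexing step to limit the candidate pairs.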

Created by Halbert L. Dunn

Published in 1946

Repository: entity-resolution
Website: Wikipedia

Related topics

Artificial intelligence, Natural language processing
explosion / spaCy

#Natural language processing# Industrial-strength natural language processing (NLP) library in Python/Cython

Natural language processing, Data science, Machine learning, Python, cython, Artificial intelligence, spaCy, nlp-library, Neural networks, neural-networks, Deep learning, named-entity-recognition, Entity resolution, text-classification, tokenization
Python 31.77 k
18 days ago
restic / restic

Fast, secure, efficient backup program

Go, restic, backup, Entity resolution, dedupe, secure-by-default
Go 28.98 k
13 days ago
borgbackup / borg

#Security# Deduplicating archiver with compression and authenticated encryption.

Python, compression, ssh, Entity resolution, backup, borgbackup, encryption
Python 11.94 k
1 day ago
kopia / kopia

#Security# Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.

Entity resolution, backup, google-cloud-storage, encryption, cloud, Hacktoberfest
Go 9.77 k
2 days ago
prometheus / alertmanager

Prometheus Alertmanager

Monitoring, alertmanager, pagerduty, email, notifications, opsgenie, Slack, Entity resolution, Hacktoberfest
Go 7.04 k
14 days ago
arsenetar / dupeguru

Find duplicate files

Python, Entity resolution
Python 6.25 k
10 months ago
dedupeio / dedupe

🆔 A Python library for accurate and scalable fuzzy matching, record deduplication and entity resolution.

dedupe, record-linkage, Python, python-library, Entity resolution, clustering
Python 4.31 k
7 months ago
openvenues / libpostal

#Natural language processing# A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

address-parser, Machine learning, Natural language processing, address, international, C, Entity resolution, record-linkage, deduping
C 4.27 k
6 days ago
rustic-rs / rustic

#Security# rustic - fast, encrypted, and deduplicated backups powered by Rust

backup, Entity resolution, encryption, Rust, restic, Hacktoberfest
Rust 2.41 k
9 days ago
mhx / dwarfs

A fast high compression read-only file system for Linux, Windows and macOS

filesystem, squashfs, compression, fuse-filesystem, C++, lzma, zstd, fuse, Entity resolution, archiving, Linux, Windows, flac, macOS
C++ 2.33 k
18 days ago
sahib / rmlint

Extremely fast tool to remove duplicates and other lint from your filesystem

C, Python, filesystem, lint, duplicates, fdupes, Entity resolution
C 2.09 k
1 month ago
borgmatic-collective / borgmatic

Simple, configuration-driven backup software for servers and workstations

Python, backup, borg, borgbackup, Server, Entity resolution, healthchecks, MariaDB, MongoDB, MySQL, ntfy, PostgreSQL, SQLite, loki, zabbix, apprise, btrfs
Python 1.99 k
5 days ago
didi / ChineseNLP

#Natural language processing# Datasets and SOTA results for every field of Chinese NLP

Natural language processing, chinese-nlp, machine-translation, chinese-word-segmentation, Entity resolution, question-answering, nlp-tasks
HTML 1.8 k
3 years ago
moj-analytical-services / splink

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

record-linkage, Apache Spark, Entity resolution, fuzzy-matching, Data science, duckdb
Python 1.62 k
2 days ago
cupcakearmy / autorestic

Config-driven, easy backup CLI for restic.

restic, backup, Command-line interface, configuration, pruning, incremental, Entity resolution
Go 1.57 k
3 months ago
zinggAI / zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

fuzzymatch, fuzzy-matching, Entity resolution, dedupe, masterdata, dataengineering, Data science, Apache Spark, Machine learning, dataquality, analytics, datalake, master-data-management, customer-data-platform, databricks, snowflake, cdp, mdm
Java 1.04 k
3 days ago
J535D165 / recordlinkage

#Computer science# A powerful and modular toolkit for record linkage and duplicate detection in Python

record-linkage, Entity resolution, dedupe, Machine learning, Privacy, Python, python-library, similarity
Python 1.01 k
1 year ago
NVIDIA / NeMo-Curator

#Large language models# Scalable data preprocessing and curation toolkit for LLMs

data-curation, Large language models, data, data-prep, data-preparation, data-processing, data-quality, datacuration, datarecipes, Entity resolution, fine-tuning, large-language-models, large-scale-data-processing, llmapps, Python
Jupyter Notebook 946
2 days ago
JohnSnowLabs / nlu

One line for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.

nlu, natural-language-understanding, text-classification, transformers, language-detection, named-entity-recognition, seq2seq, t5, lemmatizer, spell-checker, sentence-embeddings, sentiment-analysis, Streamlit, pandas, dependency-parsing, Entity resolution
Python 918
5 months ago
heathersherry / Knowledge-Graph-Tutorials-and-Papers

#Awesome# Insightful Tutorials and Papers about Knowledge Graphs

entity-extraction, Entity resolution, information-extraction, knowledge-base, knowledge-graph, named-entity-recognition, kgqa, knowledge-graph-embeddings, Awesome Lists
896
4 days ago