GitHub Chinese Community
#record-linkage

dedupeio / dedupe

🆔 A Python library for accurate and scalable fuzzy matching, record deduplication and entity resolution.

dedupe, record-linkage, Python, python-library, Entity resolution, clustering
Python 4.31 k
7 months ago
openvenues / libpostal

#Natural language processing# A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

address-parser, Machine learning, Natural language processing, address, international, C, Entity resolution, record-linkage, deduping
C 4.27 k
6 days ago
moj-analytical-services / splink

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

record-linkage, Apache Spark, Entity resolution, fuzzy-matching, Data science, duckdb
Python 1.62 k
3 days ago
J535D165 / recordlinkage

#Computer science# A powerful and modular toolkit for record linkage and duplicate detection in Python

record-linkage, Entity resolution, dedupe, Machine learning, Privacy, Python, python-library, similarity
Python 1.01 k
1 year ago
Yomguithereal / talisman

#Natural language processing# Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

Natural language processing, Machine learning, fuzzy-matching, clustering, record-linkage, information-retrieval, Entity resolution
JavaScript 716
1 year ago
dedupeio / csvdedupe

🆔 Command line tool for deduplicating CSV files

dedupe, Command-line interface, record-linkage, Entity resolution, csv-files
Python 422
5 years ago
dedupeio / dedupe-examples

🆔 Examples for using the dedupe library

dedupe, record-linkage, Entity resolution, Python
Python 413
10 months ago
J535D165 / data-matching-software

#Awesome# A list of free data matching and record linkage software.

record-linkage, Entity resolution, fuzzy-matching, Software, Awesome Lists, Machine learning, Open Source
384
1 year ago
Bergvca / string_grouper

Super Fast String Matching in Python

fuzzy-matching, record-linkage, string-matching
Python 369
3 months ago
maxharlow / csvmatch

🔎 Finds fuzzy matches between CSV files

Entity resolution, fuzzy-matching, record-linkage, CSV
Python 189
3 months ago
vintasoftware / entity-embed

#Computer science# PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.

Entity resolution, record-linkage, representation-learning, embeddings, Deep learning, Python, PyTorch
Jupyter Notebook 153
3 years ago
dice-group / LIMES

#Computer science# Link Discovery Framework for Metric Spaces.

Machine learning, Artificial intelligence, optimization, linked-data, Semantic Web, record-linkage, scalability, RDF (Resource Description Framework)
JavaScript 130
10 months ago
zouzias / spark-lucenerdd

Spark RDD with Lucene's query and entity linkage capabilities

Apache Spark, lucene, record-linkage, Entity resolution, linkage, Hacktoberfest
Scala 128
7 days ago
ropeladder / record-linkage-resources

Resources for tackling record linkage / deduplication / data matching problems

record-linkage, Entity resolution, Python, Java, JavaScript
123
1 year ago
dell-research-harvard / linktransformer

#Natural language processing# A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning

Deep learning, Entity resolution, huggingface-transformers, Natural language processing, Python, record-linkage, sentence-transformers, transformers
Python 119
2 months ago
usc-isi-i2 / rltk

Record Linkage ToolKit (Find and link entities)

linkage, similarity, string-similarity, record-linkage, Entity resolution
Python 110
2 years ago
Wikidata / soweego

Link Wikidata items to large catalogs

wikimedia, wikidata, knowledge-graph, record-linkage, Entity resolution, identifiers
Python 96
3 months ago
fritshermans / deduplipy

Python package for deduplication/entity resolution using active learning

Entity resolution, fuzzy-matching, record-linkage
Python 80
10 months ago
OlivierBinette / Awesome-Entity-Resolution

#Awesome# List of entity resolution software and resources.

Awesome Lists, Entity resolution, record-linkage
73
4 months ago
data61 / anonlink

Python implementation of anonymous linkage using cryptographic linkage keys

privacy-enhancing-technologies, privacy-preserving, record-linkage, Entity resolution, Cryptography
Python 65
1 year ago