Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.
#自然语言处理#工业级的 Python/CPython 自然语言处理(NLP)库
Fast, secure, efficient backup program
#安全#Deduplicating archiver with compression and authenticated encryption.
#安全#Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
Prometheus Alertmanager
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
#自然语言处理#A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
#安全#rustic - fast, encrypted, and deduplicated backups powered by Rust
A fast high compression read-only file system for Linux, Windows and macOS
Extremely fast tool to remove duplicates and other lint from your filesystem
Simple, configuration-driven backup software for servers and workstations
#自然语言处理#Datasets, SOTA results of every fields of Chinese NLP
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Config driven, easy backup cli for restic.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
#计算机科学#A powerful and modular toolkit for record linkage and duplicate detection in Python
#大语言模型#Scalable data pre processing and curation toolkit for LLMs
1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.
#Awesome#Insightful Tutorials and Papers about Knowledge Graphs