#

string-similarity

rapidfuzz/RapidFuzz
https://static.github-zh.com/github_avatars/rapidfuzz?size=40
Python 3.39 k
21 小时前
https://static.github-zh.com/github_avatars/aceakash?size=40

Finds degree of similarity between two strings, based on Dice's Coefficient, which is mostly better than Levenshtein distance.

JavaScript 2.53 k
2 年前
https://static.github-zh.com/github_avatars/adrg?size=40

Go metrics for calculating string similarity and other string utility functions

Go 396
10 天前
https://static.github-zh.com/github_avatars/rapidfuzz?size=40

The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity

C++ 350
5 个月前
https://static.github-zh.com/github_avatars/rapidfuzz?size=40
C++ 325
20 天前
https://static.github-zh.com/github_avatars/rapidfuzz?size=40

The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity

119
6 个月前
https://static.github-zh.com/github_avatars/rieck?size=40
C 117
6 年前
https://static.github-zh.com/github_avatars/usc-isi-i2?size=40
Python 110
2 年前
https://static.github-zh.com/github_avatars/Daniel-Liu-c0deb0t?size=40

#算法刷题#Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.

Rust 108
3 年前
https://static.github-zh.com/github_avatars/stephenjjbrown?size=40
JavaScript 105
2 年前
https://static.github-zh.com/github_avatars/agext?size=40

Levenshtein distance and similarity metrics with customizable edit costs and Winkler-like bonus for common prefix.

Go 90
5 年前
https://static.github-zh.com/github_avatars/vickumar1981?size=40

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard si...

Scala 81
3 年前
https://static.github-zh.com/github_avatars/Daniel-Liu-c0deb0t?size=40

Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.

Java 77
1 年前
https://static.github-zh.com/github_avatars/rapidfuzz?size=40

Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity

Python 74
2 年前
https://static.github-zh.com/github_avatars/hyperjumptech?size=40

Beda is a golang library for detecting how similar a two string

Go 55
5 年前
https://static.github-zh.com/github_avatars/hbakhtiyor?size=40
Go 45
7 年前
https://static.github-zh.com/github_avatars/umbertogriffo?size=40

A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.

Java 44
3 年前
https://static.github-zh.com/github_avatars/iesl?size=40

Learning String Alignments for Entity Aliases

Python 37
6 年前
https://static.github-zh.com/github_avatars/lewinfox?size=40

Fuzzy string matching in R. Inspired by Python's thefuzz (but without the Python).

R 36
1 个月前
loading...
Website
Wikipedia