😎 Finding duplicate images made easy!
Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.
Image similarity in Golang. Version 4 (LATEST)
Tool to detect (and get rid of) similar images using perceptual hashing (pHash lib)
A utility for locating near-duplicate photos irrespective of image resolution, compression settings or file format.
A Python tool to identify and remove similar-looking images from a dataset. Utilizes image preprocessing and hashing techniques for efficient comparison.
#Web crawler# Downloader with custom wildcard system: cherry-picking internet with asterisks for HTML or right-carets for API, whether it's for time-critical website moments or just for laziness. Features directory...
#Computer science# 🏍️ A clustering tool providing exact and near de-duplication of images using vector embeddings.
A CLI tool for image analysis: checking image integrity, image deduplication, image retrieval.
A Python command-line tool that identifies and groups similar images using average hashing. It supports single-level and recursive directory scanning, adjustable similarity threshold, and presents res...
The extended version of simhash supports fingerprint extraction of documents and images.
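An efficient Python tool for detecting duplicate images, supporting duplicate detection across millions of image files. It integrates multiple algorithms, including MD5 hashing, perceptual hashing (dHash/pHash/aHash) and a C++ acceleration library, and can identify duplicates that are identical, resized, partially cropped, or altered by a watermark change.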
Sort duplicate images into separate folders
A Python program to detect duplicate images in a specified folder.
A utility for testing the performance of de-duplication algorithms by randomly generating “noisy” images in a dataset.
This Python script identifies duplicate images within a specified directory and moves them to a designated duplicates folder.
A Python notebook combining MD5 and perceptual hashing to detect exact-duplicate images
Get Similarity is a Python-based tool with a GUI that lets users filter out low-quality images and automatically group similar images using embeddings from CL...
Finds the images in the directory that are most similar to the others and deletes the N most similar ones. Use it to remove similar images before training Stable Diffusion models.
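Several of the tools listed above combine an exact check (MD5) with a perceptual check (average hashing plus a similarity threshold). The following is a minimal sketch of that general idea, not the implementation of any project in this list. It assumes images have already been decoded and downscaled to a small grayscale grid (plain nested lists here, so the example needs no imaging library); the function names and the default threshold are hypothetical.

```python
import hashlib


def average_hash(pixels):
    """Build an integer hash with one bit per pixel.

    `pixels` is a 2D list of grayscale values (assumed already downscaled).
    A bit is 1 when the pixel is brighter than the mean of the grid.
    """
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for p in flat:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits


def hamming(a, b):
    """Count the bit positions in which two hashes differ."""
    return bin(a ^ b).count("1")


def is_exact_duplicate(data_a, data_b):
    """Exact check: two files with byte-identical content share an MD5 digest."""
    return hashlib.md5(data_a).digest() == hashlib.md5(data_b).digest()


def is_near_duplicate(pixels_a, pixels_b, threshold=2):
    """Perceptual check: hashes within `threshold` differing bits count as similar.

    The default threshold is an arbitrary illustrative value.
    """
    return hamming(average_hash(pixels_a), average_hash(pixels_b)) <= threshold


# Tiny 2x2 grids standing in for downscaled images.
original = [[10, 200], [220, 30]]
brighter = [[20, 210], [230, 40]]   # same picture, uniformly brighter
inverted = [[200, 10], [30, 220]]   # different light/dark layout
```

With these grids, `is_near_duplicate(original, brighter)` holds because a uniform brightness shift moves the mean along with the pixels, while `original` and `inverted` differ in every bit. A real tool would typically use a larger grid (8x8 is common) and tune the threshold against its own data.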