#计算机科学#纯JavaScript OCR(文字识别),能识别超过100种语言文字
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Trained models with fast variant of the "best" LSTM models + legacy models
A wrapper to work with Tesseract OCR inside PHP.
Go package for OCR (Optical Character Recognition), by using Tesseract C++ library
#大语言模型#Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
#安卓#Experimental optical character recognition app
A Python wrapper for the tesseract-ocr API
Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.
Automation Utility - Recorder & Script Generator
Python tool for grabbing text via screenshot
#安卓#Android document document scanning app
Precompiled packages for AWS Lambda
#大语言模型#Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list gen...
#安卓#Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
Ruby library for working with the Tesseract OCR.