GitHub Chinese Community

hocr

UglyToad / PdfPig

Read and extract text and other content from PDFs in C# (port of PDFBox)

pdfbox, pdf, pdf-document, C#, netstandard, pdf-extractor, pdf-document-processor, pdf-files, alto-xml, hocr, layout-analysis, document-analysis, page-xml, pdf-generation
C# 2.05 k
15 days ago
manisandro / gImageReader

A Gtk/Qt front-end to tesseract-ocr.

Qt, OCR, pdf-document, C++, tesseract-ocr, GTK, hocr, scanner
C++ 1.77 k
12 days ago
mittagessen / kraken

OCR engine for all the languages

OCR, neural-networks, alto-xml, hocr, handwritten-text-recognition, layout-analysis, optical-character-recognition, page-xml
Python 835
11 days ago
BobLd / DocumentLayoutAnalysis

Document Layout Analysis resources repos for development with PdfPig.

document-layout-analysis, layout-analysis, table-extraction, pdf, C#, hocr, page-xml, alto-xml
C# 619
2 years ago
UB-Mannheim / ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

OCR, hocr, page-xml, validation, transformation
JavaScript 188
1 month ago
cneud / ocr-conversion

Conversions between various OCR formats

alto-xml, hocr, page-xml, OCR
78
2 years ago
filak / hOCR-to-ALTO

Convert between Tesseract hOCR and ALTO XML using XSL stylesheets

hocr
XSLT 55
25 days ago
dbmdz / mirador-textoverlay

Text Overlay plugin for Mirador 3

OCR, optical-character-recognition, hocr, alto-xml
JavaScript 54
2 months ago
UB-Mannheim / ocr-gt-tools

Ergonomic line-by-line transcription of scanned text.

OCR, hocr, transcription, ground-truth, web-interface
JavaScript 51
4 years ago
dmi3kno / hocr

Text-to-tibble

OCR, tesseract, tesseract-ocr, R, rstats, hocr
R 36
5 years ago
fakabbir / OCR

Probabilistic Key Value pair extraction using word weights from Invoices - Non Searchable PDF

OCR, hocr, tesseract, Python
Python 18
4 years ago
macabeus / pyslibtesseract

✏️ Integration of Tesseract for Python using a shared library

tesseract, hocr, OCR
Python 12
9 years ago
GeReV / hocr-editor-ts

A visual hOCR file editor

OCR, hocr, tesseract-ocr
TypeScript 10
1 year ago
iilei / hocr-to-json

OCR, hocr
JavaScript 4
2 years ago
GeReV / HocrEditor

A visual editor for .hocr files.

hocr, tesseract-ocr, OCR
C# 4
4 months ago
hadro / new-york-city-directories

Some basic data and text extraction from the New York City Directories

digital-humanities, pdfs, OCR, hocr
4
8 years ago
hadro / brewery-guides

The data for guides to breweries across the United States from 1896 to 1918

hocr, data, dataset, digital-humanities, Open Data
3
8 years ago
jlieth / hocr-parser

Python parser for hOCR files using lxml

Python, hocr, OCR, parsing-library
Python 3
5 years ago
emmeryn / hocr-turtletext

A gem that parses positional text from hOCR output and provides convenience methods to find text.

hocr, extract-text, gem, Rails
Ruby 3
3 years ago
mayurcybercz / AI-Exam-evaluation

CLI tool to recognise handwritten text from answer sheets using Tesseract OCR. The extracted text is then used to evaluate marks using NLP.

tesseract-ocr, hocr, natural language processing, command-line interface, JSON, Python, nltk
Jupyter Notebook 3
6 years ago