GitHub Chinese Community

page-xml

UglyToad / PdfPig

Read and extract text and other content from PDFs in C# (port of PDFBox)

pdfbox, pdf, pdf-document, C#, netstandard, pdf-extractor, pdf-document-processor, pdf-files, alto-xml, hocr, layout-analysis, document-analysis, page-xml, pdf-generation
C# 2.04k
15 days ago
mittagessen / kraken

OCR engine for all the languages

OCR, neural-networks, alto-xml, hocr, handwritten-text-recognition, layout-analysis, optical-character-recognition, page-xml
Python 835
10 days ago
BobLd / DocumentLayoutAnalysis

Document layout analysis resources for development with PdfPig.

document-layout-analysis, layout-analysis, table-extraction, pdf, C#, hocr, page-xml, alto-xml
C# 619
2 years ago
lquirosd / P2PaLA

Page to PAGE Layout Analysis Tool

deep neural networks, handwritten-text-recognition, document-layout-analysis, page-xml, PyTorch, pix2pix, Generative Adversarial Network, machine vision, image-segmentation
Python 191
3 years ago
UB-Mannheim / ocr-fileformat

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

OCR, hocr, page-xml, validation, transformation
JavaScript 188
1 month ago
cneud / ocr-conversion

Conversions between various OCR formats

alto-xml, hocr, page-xml, OCR
78
2 years ago
qurator-spk / dinglehopper

An OCR evaluation tool

OCR, alto-xml, page-xml, page
Python 66
1 month ago
kba / transkribus-to-prima

Convert Transkribus PAGE-XML to standard PAGE-XML

OCR, page-xml
Python 12
1 year ago
UB-Mannheim / blatt

NLP-helper for OCR-ed pages in PAGE XML format

page-xml
Python 10
6 months ago
VRI-UFPR / page-xml-draw

A powerful CLI tool for visualization and encoding of PAGE-XML files

page-xml, visualization, OpenCV, OCR, layout-analysis, segmentation
Python 6
4 years ago
slub / textract2page

Convert AWS Textract JSON to PRImA PAGE XML

OCR, page-xml, Python
Python 6
4 months ago
Heresta / OCR17plus

Data for layout analysis and HTR.

XML, alto-xml, page-xml, png, dataset, OCR, segmentation
Python 4
4 years ago
IMAGO-Catalogues-Jjanes / cataloguesSegmentationOCR

Dataset and models for catalogs' Layout analysis and HTR

OCR, segmentation, page-xml, alto-xml, catalog
Python 2
4 years ago
qurator-spk / ocrd_repair_inconsistencies

Automatically re-order lines, words and glyphs to become textually consistent with their parents.

OCR, page-xml, page
Python 2
1 year ago
OCR-D / gt_structure_1_4

The repo gt_structure_1_4 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

ground-truth, page-xml, repository, segmentation
1
1 year ago
OCR-D / gt_structure_1_3

The repo gt_structure_1_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

ground-truth, repository, segmentation, page-xml
0
1 year ago
VRI-UFPR / ocrd-page-xml-draw

OCR-D wrapper for page-xml-draw

visualization, segmentation, layout-analysis, OCR, page-xml
Python 0
4 years ago
OCR-D / gt_structure_1_2

The repo gt_structure_1_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

ground-truth, page-xml, repository, segmentation
0
1 year ago
Lemmbraalemao-DPB / German-Brazilian-Newspapers-Dataset_1

The GBN Dataset consists of German-Brazilian historical newspapers, along with their digital and binarized images and ground truth files.

ground-truth, OCR, page-xml, training
0
2 months ago
OCR-D / gt_structure_1_1

The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

ground-truth, page-xml, repository, segmentation
0
1 year ago