#

page-xml

https://static.github-zh.com/github_avatars/BobLd?size=40
C# 624
2 年前
https://static.github-zh.com/github_avatars/UB-Mannheim?size=40

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

JavaScript 196
4 个月前
https://static.github-zh.com/github_avatars/cneud?size=40

Conversions between various OCR formats

80
2 年前
https://static.github-zh.com/github_avatars/qurator-spk?size=40
Python 66
24 天前
https://static.github-zh.com/github_avatars/kba?size=40

Convert Transkribus PAGE-XML to standard PAGE-XML

Python 12
1 年前
https://static.github-zh.com/github_avatars/UB-Mannheim?size=40

NLP-helper for OCR-ed pages in PAGE XML format

Python 10
9 个月前
https://static.github-zh.com/github_avatars/VRI-UFPR?size=40

A powerful CLI tool for visualization and encoding of PAGE-XML files

Python 6
4 年前
https://static.github-zh.com/github_avatars/slub?size=40

Convert AWS Textract JSON to PRImA PAGE XML

Python 6
7 个月前
https://static.github-zh.com/github_avatars/Heresta?size=40
Python 4
4 年前
https://static.github-zh.com/github_avatars/IMAGO-Catalogues-Jjanes?size=40
Python 2
4 年前
https://static.github-zh.com/github_avatars/qurator-spk?size=40

Automatically re-order lines, words and glyphs to become textually consistent with their parents.

Python 2
2 年前
https://static.github-zh.com/github_avatars/OCR-D?size=40

About The repo gt_structure_1_4 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

1
1 年前
https://static.github-zh.com/github_avatars/OCR-D?size=40

The repo gt_structure_1_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

0
1 年前
https://static.github-zh.com/github_avatars/OCR-D?size=40

The repo gt_structure_1_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

0
1 年前
https://static.github-zh.com/github_avatars/Lemmbraalemao-DPB?size=40

The GBN Dataset consists German-Brazilian historical newspapers, along with their digital and binarized images and ground truth files.

0
5 个月前
https://static.github-zh.com/github_avatars/OCR-D?size=40

The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

0
1 年前
loading...
Website
Wikipedia