#Computer Science# A Unified Toolkit for Deep Learning Based Document Image Analysis
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
The official repo for “DocScanner: Robust Document Image Rectification with Progressive Learning”, IJCV, 2025.
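Document image processing tool, including bleaching / text orientation correction / sharpness enhancement / handwritten-note denoising and beautification / shadow removal / dewarping / edge-trimming enhancement (DocBleach / TextOrientationCorrection / DocSharpening / HandwritingDenoisingBeautifying / DocShadowRemoval / ...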
The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.
#Computer Science# Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents
The ScriptNet / competitions site.
Sophia Trikoupi dataset (Collection of 46 handwritten, annotated pages)
This script automates the process of extracting text from various file formats (images, PDFs, DOCX) using Optical Character Recognition (OCR) powered by Azure Cognitive Services. The script supports i...