document-ai · GitHub Topics

#自然语言处理#Unilm是一个跨任务、语言和模式的大规模自监督预训练模型

自然语言处理 pre-trained-model unilm minilm layoutlm layoutxlm beit document-ai trocr beit-3 foundation-models xlm-e deepnet 大语言模型 multimodal mllm kosmos kosmos-1 textdiffuser bitnet

Python 21.72 k

2 个月前

clovaai / donut

#自然语言处理#Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

document-ai eccv-2022 multimodal-pre-trained-model OCR 自然语言处理机器视觉

Python 6.53 k

1 年前

deepdoctection / deepdoctection

#自然语言处理#A Repo For Document AI

document-parser document-image-analysis table-recognition OCR document-ai document-understanding Python document-layout-analysis table-detection PyTorch Tensorflow layoutlm 自然语言处理

Python 2.95 k

1 天前

tstanislawek / awesome-document-understanding

#自然语言处理#A curated list of resources for Document Understanding (DU) topic

Awesome Lists 机器学习 information-extraction key-information-extraction document-understanding robotic-process-automation document-analysis document-layout-analysis OCR 自然语言处理深度学习 pdf rpa pdf-documents document-intelligence unstructured-data document-ai

1.46 k

2 年前

jpWang / LiLT

#自然语言处理#Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

自然语言处理 document-ai document-analysis document-understanding information-extraction multimodal-pre-trained-model

Python 354

3 年前

SCUT-DLVCLab / Document-AI-Recommendations

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

document-ai document-understanding key-information-extraction table-structure-recognition

200

6 个月前

doc-analysis / ReadingBank

#自然语言处理#ReadingBank: A Benchmark Dataset for Reading Order Detection

OCR 自然语言处理 document-understanding document-ai document-intelligence

109

1 年前

clovaai / webvicob

#自然语言处理#Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023

document-ai 自然语言处理 OCR

Python 108

2 年前

nttmdlab-nlp / SlideVQA

#自然语言处理#SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

aaai2023 机器视觉 document-ai 自然语言处理 OCR

Python 94

5 个月前

ZeningLin / ViBERTgrid-PyTorch

An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"

key-information-extraction document-ai information-extraction document-analysis

Python 53

2 年前

whn09 / table_structure_recognition

Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.

document-ai table-detection OCR table table-structure-recognition yolov5 yolov8

Jupyter Notebook 51

1 年前

DunnBC22 / Vision_Audio_and_Multimodal_Projects

This repository includes all computer vision, audio, document AI, and multimodal projects.

audio-classification 机器视觉 document-ai multimodal-deep-learning optical-character-recognition object-detection transfer-learning transformers

Jupyter Notebook 46

1 年前

googleapis / python-documentai-toolbox

Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from ...

人工智能 document-ai Google 云 generative-ai vertex-ai

Python 46

6 个月前

nttmdlab-nlp / VDocRAG

#自然语言处理# [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents

机器视觉 document-ai 自然语言处理 OCR

Python 39

4 个月前

ZeningLin / PEneo

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

document-ai document-understanding key-information-extraction OCR

Python 37

5 个月前

Unstructured-IO / community

#计算机科学#Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

community data-pipeline 深度学习 document-ai document-parsing 机器学习 nlp-parsing ocr-python Open Source

2 年前

qyhou / curated-table-structure-recognition

A curated list of resources on Table Structure Recognition

document-ai document-intelligence table-recognition table-structure-recognition

1 个月前

SCUT-DLVCLab / RFUND

[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"

document-understanding document-ai key-information-extraction OCR

9 个月前

Shulk97 / daniel

#自然语言处理#This repository contain the implementation of DANIEL. (A fast Document Attention Network for Information Extraction and Labeling of handwritten documents)

机器视觉 document-ai multimodal-pre-trained-model 自然语言处理 OCR

Python 17

5 天前

chenxn2020 / GOSE

[Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"

document-ai relation-extraction

Python 17

2 年前