GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

document-ai

Website
Wikipedia
https://static.github-zh.com/github_avatars/microsoft?size=40
microsoft / unilm

#自然语言处理#Unilm是一个跨任务、语言和模式的大规模自监督预训练模型

自然语言处理pre-trained-modelunilmminilmlayoutlmlayoutxlmbeitdocument-aitrocrbeit-3foundation-modelsxlm-edeepnet大语言模型multimodalmllmkosmoskosmos-1textdiffuserbitnet
Python 21.58 k
1 个月前
https://static.github-zh.com/github_avatars/clovaai?size=40
clovaai / donut

#自然语言处理#Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

document-aieccv-2022multimodal-pre-trained-modelOCR自然语言处理机器视觉
Python 6.46 k
1 年前
https://static.github-zh.com/github_avatars/deepdoctection?size=40
deepdoctection / deepdoctection

#自然语言处理#A Repo For Document AI

document-parserdocument-image-analysistable-recognitionOCRdocument-aidocument-understandingPythondocument-layout-analysistable-detectionPyTorchTensorflowlayoutlm自然语言处理
Python 2.9 k
2 天前
https://static.github-zh.com/github_avatars/tstanislawek?size=40
tstanislawek / awesome-document-understanding

#自然语言处理#A curated list of resources for Document Understanding (DU) topic

Awesome Lists机器学习information-extractionkey-information-extractiondocument-understandingrobotic-process-automationdocument-analysisdocument-layout-analysisOCR自然语言处理深度学习pdfrpapdf-documentsdocument-intelligenceunstructured-datadocument-ai
1.45 k
2 年前
https://static.github-zh.com/github_avatars/jpWang?size=40
jpWang / LiLT

#自然语言处理#Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

自然语言处理document-aidocument-analysisdocument-understandinginformation-extractionmultimodal-pre-trained-model
Python 351
3 年前
https://static.github-zh.com/github_avatars/SCUT-DLVCLab?size=40
SCUT-DLVCLab / Document-AI-Recommendations

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

document-aidocument-understandingkey-information-extractiontable-structure-recognition
198
5 个月前
https://static.github-zh.com/github_avatars/clovaai?size=40
clovaai / webvicob

#自然语言处理#Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023

document-ai自然语言处理OCR
Python 108
2 年前
https://static.github-zh.com/github_avatars/doc-analysis?size=40
doc-analysis / ReadingBank

#自然语言处理#ReadingBank: A Benchmark Dataset for Reading Order Detection

OCR自然语言处理document-understandingdocument-aidocument-intelligence
107
1 年前
https://static.github-zh.com/github_avatars/nttmdlab-nlp?size=40
nttmdlab-nlp / SlideVQA

#自然语言处理#SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

aaai2023机器视觉document-ai自然语言处理OCR
Python 92
4 个月前
https://static.github-zh.com/github_avatars/ZeningLin?size=40
ZeningLin / ViBERTgrid-PyTorch

An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"

key-information-extractiondocument-aiinformation-extractiondocument-analysis
Python 53
2 年前
https://static.github-zh.com/github_avatars/whn09?size=40
whn09 / table_structure_recognition

Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.

document-aitable-detectionOCRtabletable-structure-recognitionyolov5yolov8
Jupyter Notebook 47
1 年前
https://static.github-zh.com/github_avatars/googleapis?size=40
googleapis / python-documentai-toolbox

Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from ...

人工智能document-aiGoogle 云generative-aivertex-ai
Python 46
5 个月前
https://static.github-zh.com/github_avatars/DunnBC22?size=40
DunnBC22 / Vision_Audio_and_Multimodal_Projects

This repository includes all computer vision, audio, document AI, and multimodal projects.

audio-classification机器视觉document-aimultimodal-deep-learningoptical-character-recognitionobject-detectiontransfer-learningtransformers
Jupyter Notebook 45
1 年前
https://static.github-zh.com/github_avatars/ZeningLin?size=40
ZeningLin / PEneo

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

document-aidocument-understandingkey-information-extractionOCR
Python 36
4 个月前
https://static.github-zh.com/github_avatars/nttmdlab-nlp?size=40
nttmdlab-nlp / VDocRAG

#自然语言处理# [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents

机器视觉document-ai自然语言处理OCR
Python 32
2 个月前
https://static.github-zh.com/github_avatars/Unstructured-IO?size=40
Unstructured-IO / community

#计算机科学#Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

communitydata-pipeline深度学习document-aidocument-parsing机器学习nlp-parsingocr-pythonOpen Source
28
2 年前
https://static.github-zh.com/github_avatars/qyhou?size=40
qyhou / curated-table-structure-recognition

A curated list of resources on Table Structure Recognition

document-aidocument-intelligencetable-recognitiontable-structure-recognition
22
1 个月前
https://static.github-zh.com/github_avatars/SCUT-DLVCLab?size=40
SCUT-DLVCLab / RFUND

[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"

document-understandingdocument-aikey-information-extractionOCR
20
8 个月前
https://static.github-zh.com/github_avatars/chenxn2020?size=40
chenxn2020 / GOSE

[Paper] Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document"

document-airelation-extraction
Python 17
2 年前
https://static.github-zh.com/github_avatars/Shulk97?size=40
Shulk97 / daniel

#自然语言处理#This repository contain the implementation of DANIEL. (A fast Document Attention Network for Information Extraction and Labeling of handwritten documents)

机器视觉document-aimultimodal-pre-trained-model自然语言处理OCR
Python 15
20 天前
loading...