#document-understanding

infiniflow / ragflow

#Natural Language Processing# RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine built on deep document understanding

document-understanding, large language models, rag, table-structure-recognition, deep learning, document-parser, natural language processing, pdf-to-text, retrieval-augmented-generation, chatbot, agent, agents, graphrag, text2sql, ai-search, ChatGPT, deepseek, deepseek-r1, ollama, artificial intelligence
Python 55.11 k
2 days ago
deepdoctection / deepdoctection

#Natural Language Processing# A Repo For Document AI

document-parser, document-image-analysis, table-recognition, OCR, document-ai, document-understanding, Python, document-layout-analysis, table-detection, PyTorch, Tensorflow, layoutlm, natural language processing
Python 2.85 k
4 days ago
X-PLUG / mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

chart-understanding, document-understanding, mllm, multimodal, multimodal-large-language-models, table-understanding
Python 2.2 k
16 days ago
AlibabaResearch / AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

artificial intelligence, documentai, multimodal, multimodal-deep-learning, OCR, machine vision, vision-language-transformer, end-to-end-ocr, scene-text-detection, scene-text-detection-recognition, scene-text-recognition, text-detection, text-recognition, vision-language, document, document-analysis, document-recognition, document-understanding, document-intelligence, vision-language-model
C++ 1.73 k
2 months ago
tstanislawek / awesome-document-understanding

#Natural Language Processing# A curated list of resources for the Document Understanding (DU) topic

Awesome Lists, machine learning, information-extraction, key-information-extraction, document-understanding, robotic-process-automation, document-analysis, document-layout-analysis, OCR, natural language processing, deep learning, pdf, rpa, pdf-documents, document-intelligence, unstructured-data, document-ai
1.42 k
2 years ago
OpenBMB / VisRAG

Parsing-free RAG supported by VLMs

rag, retrieval, retrieval-augmented-generation, vision-language-model, multi-modal, multi-modality, document-retrieval, document-understanding
Python 732
4 months ago
wenwenyu / PICK-pytorch

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

key-information-extraction, document-analysis, graph-neural-networks, graph-learning, document-understanding
Python 563
1 year ago
jpWang / LiLT

#Natural Language Processing# Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

natural language processing, document-ai, document-analysis, document-understanding, information-extraction, multimodal-pre-trained-model
Python 352
3 years ago
GoogleCloudPlatform / document-ai-samples

#Computer Science# Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud

document-understanding, machine learning, OCR, pdf, Python, samples
Jupyter Notebook 280
1 month ago
SCUT-DLVCLab / Document-AI-Recommendations

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

document-ai, document-understanding, key-information-extraction, table-structure-recognition
193
4 months ago
MathamPollard / awesome-table-structure-recognition

A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.

table-detection, table-structure-recognition, table-extraction, table-functional-analysis, document-understanding
190
9 months ago
huggingface / chug

#Data Warehouse# Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

machine vision, datasets, distributed-training, document-understanding, multi-modal-learning, pdf-document
Python 157
1 year ago
Alpha-Innovator / DocGenome

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

document-understanding, question-answering
Jupyter Notebook 133
5 months ago
andreagemelli / doc2graph

#Natural Language Processing# Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.

deep learning, document-understanding, geometric-deep-learning, gnn, key-information-extraction, layout-analysis, natural language processing, table-detection, PyTorch
Jupyter Notebook 123
2 years ago
doc-analysis / ReadingBank

#Natural Language Processing# ReadingBank: A Benchmark Dataset for Reading Order Detection

OCR, natural language processing, document-understanding, document-ai, document-intelligence
105
10 months ago
LynnHaDo / Document-Layout-Analysis

Object Detection Model for Scanned Documents

document-understanding, object-detection, Python, yolov8
Jupyter Notebook 93
3 months ago
LynnHaDo / Checkbox-Detection

#Computer Science# Checkbox Detection Model for Scanned Documents

document-understanding, object-detection, Python, yolov8, machine vision, copy-paste, deep learning
Jupyter Notebook 76
3 months ago
microsoft / CompHRDoc

Datasets and Evaluation Scripts for CompHRDoc

document-understanding
Python 44
4 months ago
ZeningLin / PEneo

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

document-ai, document-understanding, key-information-extraction, OCR
Python 34
2 months ago
NExTplusplus / TAT-DQA

TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning

document-understanding, question-answering, vqa
23
9 months ago