GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

document-understanding

Website
Wikipedia
https://static.github-zh.com/github_avatars/infiniflow?size=40
infiniflow / ragflow

#大语言模型#RAGFlow 是一款基于深度文档理解构建的开源 RAG(Retrieval-Augmented Generation)引擎

document-understanding大语言模型rag深度学习document-parserretrieval-augmented-generationagentgraphragai-searchdeepseekdeepseek-r1ollama人工智能agentic-aimcpopenaiagenticdeep-researchagentic-workflowmulti-agent
TypeScript 64.21 k
2 小时前
https://static.github-zh.com/github_avatars/deepdoctection?size=40
deepdoctection / deepdoctection

#自然语言处理#A Repo For Document AI

document-parserdocument-image-analysistable-recognitionOCRdocument-aidocument-understandingPythondocument-layout-analysistable-detectionPyTorchTensorflowlayoutlm自然语言处理
Python 2.95 k
1 天前
https://static.github-zh.com/github_avatars/X-PLUG?size=40
X-PLUG / mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

chart-understandingdocument-understandingmllmmultimodalmultimodal-large-language-modelstable-understanding
Python 2.25 k
3 个月前
https://static.github-zh.com/github_avatars/AlibabaResearch?size=40
AlibabaResearch / AdvancedLiterateMachinery

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

人工智能documentaimultimodalmultimodal-deep-learningOCR机器视觉vision-language-transformerend-to-end-ocrscene-text-detectionscene-text-detection-recognitionscene-text-recognitiontext-detectiontext-recognitionvision-languagedocumentdocument-analysisdocument-recognitiondocument-understandingdocument-intelligencevision-language-model
C++ 1.77 k
5 个月前
https://static.github-zh.com/github_avatars/tstanislawek?size=40
tstanislawek / awesome-document-understanding

#自然语言处理#A curated list of resources for Document Understanding (DU) topic

Awesome Lists机器学习information-extractionkey-information-extractiondocument-understandingrobotic-process-automationdocument-analysisdocument-layout-analysisOCR自然语言处理深度学习pdfrpapdf-documentsdocument-intelligenceunstructured-datadocument-ai
1.46 k
2 年前
https://static.github-zh.com/github_avatars/OpenBMB?size=40
OpenBMB / VisRAG

Parsing-free RAG supported by VLMs

ragretrievalretrieval-augmented-generationvision-language-modelmulti-modalmulti-modalitydocument-retrievaldocument-understanding
Python 783
7 个月前
https://static.github-zh.com/github_avatars/wenwenyu?size=40
wenwenyu / PICK-pytorch

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

key-information-extractiondocument-analysisgraph-neural-networksgraph-learningdocument-understanding
Python 568
1 年前
https://static.github-zh.com/github_avatars/jpWang?size=40
jpWang / LiLT

#自然语言处理#Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

自然语言处理document-aidocument-analysisdocument-understandinginformation-extractionmultimodal-pre-trained-model
Python 354
3 年前
https://static.github-zh.com/github_avatars/GoogleCloudPlatform?size=40
GoogleCloudPlatform / document-ai-samples

#计算机科学#Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud

document-understanding机器学习OCRpdfPythonsamples
Jupyter Notebook 288
3 天前
https://static.github-zh.com/github_avatars/MathamPollard?size=40
MathamPollard / awesome-table-structure-recognition

A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.

table-detectiontable-structure-recognitiontable-extractiontable-functional-analysisdocument-understanding
210
1 年前
https://static.github-zh.com/github_avatars/SCUT-DLVCLab?size=40
SCUT-DLVCLab / Document-AI-Recommendations

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

document-aidocument-understandingkey-information-extractiontable-structure-recognition
200
6 个月前
https://static.github-zh.com/github_avatars/huggingface?size=40
huggingface / chug

#数据仓库#Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

机器视觉数据集distributed-trainingdocument-understandingmulti-modal-learningpdf-document
Python 158
1 年前
https://static.github-zh.com/github_avatars/Alpha-Innovator?size=40
Alpha-Innovator / DocGenome

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

document-understandingquestion-answering
Jupyter Notebook 144
8 个月前
https://static.github-zh.com/github_avatars/andreagemelli?size=40
andreagemelli / doc2graph

#自然语言处理#Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.

深度学习document-understandinggeometric-deep-learninggnnkey-information-extractionlayout-analysis自然语言处理table-detectionPyTorch
Jupyter Notebook 130
2 年前
https://static.github-zh.com/github_avatars/doc-analysis?size=40
doc-analysis / ReadingBank

#自然语言处理#ReadingBank: A Benchmark Dataset for Reading Order Detection

OCR自然语言处理document-understandingdocument-aidocument-intelligence
109
1 年前
https://static.github-zh.com/github_avatars/LynnHaDo?size=40
LynnHaDo / Document-Layout-Analysis

Object Detection Model for Scanned Documents

document-understandingobject-detectionPythonyolov8
Jupyter Notebook 94
6 个月前
https://static.github-zh.com/github_avatars/LynnHaDo?size=40
LynnHaDo / Checkbox-Detection

#计算机科学#Checkbox Detection Model for Scanned Documents

document-understandingobject-detectionPythonyolov8机器视觉copy-paste深度学习
Jupyter Notebook 86
6 个月前
https://static.github-zh.com/github_avatars/microsoft?size=40
microsoft / CompHRDoc

Datasets and Evaluation Scripts for CompHRDoc

document-understanding
Python 49
7 个月前
https://static.github-zh.com/github_avatars/ZeningLin?size=40
ZeningLin / PEneo

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

document-aidocument-understandingkey-information-extractionOCR
Python 37
5 个月前
https://static.github-zh.com/github_avatars/NExTplusplus?size=40
NExTplusplus / TAT-DQA

TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning

document-understandingquestion-answeringvqa
24
1 年前
loading...