GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

pdf-to-markdown

Website
Wikipedia
https://static.github-zh.com/github_avatars/run-llama?size=40
run-llama / llama_cloud_services

Knowledge Agents and Management in the Cloud

documentParsingpdfpdf-document-processorpptxstructured-datadocument-parserdocument-parsingdocx-to-markdownpdf-to-excelpdf-to-jsonpdf-to-textppt-to-jsontablesppt-to-markdownpdf-to-markdown
Python 4.01 k
5 天前
https://static.github-zh.com/github_avatars/wisupai?size=40
wisupai / e2m

#大语言模型#E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M o...

大语言模型Markdownpdf-to-markdown
Jupyter Notebook 1.09 k
9 个月前
https://static.github-zh.com/github_avatars/drmingler?size=40
drmingler / docling-api

Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is...

APIFastAPImarkdown-parserpdf-conversionpdf-converterpdf-parserpdf-parsingpdf-to-markdown
Python 617
3 个月前
https://static.github-zh.com/github_avatars/iamarunbrahma?size=40
iamarunbrahma / vision-parse

Parse PDFs into markdown using Vision LLMs

document-parserpdf-parserpdf-to-markdowntext-extraction
Python 386
4 个月前
https://static.github-zh.com/github_avatars/shoryasethia?size=40
shoryasethia / markdrop

#大语言模型#A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functio...

Open Sourcepypi-packageimage-to-text大语言模型pdf-to-markdownpdf-to-texttable-to-textagents
Python 118
3 个月前
https://static.github-zh.com/github_avatars/iamarunbrahma?size=40
iamarunbrahma / pdf-to-markdown

Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced info...

document-conversiondocument-processinginformation-retrievalpdf-parsingpdf-to-markdownPythonragretrieval-augmented-generationtext-extractionpdf-converter
Python 83
7 个月前
https://static.github-zh.com/github_avatars/drmingler?size=40
drmingler / smart-llm-loader

smart-llm-loader is a lightweight yet powerful Python package that transforms any document into LLM-ready chunks. Spend less time on preprocessing headaches and more time building what matters. From R...

聊天机器人chunkingclaudegeminilangchainllama-indexMarkdownopenaipdf-converterpdf-parserpdf-to-markdownrag
Python 65
4 个月前
https://static.github-zh.com/github_avatars/altaidevorg?size=40
altaidevorg / llm-food

#大语言模型#Serving files for hungry LLMs

gemini大语言模型pdf-to-markdowntext-extraction
Python 18
12 天前
https://static.github-zh.com/github_avatars/iw4p?size=40
iw4p / url-to-markdown

#大语言模型#URL to Markdown API is a service that convert web content into clean, structured Markdown format through a simple HTTP GET request. It's built using FastAPI and the MarkItDown library, offering a stra...

html-to-markdown大语言模型Markdownpdf-to-markdownvector
Python 17
4 个月前
https://static.github-zh.com/github_avatars/muchdogesec?size=40
muchdogesec / file2txt

Turn a supported list of filetypes (e.g. .docx) into a markdown structured text file. Also optionally defangs indicators and extract texts from images. Built for threat intel use-cases.

html-to-markdownimage-to-textMarkdownOCRpdf-to-markdown
Python 12
13 天前
https://static.github-zh.com/github_avatars/hparreao?size=40
hparreao / doclingconverter

Quick way to convert files (PDF, DOCX, HTML, PPTX, Images) to (MD, JSON, YAML) using Docling and Streamlit

markdown-converterpdf-converterpdf-to-jsonpdf-to-markdownStreamlit
Python 8
7 个月前
https://static.github-zh.com/github_avatars/gani114433?size=40
gani114433 / OCR_workflow

#自然语言处理#N8N OCR workflow

celerycomputational-linguisticscorpus-toolsmarkdown-parser自然语言处理OCROpenCVpdfpdf-filespdf-to-markdownrobotic-process-automationworkflow
4
11 天前
https://static.github-zh.com/github_avatars/iamarunbrahma?size=40
iamarunbrahma / rag-ingest

RAG-Ingest: A tool for converting PDFs to markdown and indexing them for enhanced Retrieval Augmented Generation (RAG) capabilities.

aws-s3hybrid-searchinformation-retrievalllamaindexollamapdf-to-markdownqdrantretrieval-augmented-generation
Python 3
7 个月前
https://static.github-zh.com/github_avatars/olegiv?size=40
olegiv / pdf_2_md

自动化命令行界面Markdownpdfpdf-to-markdownPythonsummarizationtoc
Python 1
2 个月前
https://static.github-zh.com/github_avatars/MansurPro?size=40
MansurPro / DocuParse

DocuParse is a high-performance tool for converting PDF documents into clean, structured Markdown files. Designed for speed and accuracy, it extracts and formats content while minimizing errors like h...

document-layout-analysisgoogle-colabhuggingface-transformerspdf-parsingpdf-to-markdowntesseract-ocrtext-extraction
1
6 个月前
https://static.github-zh.com/github_avatars/aidayang?size=40
aidayang / Marker-OneClick

PDF转Markdown软件Marker免安装一键启动整合包

pdf-to-jsonpdf-to-markdownPython
1
3 个月前
https://static.github-zh.com/github_avatars/Jarus77?size=40
Jarus77 / markdrop

#大语言模型# A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functi...

agentsimage-to-text大语言模型MarkdownOpen Sourcepdf-to-markdownpypi-packagetable-to-text
Python 1
2 个月前
https://static.github-zh.com/github_avatars/LatentSpaceIITB?size=40
LatentSpaceIITB / markdrop

A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functio...

Markdownpdf-to-markdown
Python 0
2 个月前
https://static.github-zh.com/github_avatars/credeed?size=40
credeed / credeed-pdf-to-markdown

Convert PDF to Markdown using AI, can be used for Agent to understand documents.

ai-agentdocument-processingpdf-to-markdownrisk-assessment
Python 0
2 个月前
https://static.github-zh.com/github_avatars/laurentvv?size=40
laurentvv / pdf2md-ai

#大语言模型#A powerful Python tool that extracts text and images from PDF documents and converts them to clean, well-formatted Markdown files

图像处理大语言模型pdf-processingpdf-to-markdownPython
Python 0
3 个月前
loading...