GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

pdf-to-markdown

Website
Wikipedia
https://static.github-zh.com/github_avatars/run-llama?size=40
run-llama / llama_cloud_services

Knowledge Agents and Management in the Cloud

documentParsingpdfpdf-document-processorpptxstructured-datadocument-parserdocument-parsingdocx-to-markdownpdf-to-excelpdf-to-jsonpdf-to-textppt-to-jsontablesppt-to-markdownpdf-to-markdown
TypeScript 4.07 k
14 小时前
https://static.github-zh.com/github_avatars/wisupai?size=40
wisupai / e2m

#大语言模型#E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M o...

大语言模型Markdownpdf-to-markdown
Jupyter Notebook 1.2 k
1 年前
https://static.github-zh.com/github_avatars/drmingler?size=40
drmingler / docling-api

Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is...

APIFastAPImarkdown-parserpdf-conversionpdf-converterpdf-parserpdf-parsingpdf-to-markdown
Python 640
5 个月前
https://static.github-zh.com/github_avatars/iamarunbrahma?size=40
iamarunbrahma / vision-parse

Parse PDFs into markdown using Vision LLMs

document-parserpdf-parserpdf-to-markdowntext-extraction
Python 408
6 个月前
https://static.github-zh.com/github_avatars/shoryasethia?size=40
shoryasethia / markdrop

#大语言模型#A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functio...

Open Sourcepypi-packageimage-to-text大语言模型pdf-to-markdownpdf-to-texttable-to-textagents
Python 135
1 个月前
https://static.github-zh.com/github_avatars/iamarunbrahma?size=40
iamarunbrahma / pdf-to-markdown

Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced info...

document-conversiondocument-processinginformation-retrievalpdf-parsingpdf-to-markdownPythonragretrieval-augmented-generationtext-extractionpdf-converter
Python 88
8 个月前
https://static.github-zh.com/github_avatars/drmingler?size=40
drmingler / smart-llm-loader

smart-llm-loader is a lightweight yet powerful Python package that transforms any document into LLM-ready chunks. Spend less time on preprocessing headaches and more time building what matters. From R...

聊天机器人chunkingclaudegeminilangchainllama-indexMarkdownopenaipdf-converterpdf-parserpdf-to-markdownrag
Python 67
6 个月前
https://static.github-zh.com/github_avatars/iw4p?size=40
iw4p / url-to-markdown

#大语言模型#URL to Markdown API is a service that convert web content into clean, structured Markdown format through a simple HTTP GET request. It's built using FastAPI and the MarkItDown library, offering a stra...

html-to-markdown大语言模型Markdownpdf-to-markdownvector
Python 27
5 个月前
https://static.github-zh.com/github_avatars/altaidevorg?size=40
altaidevorg / llm-food

#大语言模型#Serving files for hungry LLMs

gemini大语言模型pdf-to-markdowntext-extraction
Python 22
2 个月前
https://static.github-zh.com/github_avatars/muchdogesec?size=40
muchdogesec / file2txt

Turn a supported list of filetypes (e.g. .docx) into a markdown structured text file. Also optionally defangs indicators and extract texts from images. Built for threat intel use-cases.

html-to-markdownimage-to-textMarkdownOCRpdf-to-markdown
Python 12
1 个月前
https://static.github-zh.com/github_avatars/hparreao?size=40
hparreao / doclingconverter

Quick way to convert files (PDF, DOCX, HTML, PPTX, Images) to (MD, JSON, YAML) using Docling and Streamlit

markdown-converterpdf-converterpdf-to-jsonpdf-to-markdownStreamlit
Python 10
23 天前
https://static.github-zh.com/github_avatars/gani114433?size=40
gani114433 / OCR_workflow

#自然语言处理#N8N OCR workflow

celerycomputational-linguisticscorpus-toolsmarkdown-parser自然语言处理OCROpenCVpdfpdf-filespdf-to-markdownrobotic-process-automationworkflow
6
14 天前
https://static.github-zh.com/github_avatars/iamarunbrahma?size=40
iamarunbrahma / rag-ingest

RAG-Ingest: A tool for converting PDFs to markdown and indexing them for enhanced Retrieval Augmented Generation (RAG) capabilities.

aws-s3hybrid-searchinformation-retrievalllamaindexollamapdf-to-markdownqdrantretrieval-augmented-generation
Python 4
8 个月前
https://static.github-zh.com/github_avatars/NanoNets?size=40
NanoNets / llm-data-converter

Convert any document format into LLM-ready data format (markdown) with advanced intelligent document processing capabilities powered by pre-trained models.

document-conversionhtml-to-markdownlayout-detectionMarkdownpdf-to-markdownppt-to-markdowntext-extraction
Python 3
2 天前
https://static.github-zh.com/github_avatars/olegiv?size=40
olegiv / pdf_2_md

自动化命令行界面Markdownpdfpdf-to-markdownPythonsummarizationtoc
Python 1
4 个月前
https://static.github-zh.com/github_avatars/Priyanshu845438?size=40
Priyanshu845438 / PDF-MarkDown_Converter

Python-based PDF to Markdown converter | Extracts clean Markdown from PDFs with OCR, tables, code and layout support

人工智能cli-toolmarkdown-converterOCRpdf-converterpdf-to-markdownPythontext-extraction
Python 1
7 天前
https://static.github-zh.com/github_avatars/MansurPro?size=40
MansurPro / DocuParse

DocuParse is a high-performance tool for converting PDF documents into clean, structured Markdown files. Designed for speed and accuracy, it extracts and formats content while minimizing errors like h...

document-layout-analysisgoogle-colabhuggingface-transformerspdf-parsingpdf-to-markdowntesseract-ocrtext-extraction
Python 1
19 天前
https://static.github-zh.com/github_avatars/aidayang?size=40
aidayang / Marker-OneClick

PDF转Markdown软件Marker免安装一键启动整合包

pdf-to-jsonpdf-to-markdownPython
1
4 个月前
https://static.github-zh.com/github_avatars/Jarus77?size=40
Jarus77 / markdrop

#大语言模型# A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functi...

agentsimage-to-text大语言模型MarkdownOpen Sourcepdf-to-markdownpypi-packagetable-to-text
Python 1
4 个月前
https://static.github-zh.com/github_avatars/LatentSpaceIITB?size=40
LatentSpaceIITB / markdrop

A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functio...

Markdownpdf-to-markdown
Python 0
4 个月前
loading...