GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

docx

Website
Wikipedia
docling-project/docling
https://static.github-zh.com/github_avatars/docling-project?size=40
docling-project / docling

Get your documents ready for gen AI

人工智能convertdocumentspdftablesdocument-parserdocument-parsingdocxHTMLMarkdownpdf-converterpdf-to-jsonpdf-to-textpptxxlsx
Python 38.22 k
2 天前
QuivrHQ/MegaParse
https://static.github-zh.com/github_avatars/QuivrHQ?size=40
QuivrHQ / MegaParse

#大语言模型#File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

docx大语言模型Parserpdfpowerpoint
Python 7.15 k
7 个月前
https://static.github-zh.com/github_avatars/ArtifexSoftware?size=40
ArtifexSoftware / pdf2docx

Open source Python library for converting PDF to DOCX.

pdf-converterdocx
Python 3.09 k
3 个月前
https://static.github-zh.com/github_avatars/jesselau76?size=40
jesselau76 / ebook-GPT-translator

Enjoy reading with your favorite style.

epubpdfPythontranslationtranslatordocxmobi
Python 1.68 k
2 年前
shcherbak-ai/contextgem
https://static.github-zh.com/github_avatars/shcherbak-ai?size=40
shcherbak-ai / contextgem

#自然语言处理#ContextGem: Effortless LLM extraction from documents

人工智能data-extractiondocument-intelligencegenerative-ailegaltech大语言模型llm-framework自然语言处理prompt-engineeringtext-analysisunstructured-datadocx
Python 1.49 k
9 小时前
https://static.github-zh.com/github_avatars/SkywalkerDarren?size=40
SkywalkerDarren / chatWeb

#网络爬虫#ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.

ChatGPTembeddinggpt-35-turboopenaipgvectorPostgreSQLvector-databasefaiss人工智能gpt爬虫docxpdf
Python 907
1 年前
https://static.github-zh.com/github_avatars/explosion?size=40
explosion / spacy-layout

#自然语言处理#📚 Process PDFs, Word documents and more with spaCy

docxgenerative-ai自然语言处理pdfpdf-converterragspaCydocument-layout-analysis
Python 741
6 个月前
https://static.github-zh.com/github_avatars/nolze?size=40
nolze / msoffcrypto-tool

#安全#Python tool and library for decrypting and encrypting MS Office files using passwords or other keys

ooxmldocxdocencryptiondecryption命令行界面xlsxxlspptpptx
Python 595
7 个月前
https://static.github-zh.com/github_avatars/ispras?size=40
ispras / dedoc

Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic...

docdocxodtdocumentsexcelpdftxtOCRscanned-documentstable-recognitionHTMLhtml-parserpdf-parserdocument-analysis
Python 592
1 天前
https://static.github-zh.com/github_avatars/superstarryeyes?size=40
superstarryeyes / lue

Terminal eBook Reader with Text-to-Speech

book命令行界面docdocxebookepubmodularreader终端text-to-speechttstxtvoicepdf
Python 406
1 天前
https://static.github-zh.com/github_avatars/Lin-jun-xiang?size=40
Lin-jun-xiang / docGPT-langchain

#大语言模型#🔐Free GPT-3.5 chat with your docs (PDF, WORD, CSV, TXT)

chatpdflangchain大语言模型PythonStreamlitgpt4freeChatGPTgptopenaidocxpdfwordgpt3gpt4CSVtxtgpt-3rag
Python 254
2 年前
https://static.github-zh.com/github_avatars/nabilanavab?size=40
nabilanavab / ilovepdf

Telegram Bot that helps you to convert Images to pdf, pdf to images, 45+ file formats to pdf, more features Soon..

TelegrampillowpdfHerokuPythondocdocxpyrogrampyrogram-bot
Python 236
9 个月前
https://static.github-zh.com/github_avatars/AstraBert?size=40
AstraBert / PdfItDown

Convert Everything to PDF

CSVdocxHTMLJSONMarkdownpackagepdfpdf-conversionpowerpointpypiPythonXML
Python 162
4 个月前
https://static.github-zh.com/github_avatars/foliant-docs?size=40
foliant-docs / foliant

Comprehensive markdown-based documentation toolkit

文档LaTeXMarkdownPythondocx
Python 160
1 年前
https://static.github-zh.com/github_avatars/houking-can?size=40
houking-can / PDFConverter

Best PDF Converter! PDF to any format, pdf2word/excel/xml/html/txt...

table-extractiondocx
Python 156
4 年前
https://static.github-zh.com/github_avatars/greyovo?size=40
greyovo / markdocx

Convert Markdown to Word (.docx). / 将 markdown 文件转换为 Word(.docx)

Markdowndocxword
Python 98
4 年前
https://static.github-zh.com/github_avatars/pqzx?size=40
pqzx / html2docx

Convert html to docx

docxHTMLPython
Python 82
1 年前
https://static.github-zh.com/github_avatars/JSv4?size=40
JSv4 / Python-Redlines

Docx tracked change redlines for the Python ecosystem.

changelogdifferencedocx
Python 80
1 年前
https://static.github-zh.com/github_avatars/neka-nat?size=40
neka-nat / mineru-api

MinerU API server

docxMarkdownpdfxlsx
Python 69
9 个月前
https://static.github-zh.com/github_avatars/hj-long?size=40
hj-long / get_taobao_data

python: selenium + sqlite3 爬虫,实现将淘宝网站数据、1688网站数据的爬取,淘宝爬虫\1688爬虫;并保存到数据库中

docxselenium-pythonsqlite3
Python 68
2 年前
loading...