GitHub Chinese Community

pdf-processing

dissorial / doc-chatbot

Document chatbot — multiple files, topics, chat windows and chat history. Powered by GPT.

openai, TypeScript, gpt-3, gpt-4, langchain, Mongoose, Next, openai-api, chat, chatbot, document-embedding, pdf-processing, pinecone, React, Tailwind CSS, vectorization
TypeScript 8581
2 years ago
allenai / papermage

#Natural language processing# Library supporting NLP and CV research on scientific papers

computer vision, machine learning, multimodal, natural language processing, pdf-processing, scientific-papers, Python
Python 773
7 months ago
ahmedkhemiri95 / PDFs-TextExtract

Multiple and Large PDF Documents Text Extraction.

pdf, Parser, data science, Python, pdf-processing, extract-text, pdf-document, pypdf2, pdfs
Python 128
4 months ago
aws-samples / document-processing-pipeline-for-regulated-industries

#Computer science# A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata services.

machine learning, Amazon Web Services, cdk, aws-lambda, amazon-web-services, amazon-textract, amazon-dynamodb, amazon-s3, amazon-sqs, aws-cdk, pdf-processing, image processing, data-analytics, data-lineage, data-governance
Python 62
4 years ago
Govind-S-B / pdf-to-text-chroma-search

Python scripts that convert PDF files to text, split them into chunks, and store their vector representations using GPT4All embeddings in a Chroma DB. It also provides a script to query the Chroma ...

chromadb, pdf-processing, similarity-search, text-extraction
Python 23
2 years ago
ManasMadan / pdf-actions

An NPM package built on top of pdf-lib that provides functionality such as merging, rotating, splitting, downloading PDFs to disk, and many more...

pdf, pdf-merger, React, react-component, pdf-processing, pdf-lib, JavaScript, npm
JavaScript 13
2 years ago
ranguy9304 / LangGraphRAG

#Natural language processing# LangGraphRAG: A terminal-based Retrieval-Augmented Generation system using LangGraph. Features include message history caching, query transformation, and vector database retrieval. Ideal for NLP resea...

chatbot, information-retrieval, langgraph, natural language processing, openai-api, pdf-processing, Python, rag, vector-database, web-scraping
Python 9
1 year ago
ManasMadan / PDFActions

Built with pdf-actions NPM package.

React, pdf, react-components, react-component, pdf-merger, pdf-lib, pdf-processing
JavaScript 7
1 year ago
Inc44 / MaTools

An all-in-one GUI management toolkit built with PyQt6, offering a suite of tools for file synchronization, media organization, PDF merging, code formatting, and more.

application, audio-processing, GUI, image processing, OCR, pdf-processing, productivity, Python, Qt, Rust, speech-recognition, video-processing, youtube-downloader
Python 6
3 months ago
enesmanan / paper-bold

AI-powered RAG-based tool for summarizing, extracting insights, and answering questions about research papers with high accuracy

gemini-api, langchain, pdf-processing, rag, academic-paper
HTML 5
3 months ago
allanninal / document-summarizer

#Natural language processing# The Document Summarizer leverages Hugging Face’s facebook/bart-large-cnn model to transform lengthy documents into concise summaries. Built with ReactJS (Vite) for the frontend and Flask for the backe...

ai-tools, Flask, huggingface, natural language processing, pdf-processing, React, Vite
JavaScript 4
6 months ago
DioCrafts / ai-book-summarizer

#Natural language processing# 📚 AI-Powered Book PDF Knowledge Extractor & Summarizer. Transform your PDF books into structured knowledge effortlessly! This tool leverages AI to analyze books page by page, extracting key insights, ...

artificial intelligence, automation, document-analysis, knowledge-extraction, machine learning, Markdown, natural language processing, openai, pdf, pdf-processing, Python, study-materials, text-analysis
Python 4
5 months ago
Yardenrsk / PsychometryReceiverCV

A side project to easily get and annotate questions and answers to the PsychometryBot project DB using computer vision and pdf parsing

opencv-python, pandas, pdf-processing
Python 3
3 years ago
thinhuos0913 / python_useful_mini_projects

These are some useful mini projects that I worked on while teaching myself Python programming.

OCR, OpenCV, Python, image processing, pdf-processing
Python 3
1 year ago
Aleptonic / PdfSnipper

PdfSnipper is a lightweight and efficient Python package designed to simplify the management of PDF files, pages, and their conversions during various NLP, Computer Vision (CV), or other data processi...

pdf-processing, utilities
Python 3
4 months ago
setuc / pdf-annotation-with-azure-doc-intel

Azure Document Intelligence Result Processor: A toolset for annotating PDFs based on Azure Document Intelligence analysis results, featuring a React web application and a standalone Python script for ...

JavaScript, pdf-processing, Python, React, Vite
JavaScript 2
3 months ago
rithulkamesh / docproc

#Computer science# Opinionated and Sophisticated Document Region Analyzer.

pdf-processing, document-analysis, text-extraction, Python, OCR, machine learning, layout-analysis, content-extraction, text-classification, data-extraction, document-parsing
Python 2
2 months ago
Al-shwaib / Book-Preparation-for-Printing

A web application for preparing books and magazines for offset printing. Automatically arranges PDF pages for commercial A3 printing, supporting both Arabic (RTL) and English (LTR) books. تطبيق ويب ل...

flask-application, pdf-processing
Python 2
5 months ago
Farhaj499 / RAG_with_Weaviate_DB

This project implements a Retrieval Augmented Generation (RAG) system that answers questions based on the PDF document. It utilizes Weaviate as a vector database for efficient retrieval of relevant in...

agentic-ai, embeddings, huggingface-transformers, langchain, pdf-processing, Python, rag, retrieval-augmented-generation, semantic-search, vector-database, weaviate
Jupyter Notebook 2
5 months ago
arsath-eng / RAG1-NVIDIA-GENAI

#Large language models# A powerful Retrieval Augmented Generation (RAG) application built with NVIDIA AI endpoints and Streamlit. This solution enables intelligent document analysis and question-answering using state-of-the-...

document-analysis, embeddings, faiss, langchain, large language models, pdf-processing, question-answering, rag, Streamlit, vector-store
Python 2
8 months ago