information-retrieval

JaidedAI / EasyOCR

#Computer Science# Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.

OCR, deep learning, crnn, PyTorch, lstm, machine learning, scene-text, scene-text-recognition, optical-character-recognition, cnn, data-mining, image processing, Python, easyocr, information-retrieval
Python 26.94 k
9 months ago
deepset-ai / haystack

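#Natural Language Processing# Haystack is an open-source NLP framework built on pretrained Transformer models. It helps developers quickly build production-grade NLP applications for semantic search, question answering, summarization, and document ranking.

natural language processing, question-answering, PyTorch, semantic-search, information-retrieval, summarization, transformers, machine learning, artificial intelligence, Python, large-language-models, generative-ai, large language models, rag, retrieval-augmented-generation, agents, agent, gemini, gpt-4, orchestration
Python 21.15 k
2 days ago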
piskvorky / gensim

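#Natural Language Processing# Topic Modelling for Humans

gensim, topic-modeling, information-retrieval, machine learning, natural language processing, data science, Python, data-mining, word2vec, word-embeddings, neural networks, fasttext
Python 16.06 k
5 days ago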
arc53 / DocsGPT

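#Natural Language Processing# DocsGPT is a GPT-based chat assistant for documentation. It quickly searches project documentation so that developers can easily ask project-related questions and get accurate answers.

artificial intelligence, Python, natural language processing, React, Web app, ChatGPT, docsgpt, information-retrieval, language-model, large language models, machine learning, PyTorch, rag, semantic-search, transformers, Hacktoberfest
TypeScript 15.71 k
3 days ago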
weaviate / weaviate

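#Search# Weaviate is an open-source vector database that stores both objects and vectors. It combines vector search and structured filtering with the fault tolerance and scalability of a cloud-native database, all accessible through GraphQL, REST, and clients for various languages.

search engine, semantic-search, semantic-search-engine, vector-search, vector-search-engine, vector-database, approximate-nearest-neighbor-search, image-search, hnsw, information-retrieval, mlops, nearest-neighbor-search, neural-search, recommender-system, similarity-search, vectors, generative-search, hybrid-search, weaviate, gRPC
Go 13.64 k
10 hours ago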
onyx-dot-app / onyx

#Large Language Models# Gen-AI Chat for Teams: think ChatGPT if it had access to your team's unique knowledge.

enterprise-search, rag, ai-chat, ChatGPT, gen-ai, Next, Python, information-retrieval
Python 13.01 k
10 hours ago
Unstructured-IO / unstructured

#Natural Language Processing# Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to...

deep learning, document-parsing, machine learning, natural language processing, OCR, information-retrieval, data-pipelines, preprocessing, pdf-to-text, pdf, pdf-to-json, document-image-analysis, donut, document-image-processing, document-parser, docx, langchain, large language models
HTML 11.49 k
2 days ago
neuml / txtai

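#Search# All-in-one embeddings database for semantic search, LLM orchestration, and language model workflows

Python, search, machine learning, natural language processing, semantic-search, vector-search, txtai, large language models, vector-database, language-model, transformers, sentence-embeddings, large-language-models, information-retrieval, search engine, embeddings, retrieval-augmented-generation, rag, artificial intelligence
Python 11.08 k
3 days ago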
FlagOpen / FlagEmbedding

#Large Language Models# Retrieval and Retrieval-augmented LLMs

embeddings, information-retrieval, large language models, sentence-embeddings, text-semantic-similarity, retrieval-augmented-generation
Python 9.92 k
11 days ago
marqo-ai / marqo

#Search# Unified embedding generation and search engine. Also available on cloud: cloud.marqo.ai

deep learning, information-retrieval, machine learning, vector-search, tensor-search, clip, multi-modal, search engine, transformers, vision-language, semantic-search, visual-search, natural language processing, hnsw, knn, Hacktoberfest, ChatGPT, gpt, large-language-models
Python 4.89 k
5 hours ago
apache / lucene-solr

#Search# Apache Lucene and Solr have moved to their own separate repositories

lucene, solr, search, NoSQL, Java, backend, search engine, information-retrieval
Java 4.38 k
9 months ago
KittyKatt / screenFetch

Fetches system/theme information in terminal for Linux desktop screenshots.

Shell, Bash, Desktop, information-retrieval
Shell 3.98 k
6 months ago
langroid / langroid

#Large Language Models# Harness LLMs with Multi-Agent Programming

agents, ChatGPT, gpt, gpt-4, gpt4, language-model, large language models, llm-agent, multi-agent-systems, openai-api, artificial intelligence, llm-framework, llama, local-llm, function-calling, information-retrieval, rag, retrieval-augmented-generation
Python 3.4 k
2 days ago
catalyst-team / catalyst

#Natural Language Processing# Accelerated deep learning R&D

deep learning, reinforcement-learning, machine learning, computer vision, PyTorch, Python, distributed-computing, infrastructure, research, reproducibility, image processing, image-classification, image-segmentation, object-detection, natural language processing, text-classification, information-retrieval, recommender-system, metric-learning
Python 3.35 k
1 year ago
SylphAI-Inc / AdalFlow

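#Natural Language Processing# AdalFlow: The library to build & auto-optimize LLM applications.

agent, framework, large language models, rag, generative-ai, machine learning, natural language processing, Python, retriever, artificial intelligence, chatbot, information-retrieval, question-answering, summarization, bm25, faiss, reranker, optimizer, trainer, auto-prompting
Python 3.32 k
3 months ago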
apache / lucene

#Search# Apache Lucene is a full-text search engine written in Java

lucene, search, NoSQL, Java, backend, search engine, information-retrieval
Java 3.01 k
3 days ago
tensorflow / ranking

#Computer Science# Learning to Rank in TensorFlow

ranking, machine learning, deep learning, information-retrieval, learning-to-rank, recommender-systems
Python 2.77 k
1 year ago
embeddings-benchmark / mteb

MTEB: Massive Text Embedding Benchmark

benchmark, clustering, information-retrieval, sentence-transformers, sts, text-embedding, retrieval, neural-search, semantic-search, sbert, text-classification, reranking
Python 2.6 k
3 days ago
naiveHobo / InvoiceNet

#Computer Science# Deep neural network to extract intelligent information from invoice documents.

invoice, invoice-management, invoices, invoice-insight, classification, deep learning, deep neural networks, Keras, keras-tensorflow, keras-neural-networks, invoice-pdf, invoice-software, information-retrieval, information-extraction, billing
Python 2.6 k
1 year ago
ashvardanian / StringZilla

Up to 10x faster strings for C, C++, Python, Rust, Swift & Go, leveraging NEON, AVX2, AVX-512, SVE, & SWAR to accelerate search, hashing, sort, edit distances, and memory ops 🦖

simd, CSV, dataset, ndjson, string, string-manipulation, string-matching, substring, information-retrieval, sorting-algorithms, JSON, pattern-recognition, string-parsing, string-search, Parser, beautifulsoup, HTML
C 2.59 k
12 days ago