GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

document-embedding

Website
Wikipedia
https://static.github-zh.com/github_avatars/ddangelov?size=40
ddangelov / Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors.

topic-modelingword-embeddingsdocument-embeddingtopic-vectortopic-searchtext-searchtext-semantic-similaritytopic-modellingsemantic-searchbertsentence-transformerspre-trained-language-models
Python 3.06 k
7 个月前
https://static.github-zh.com/github_avatars/dissorial?size=40
dissorial / doc-chatbot

Document chatbot — multiple files, topics, chat windows and chat history. Powered by GPT.

openaiTypeScriptgpt-3gpt-4langchainMongooseNextopenai-apichat聊天机器人document-embeddingpdf-processingpineconeReactTailwind CSSvectorization
TypeScript 8581
2 年前
https://static.github-zh.com/github_avatars/bobxwu?size=40
bobxwu / FASTopic

A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)

topic-modelingdocument-embeddingword-embeddings
Python 106
10 天前
https://static.github-zh.com/github_avatars/ddangelov?size=40
ddangelov / RESTful-Top2Vec

Expose a Top2Vec model with a REST API.

REST APIsemantic-searchsemantic-search-enginetopic-modelingdocument-embeddingword-embeddingtext-searchtext-similarityFastAPIrestful-api
Python 90
3 年前
https://static.github-zh.com/github_avatars/EQTPartners?size=40
EQTPartners / pause

#自然语言处理#🍊 PAUSE (Positive and Annealed Unlabeled Sentence Embedding), accepted by EMNLP'2021 🌴

自然语言处理sentence-embeddingsdocument-embeddingclassification-algorithmsimilarity-search
Python 25
1 年前
https://static.github-zh.com/github_avatars/samhavens?size=40
samhavens / flair-as-service

#自然语言处理#Container-first, JSON-configurable, NLP REST service based on Flair

自然语言处理word-embeddingsdocument-embeddingKubernetesDocker
Python 10
6 年前
https://static.github-zh.com/github_avatars/chen0040?size=40
chen0040 / java-text-embedding

Word embedding in Java

word-embeddingsdocument-embeddingglove
Java 7
4 年前
https://static.github-zh.com/github_avatars/marcomoldovan?size=40
marcomoldovan / hierarchical-language-modeling

#自然语言处理#We address the task of learning contextualized word, sentence and document representations with a hierarchical language model by stacking Transformer-based encoders on a sentence level and subsequentl...

自然语言处理natural-language-understandingtransformertransfer-learningattention-mechanismrepresentation-learningword-embeddingssentence-embeddings深度学习机器学习PyTorchdocument-embeddingdocument-retrievalinformation-retrievallanguage-model
Jupyter Notebook 7
2 年前
https://static.github-zh.com/github_avatars/maxoodf?size=40
maxoodf / tgnews

#自然语言处理#Telegram Data Clustering Contest (Bossy Gnu's submission )

C++自然语言处理word2vecdocument-embeddingTelegram
C++ 4
4 年前
https://static.github-zh.com/github_avatars/ehtisham-sadiq?size=40
ehtisham-sadiq / Exploring-Word2Vec-and-Doc2Vec

#自然语言处理#Dive into the world of Word2Vec and Doc2Vec models to uncover insights and applications.

document-embedding机器学习自然语言处理神经网络text-analysisword-embeddings
Jupyter Notebook 2
1 年前
https://static.github-zh.com/github_avatars/jdenes?size=40
jdenes / TopicEmbeddings

An open-source framework to create and test document embeddings using topic models.

topic-modelsdocument-embeddingembeddings
Python 1
5 年前
https://static.github-zh.com/github_avatars/Tobsky?size=40
Tobsky / DocuQuery

This Streamlit application demonstrates the integration of ChatGroq (Llama3 model), OpenAIEmbeddings, and FAISS for document embedding and retrieval.

document-embeddinggenerative-aigroqllama3openairag
Python 1
1 年前
https://static.github-zh.com/github_avatars/cnuahs?size=40
cnuahs / semantic-history-search

A Chrome extension to provide semantic search over your browsing history.

Angularbrave-extensionbrowsing-historyChrome 插件document-embeddinghuggingface-transformerslangchainlangchain-jsmanifest-v3semantic-search
TypeScript 0
4 个月前
https://static.github-zh.com/github_avatars/ShantamShukla?size=40
ShantamShukla / medicalrag

#自然语言处理#Medical Retrieval-Augmented Generation (RAG) Knowledge Base - A Next.js and LangChain-powered app that processes and stores medical documents as vector embeddings in Pinecone for efficient similarity ...

人工智能document-embeddinghuggingfacelangchainNext自然语言处理pineconeragvector-database
TypeScript 0
8 个月前
https://static.github-zh.com/github_avatars/ChiaraDiBonaventura?size=40
ChiaraDiBonaventura / covid_opinion

#自然语言处理#Applying NLP to understand people's sentiment about Covid-19 and Government actions in Italy, conditional on their political affiliation.

自然语言处理covid19-dataCOVID-19topic-modeling数据可视化clusteringdata-preprocessingdata-cleaningdocument-embedding数据分析matrix-factorization
Jupyter Notebook 0
4 年前
https://static.github-zh.com/github_avatars/stko-lab?size=40
stko-lab / LD-Connect

LD Connect: A Linked Data Portal for IOS Press Scientometrics

coreference-resolutiondocument-embeddingknowledge-graphlinked-dataRDF (Resource Description Framework)Semantic Websparql
JavaScript 0
3 年前
https://static.github-zh.com/github_avatars/pprablanc?size=40
pprablanc / doc_embedding_topic_mod

Improving document embedding with weighted average of word embedding through topic modeling

document-embeddingtopic-modeling
R 0
5 年前
https://static.github-zh.com/github_avatars/inimah?size=40
inimah / Neural-Language-Models

#自然语言处理#Experiments on Neural Language Embeddings

深度学习自然语言处理language-modelword-embeddingsdocument-embeddingsequence-to-sequenceclusteringbinary-classificationsemi-supervised-learning
Python 0
8 年前
https://static.github-zh.com/github_avatars/leyresv?size=40
leyresv / Book_Recommendation_System

#自然语言处理#Content-based book recommendation system

cosine-similarity自然语言处理document-embedding
Python 0
2 年前