GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

multimodal-retrieval

Website
Wikipedia
https://static.github-zh.com/github_avatars/adithya-s-k?size=40
adithya-s-k / VARAG

Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine

multimodal-retrievalrag
Python 477
9 天前
https://static.github-zh.com/github_avatars/jolibrain?size=40
jolibrain / colette

#大语言模型#Multimodal RAG to search and interact locally with technical documents of any kind

大语言模型retrieval-augmented-generationsearchmultimodal-large-language-modelsmultimodal-retrievalvision-language-model
HTML 241
9 天前
https://static.github-zh.com/github_avatars/naver?size=40
naver / artemis

Official code release for ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity (published at ICLR 2022)

image-retrievalmultimodal-deep-learningmultimodal-retrieval
Python 52
2 年前
https://static.github-zh.com/github_avatars/JUNJIE99?size=40
JUNJIE99 / VISTA_Evaluation_FineTuning

Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original code and model can be accessed at FlagEmbedding.

multimodal-retrievalvision-language-model
Python 41
9 个月前
https://static.github-zh.com/github_avatars/TIBHannover?size=40
TIBHannover / cross-modal_entity_consistency

#计算机科学#This repository contains the dataset and source files to reproduce the results in the publication Müller-Budack et al. 2021: "Multimodal news analytics using measures of cross-modal entity and context...

multimodal-retrieval深度学习
Python 24
2 年前
https://static.github-zh.com/github_avatars/vikram-mm?size=40
vikram-mm / Multimodal-Image-Retrieval

Explores early fusion and late fusion approaches for Multimodal medical Image Retrieval

kmeansmultimodal-retrieval
Python 21
5 年前
https://static.github-zh.com/github_avatars/aimagelab?size=40
aimagelab / ReT

[CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval

embeddingsinformation-retrievalmultimodal-retrievalrecurrent-neural-networks
Python 18
4 个月前
https://static.github-zh.com/github_avatars/PanguIR?size=40
PanguIR / MRAGSurvey

A Survey of Multimodal Retrieval-Augmented Generation

large-language-models大语言模型multimodal-generationmultimodal-large-language-modelsmultimodal-retrieval
18
3 个月前
https://static.github-zh.com/github_avatars/sisinflab?size=40
sisinflab / Formal-MultiMod-Rec

Formalizing Multimedia Recommendation through Multimodal Deep Learning, accepted in ACM Transactions on Recommender Systems.

graph-neural-networksmultimodal-deep-learningPyTorchrecommender-systemreproducibilitymultimodal-retrieval
Python 13
1 年前
https://static.github-zh.com/github_avatars/Shuyu-XJTU?size=40
Shuyu-XJTU / CMP

The official code of "Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search"

multimodal-retrieval
Python 11
9 天前
https://static.github-zh.com/github_avatars/noagarcia?size=40
noagarcia / context-art-retrieval

Multimodal retrieval in art with context embeddings.

机器视觉artimage-retrievalmultimodal-retrieval
Python 11
4 年前
https://static.github-zh.com/github_avatars/sung-yeon-kim?size=40
sung-yeon-kim / GENIUS-CVPR25

Official Implementation of GENIUS: A Generative Framework for Universal Multimodal Search, CVPR 2025

multimodal-retrieval
Python 10
1 天前
https://static.github-zh.com/github_avatars/marialymperaiou?size=40
marialymperaiou / knowledge-enhanced-multimodal-learning

A list of research papers on knowledge-enhanced multimodal learning

image-text-matchingimage-text-retrievalknowledge-graphmultimodal-deep-learningmultimodal-retrievalvision-and-languagevision-and-language-pre-trainingvision-language-transformervisual-commonsense-reasoningvisual-question-answeringmulti-task-learning
7
3 年前
https://static.github-zh.com/github_avatars/marcomoldovan?size=40
marcomoldovan / multimodal-self-distillation

A generalized self-supervised training paradigm for unimodal and multimodal alignment and fusion.

multimodal-deep-learningPyTorchself-supervised-learningmultimodal-retrieval
Python 5
2 年前
https://static.github-zh.com/github_avatars/MMDocRAG?size=40
MMDocRAG / MMDocIR

The code used to train and run inference with MMDocIR

information-retrieval大语言模型multimodal-retrievalretrieval-augmented-generationvision-language-model
JavaScript 3
2 个月前
https://static.github-zh.com/github_avatars/aurooj?size=40
aurooj / VLM_SS

Mini-batch selective sampling for knowledge adaption of VLMs for mammography.

Medical imagingmultimodal-learningmultimodal-retrievalvision-and-languagevision-language-transformer
Jupyter Notebook 1
10 个月前
https://static.github-zh.com/github_avatars/TIBHannover?size=40
TIBHannover / iPatent

iPatent - Interactive Patent Search and Analysis

clusteringmultimodal-retrieval
Python 1
3 个月前
https://static.github-zh.com/github_avatars/catarinaopires?size=40
catarinaopires / eval-multimodal-medical-case-retrieval

#计算机科学#Evaluating dense model-based approaches for Multimodal Medical Case retrieval.

深度学习medicalmultimodal-retrieval
Python 0
8 个月前