GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

text-image-retrieval

Website
Wikipedia
alibaba/EasyNLP
https://static.github-zh.com/github_avatars/alibaba?size=40
alibaba / EasyNLP

#自然语言处理#EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

transformersbert自然语言处理pretrained-models深度学习PyTorchfewshot-learningknowledge-distillationknowledge-pretrainingtext-image-retrievaltext-to-image-synthesis机器学习text-classificationtransfer-learning
Python 2.16 k
8 个月前
https://static.github-zh.com/github_avatars/NVlabs?size=40
NVlabs / ODISE

#计算机科学#Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

深度学习instance-segmentationpanoptic-segmentationPyTorchsemantic-segmentationdiffusion-modelstext-image-retrievalzero-shot-learningopen-vocabulary-segmentation
Python 917
1 年前
https://static.github-zh.com/github_avatars/360CVGroup?size=40
360CVGroup / FG-CLIP

New generation of CLIP with fine grained discrimination capability, ICML2025

clipcross-modal-retrievaltext-image-retrieval
Python 257
4 天前
https://static.github-zh.com/github_avatars/xiaoyuan1996?size=40
xiaoyuan1996 / retrievalSystem

The back-end of cross-modal retrieval system,wihch will contain services such as semantic location .etc

remote-sensingtext-image-retrieval
Python 65
3 年前
https://static.github-zh.com/github_avatars/BIGBALLON?size=40
BIGBALLON / UME-Search

Toward Universal Multimodal Embedding

image-retrievalimage-searchlarge-language-modelstext-image-retrievalinformation-retrievalretrieval
Python 44
3 天前
https://static.github-zh.com/github_avatars/KimRass?size=40
KimRass / CLIP

PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k

clipmulti-modalzero-shot-classificationtext-image-retrieval
Python 12
1 年前
https://static.github-zh.com/github_avatars/haoxiangzhao12138?size=40
haoxiangzhao12138 / REIR

[ACMMM'25] Referring Expression Instance Retrieval and A Strong End-to-End Baseline

multimodal-deep-learningreferring-expression-comprehensiontext-image-retrieval
4
16 天前
https://static.github-zh.com/github_avatars/AIoT-Lab-BKAI?size=40
AIoT-Lab-BKAI / PIMA

#计算机科学#PIMA - A Novel Approach for Pill-Prescription Matching with GNN Assistance and Contrastive Learning

深度学习graph-neural-networkstext-image-retrieval
Jupyter Notebook 3
3 年前
https://static.github-zh.com/github_avatars/HTAnh2003?size=40
HTAnh2003 / LLM_Powered_Video_Search

The LLM-Powered Video Search System is an advanced multimodal video search solution that leverages Large Language Models (LLMs) to enhance video retrieval through text, image, and metadata queries.

clipDjangoDockerfaissmultimodalretrievalretrieval-augmented-generationtext-image-retrievalyolo
Jupyter Notebook 3
2 个月前
https://static.github-zh.com/github_avatars/MayssaJaz?size=40
MayssaJaz / Text2Image-Search

#搜索#A search engine, operating on the foundation of the OpenAI Clip Model to retrieve images corresponding to textual queries.

clipFastAPIopen-aiReact搜索引擎text-image-retrieval
Jupyter Notebook 1
1 年前
https://static.github-zh.com/github_avatars/lorenzo-stacchio?size=40
lorenzo-stacchio / Digimon_Dataset

#计算机科学#Digimon Dataset for MultiModal Machine Learning

clip深度学习image-generationtext-image-retrieval
Python 0
2 年前
https://static.github-zh.com/github_avatars/Chaouki-AI?size=40
Chaouki-AI / VisAlign

VisAlign: Aligning Visual Representations with Textual Semantics for Image Similarity and Retrieval

alignmentimage-retrievalimage-to-image-translationtext-image-retrieval
Jupyter Notebook 0
3 个月前