GitHub Chinese Community
#vision-and-language-pre-training

salesforce / BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

vision-language, vision-and-language-pre-training, image-text-retrieval, image-captioning, visual-question-answering, vision-language-transformer
Jupyter Notebook, 5.31k stars
10 months ago
OFA-Sys / Chinese-CLIP

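#Natural Language Processing# This project is a Chinese version of the CLIP model, trained on large-scale Chinese data (~200 million image-text pairs). It aims to help users quickly carry out tasks in the Chinese domain such as image-text feature and similarity computation, cross-modal retrieval, and zero-shot image classification.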

Chinese, computer vision, multi-modal-learning, natural language processing, PyTorch, vision-and-language-pre-training, image-text-retrieval, clip, pretrained-models, vision-language, deep learning, multi-modal, contrastive-loss, transformers, coreml-models
Python, 5.28k stars
10 months ago
phellonchen / awesome-Vision-and-Language-Pre-training

Recent Advances in Vision and Language Pre-training (VLP)

vision-and-language-pre-training, vision-and-language, pretraining, multimodal-deep-learning
293 stars
2 years ago
zhjohnchan / awesome-vision-and-language-pretraining

A curated list of vision-and-language pre-training (VLP). :-)

multi-modal-learning, pre-training, vision-and-language-pre-training
59 stars
3 years ago
mala-lab / SIC-CADS

Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)

object-detection, open-vocabulary-detection, vision-and-language-pre-training, vision-language-model, foundation-models
Python, 25 stars
1 year ago
PrithivirajDamodaran / vision-language-modelling-series

Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations

multimodal-deep-learning, multimodal-interactions, vision-and-language, vision-and-language-pre-training
Jupyter Notebook, 14 stars
3 years ago
JianqiangWan / VLPT-STD

Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)

vision-and-language-pre-training, scene-text-detection, multimodal-deep-learning
12 stars
3 years ago
marialymperaiou / knowledge-enhanced-multimodal-learning

A list of research papers on knowledge-enhanced multimodal learning

image-text-matching, image-text-retrieval, knowledge-graph, multimodal-deep-learning, vision-and-language, vision-and-language-pre-training, vision-language-transformer, visual-commonsense-reasoning, visual-question-answering, multi-task-learning
7 stars
3 years ago
SHTUPLUS / GITM-MR

The official implementation for the ICCV 2023 paper "Grounded Image Text Matching with Mismatched Relation Reasoning".

vision-and-language, vision-language-model, vision-and-language-pre-training
Python, 6 stars
2 years ago
jyoung105 / koSigLIP

#Natural Language Processing# Korean version of CLIP, which achieves Korean cross-modal retrieval and representation generation.

computer vision, contrastive-loss, coreml-models, deep learning, image-text-retrieval, korean, multi-modal, multi-modal-learning, natural language processing, pretrained-models, PyTorch, transformers, vision-and-language-pre-training, vision-language
0 stars
7 months ago