GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

multimodal-pretraining

Website
Wikipedia
https://static.github-zh.com/github_avatars/baaivision?size=40
baaivision / Emu

Emu Series: Generative Multimodal Models from BAAI

foundation-modelsin-context-learninginstruct-tuningmultimodal-pretraininggenerative-pretraining-in-multimodalitymultimodal-generalist
Python 1.73 k
9 个月前
https://static.github-zh.com/github_avatars/Paranioar?size=40
Paranioar / Awesome_Matching_Pretraining_Transfering

#Awesome#The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insigh...

cross-modal-retrieval教程Awesome Listsimage-text-matchingimage-text-retrievallarge-language-modelslarge-vision-language-modelsmultimodal-pretrainingparameter-efficient-fine-tuningvision-and-languagemultimodal-large-language-models大语言模型text-to-image-generationtext-to-image-synthesistext-to-video-generation
423
6 个月前
https://static.github-zh.com/github_avatars/X-PLUG?size=40
X-PLUG / Youku-mPLUG

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

benchmark中文datasetmllmmultimodalmultimodal-large-language-modelsmultimodal-pretrainingVideovideo-question-answeringyouku
Python 297
1 年前
https://static.github-zh.com/github_avatars/X-PLUG?size=40
X-PLUG / mPLUG-2

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)

foundation-modelsmllmmultimodalmultimodal-pretrainingVideoimage-retrievalmplugvideo-question-answeringvqa
Python 227
2 年前