GitHub Chinese Community

#visual-instruction-tuning

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

instruction-tuning, instruction-following, large-vision-language-model, visual-instruction-tuning, multi-modality, in-context-learning, large-language-models, large-vision-language-models, multimodal-chain-of-thought, multimodal-in-context-learning, multimodal-large-language-models, chain-of-thought
15.53 k
3 days ago
CircleRadon / Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

mllm, sam, visual-instruction-tuning, pixel-understanding
Python 821
2 months ago
ictnlp / LLaVA-Mini

LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.

efficient, gpt4o, gpt4v, large-language-models, large-multimodal-models, llava, multimodal, Video, vision, vision-language-model, visual-instruction-tuning, llama, multimodal-large-language-models
Python 488
5 months ago
zjysteven / lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

finetuning, foundation-models, instruction-tuning, large language models, large-multimodal-models, multimodal, multimodal-large-language-models, vision-language, visual-instruction-tuning, llava
Python 300
4 months ago
BAAI-DCAI / DataOptim

#Large language models# A collection of visual instruction tuning datasets.

large language models, mllm, visual-instruction-tuning
Python 76
1 year ago
ChenDelong1999 / polite-flamingo

🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)

large-language-models, multimodal-large-language-models, visual-instruction-tuning
Python 64
2 years ago
bigai-nlco / VideoTGB

#Large language models# [EMNLP 2024] A Video Chat Agent with Temporal Prior

large language models, mllm, multimodal-large-language-models, spatial-temporal, visual-instruction-tuning
Python 31
3 months ago
fraction-ai / GAP

#Large language models# Gamified Adversarial Prompting (GAP): Crowdsourcing AI-weakness-targeting data through gamification. Boost model performance with community-driven, strategic data collection.

artificial intelligence, machine vision, visual-instruction-tuning, vqa, web3, large language models
Python 31
8 months ago
zjr2000 / REVERIE

[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models

dataset, multimodal-large-language-models, vision-language, visual-instruction-tuning
Python 16
1 year ago
mixpeek / awesome-multimodal-search

#Awesome# A collection of multimodal search libraries, services, and research papers

Awesome Lists, similarity-search, vector-search, vision-language-model, visual-instruction-tuning
12
2 months ago
jingyi0000 / Awesome-Visual-Instruction-Tuning

Visual Instruction Tuning towards General-Purpose Multimodal Model: A Survey

survey, visual-instruction-tuning
6
1 year ago
yueying-teng / generate-language-image-instruction-following-data

#Large language models# Mistral-assisted visual instruction data generation, following LLaVA

langchain, large language models, mistral, multimodal-learning, llava, vllm, visual-instruction-tuning
Python 1
4 months ago