GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

audio-language

Website
Wikipedia
https://static.github-zh.com/github_avatars/OFA-Sys?size=40
OFA-Sys / ONE-PEACE

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

foundation-modelsmultimodalrepresentation-learningvision-languageaudio-languagevision-and-languagevision-transformercontrastive-loss
Python 1.04 k
8 个月前
https://static.github-zh.com/github_avatars/AudioLLMs?size=40
AudioLLMs / Awesome-Audio-LLM

Audio Large Language Models

audio-languageaudio-processing
Python 552
14 天前
https://static.github-zh.com/github_avatars/TXH-mercury?size=40
TXH-mercury / VAST

[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

audio-languagedatasetvision-language
Jupyter Notebook 277
1 年前
https://static.github-zh.com/github_avatars/Sreyan88?size=40
Sreyan88 / GAMA

Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

audiodatasetquestion-answeringreasoningaudio-language大语言模型multimodal-large-language-models
Python 127
6 个月前
https://static.github-zh.com/github_avatars/Sreyan88?size=40
Sreyan88 / CompA

#自然语言处理#Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models

人工智能audiobenchmarkcompositionality机器学习自然语言处理audio-language
Python 18
1 年前