#

audio-language

https://static.github-zh.com/github_avatars/OFA-Sys?size=40

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python 1.05 k
1 年前
https://static.github-zh.com/github_avatars/AudioLLMs?size=40
Python 742
3 个月前
https://static.github-zh.com/github_avatars/TXH-mercury?size=40

[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook 289
2 年前
https://static.github-zh.com/github_avatars/Sreyan88?size=40

Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Python 144
10 个月前
https://static.github-zh.com/github_avatars/Sreyan88?size=40

#自然语言处理#Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models

Python 19
1 年前
Website
Wikipedia