GitHub Chinese Community
audio-visual-learning

ali-vilab / dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

audio-visual-learning, face-animation, talking-head, video-generation
Python 1.73k
1 year ago
tanshuai0219 / EDTalk

[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation

audio-visual-learning, face-animation, talking-face-generation, talking-head, video-generation
Python 416
6 months ago
OpenNLPLab / AVSBench

[ECCV 2022] & [IJCV 2024] Official implementation of the paper: Audio-Visual Segmentation (with Semantics)

audio-visual-learning
Python 397
7 months ago
xid32 / NAACL_2025_TWM

We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFMs). This plug-and-play module can be easily integrated into exi...

multimodal-large-language-models, audio-visual-learning, question-answering, video-captioning
Python 309
5 months ago
YapengTian / AVE-ECCV18

Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018

audio-visual-learning
Python 185
4 years ago
alvinliu0 / HA2G

[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"

audio-visual-learning, cvpr2022
Python 139
2 years ago
rhgao / co-separation

Co-Separating Sounds of Visual Objects (ICCV 2019)

audio-visual-learning, sound-separation, cross-modality
Python 95
2 years ago
yanbeic / CCL

[CVPR 2021] PyTorch implementation of the paper "Distilling Audio-Visual Knowledge by Compositional Contrastive Learning"

distillation, audio-visual-learning, cvpr2021, contrastive-learning, PyTorch, video-recognition
Python 87
4 years ago
ttgeng233 / UnAV

Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)

audio-visual-learning, multi-modal-learning
Python 63
1 year ago
roger-tseng / av-superb

A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)

audio-visual-learning, representation-learning
Python 51
1 year ago
praveena2j / JointCrossAttentional-AV-Fusion

ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition

affective-computing, attention-model, audio-visual-learning, emotion, emotion-recognition, multimodal-learning
Python 45
1 year ago
jasongief / PSP_CVPR_2021

[2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line

audio-visual-learning
Python 41
3 years ago
praveena2j / Joint-Cross-Attention-for-Audio-Visual-Fusion

IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"

affective-computing, attention, attention-model, audio-visual-learning, emotion-recognition, multimodal-learning
Python 38
7 months ago
MengyuanChen21 / CVPR2023-CMPAE

[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception

audio-visual-learning, cvpr2023, video-understanding
Python 35
2 years ago
stoneMo / AVGN

Official implementation for AVGN

audio-visual-learning
Python 34
2 years ago
stoneMo / EZ-VSL

Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)

self-supervised-learning, audio-visual-learning
Python 33
3 years ago
kyuyeonpooh / objects-that-sound

#Computer Science# An unofficial implementation of the paper "Objects that Sound" (ECCV 2018).

cross-modal-retrieval, deep-learning, audio-visual-learning
Python 32
1 year ago
stoneMo / DeepAVFusion

Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".

attention-mechanism, audio-visual-learning, multimodal-learning, self-supervised-learning, transformer-architecture, masked-image-modeling
Python 31
10 months ago
jasongief / CPSP

[2023 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line

audio-visual-learning
Python 29
2 years ago
praveena2j / Cross-Attentional-AV-Fusion

FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition

affective-computing, attention-model, audio-visual-learning, emotion, emotion-recognition, multimodal-learning
Python 29
7 months ago