This is a public mirror of a bitbucket repository we're using for the Paul Allen Computing Challenge.
A fast and lightweight python-based CTC beam search decoder for speech recognition.
In defence of metric learning for speaker recognition
multi-task learning for text recognition with joint CTC-attention
Pytorch Bindings for warp-ctc
The official implementation of Achieving Cross Modal Generalization with Multimodal Unified Representation (NeurIPS '23)
Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
#计算机科学#Pytorch implementation of Learning Disentangled Representations via Mutual Information Estimation (ECCV 2020)
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
#面试#AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
0 条讨论