#Computer Science# End-to-End Speech Processing Toolkit
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...
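#Computer Science# A fork based on so-vits-svc4.0 (V1) that supports real-time inference and a graphical inference interface, and is compatible with its models.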
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...
#Getting Started# Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Zero-shot voice conversion and singing voice conversion, with real-time support
A simple, high-quality voice conversion tool focused on ease of use and performance.
#Data Warehouse# 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
#Computer Science# This is now the official location of the Merlin project.
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
#Computer Science# NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
A locally deployable AI voice toolbox | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion, etc.
The code for the bark-voicecloning model. Training and inference.
Unsupervised Speech Decomposition Via Triple Information Bottleneck
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
#Computer Science# Deep learning for audio processing