微软VALL-E X 零样本语音合成模型的开源实现
EmoTa is an open-access Tamil Speech Emotion Recognition dataset with 936 utterances from 22 native speakers, covering five emotions (anger, happiness, sadness, fear, and neutrality). It supports emot...
ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)
A modification on the Sharif Emotional Speech Database
#计算机科学#TTS (FastPitch) for German (Thorsten voice / emotional)
#自然语言处理#An extensive collection of Speech Emotion Recognition (SER) datasets across multiple languages, including English, Mandarin, Hindi, Spanish, Tamil, Arabic, and more. Perfect for training emotion detec...
Applying deep learning to translate animation and re-generate audio.
EMOLIPS: TWO-LEVEL APPROACH FOR LIP-READING EMOTIONAL SPEECH
#大语言模型#A GUI program for chat with chatbot such as chatgpt.
Edison AT is AI emotional depression program. Developed using Python.
This is a project dedicated to the classification of emotional speech and was created in class with Prof. Dr. Burkhardt at Technische Universität Berlin.
This is a project dedicated to the classification of emotional speech and was created in class with Prof. Dr. Burkhardt at Technische Universität Berlin.
Edison AT is AI emotional depression program. Developed using Python.