Extensions to YAML syntax for better python interaction
StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disfluencies attenuated or eliminated.
Backend of anti-fraud system based on speaker identification technology. 基于声纹识别的反诈系统后端
Target speaker automatic speech recognition (TS-ASR)
Real-Time Speaker Diarization (SpeechBrain ECAPA-TDNN) & Speech-to-Text Demo (AZURE SPEECH SDK)
Incremental learning for automatic speech recognition (ASR)
#自然语言处理#Record voice, transcribe a prompt, picturize the prompt, create variations, get description of a celebrity and upload, other use cases on KB
A Streamlit web app for speaker diarization and identification in audio files. Upload or record audio, transcribe conversations, and automatically segment and label speakers using reference samples. T...
Implementation of different curriculum learning (CL) methods for speechbrain's ASR recipes.
#计算机科学#Processing EEG data using Speechbrain-MOABB and model tuning to get best results
pretrained SpeechBrain wav2vec seq2seq+CTC model trained on TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati
[Research] A Perceptual Loss Based Complex Neural Beamforming for AmbiX 3D Speech Enhancement
Speaker verification of virtual assistants using ECAPA-TDNN model from SpeechBrain toolkit and transfer learning approach emphasizing on inter and intra comparision (text independent and dependent).
AudioSpeakerVerification: FastAPI-based API for Speaker Matching and Verification using SpeechBrain. Compare and verify speaker identities from audio files.
#计算机科学#A Speech Recognition Framework for Banking Interactions using Convolutional Recurrent Dense Neural Networks and Language Models
Speech transcription and speech diarization
#计算机科学#Speech Emotion Recognition SE&R 2022
Dockerized Zeroc-ICE architecture processing voice commands from a Xamarin mobile application via an Automatic Speech Recognition (ASR) AI model using SpeechBrain.
Speech synthesis with conditioning on very small dataset. Using Nvidia's Tacotron2 and WaveGlow models with Pytorch.