Extensions to YAML syntax for better python interaction
StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disfluencies attenuated or eliminated.
Backend of anti-fraud system based on speaker identification technology. 基于声纹识别的反诈系统后端
Target speaker automatic speech recognition (TS-ASR)
Incremental learning for automatic speech recognition (ASR)
#自然语言处理#Record voice, transcribe a prompt, picturize the prompt, create variations, get description of a celebrity and upload, other use cases on KB
Implementation of different curriculum learning (CL) methods for speechbrain's ASR recipes.
A Streamlit web app for speaker diarization and identification in audio files. Upload or record audio, transcribe conversations, and automatically segment and label speakers using reference samples. T...
#计算机科学#Processing EEG data using Speechbrain-MOABB and model tuning to get best results
Real-Time Speaker Diarization (SpeechBrain ECAPA-TDNN) & Speech-to-Text Demo (AZURE SPEECH SDK)
pretrained SpeechBrain wav2vec seq2seq+CTC model trained on TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati
[Research] A Perceptual Loss Based Complex Neural Beamforming for AmbiX 3D Speech Enhancement
Speaker verification of virtual assistants using ECAPA-TDNN model from SpeechBrain toolkit and transfer learning approach emphasizing on inter and intra comparision (text independent and dependent).
AudioSpeakerVerification: FastAPI-based API for Speaker Matching and Verification using SpeechBrain. Compare and verify speaker identities from audio files.
This project is a Voice Identification System built using Python, leveraging SpeechBrain and ECAPA-TDNN for speaker verification. The system identifies users by comparing their voice embeddings with s...
#计算机科学#A Speech Recognition Framework for Banking Interactions using Convolutional Recurrent Dense Neural Networks and Language Models
Speech transcription and speech diarization
#计算机科学#Speech Emotion Recognition SE&R 2022
Dockerized Zeroc-ICE architecture processing voice commands from a Xamarin mobile application via an Automatic Speech Recognition (ASR) AI model using SpeechBrain.