Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
An audio/acoustic activity detection and audio segmentation tool
#安卓#Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Gecko - A Tool for Effective Annotation of Human Conversations
A statistical model-based Voice Activity Detection
Efficient voice activity detection algorithm using long-term speech information
#计算机科学#Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.
#IOS#iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Spoofing voice detection : 2nd YAICON
#计算机科学#End to end AWS SageMaker application for detecting the AWS Polly voice in an audio recording using Gluon and MXNet.
this is a p5js experiment that uses voice detection and cursor movement to multiply creative content in a variety of colours
#计算机科学#TranscribeTube is a Python tool that transcribes and generates subtitles for videos from local files or YouTube links using Hugging Face models. It features an interactive Gradio web interface, allowi...
Config files for my GitHub profile.
using a simple convolution neural network to classify voices based on the existence of wake word
A database of challenging voice utterances collected by the Biometrics Vision and Computing (BVC) group.
#自然语言处理#Voice detection, wake words and voice commands on the ESP32-S3 microcontroller.
#计算机科学#DΞCIBΞLION is an audio intelligence module forged in the labs of OBINexus, where noise meets logic and shouting is a feature, not a bug. It mathematically analyzes human vocal input to determine emoti...
A Python project that handles speech commands and retrieves results from Google or Wikipedia based on the spoken input. Functions are organized in separate files, with a single raw file to execute the...