vad · GitHub Topics

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformer PyTorch speech-recognition paraformer punctuation speaker-diarization rnnt audio-visual-speech-recognition pretrained-model voice-activity-detection Whisper dfsmn vad speechgpt speechllm

Python 11.76 k

8 天前

smacke / ffsubsync

自动化同步视频字幕，提升字幕编辑效率

subtitles Video audio FFmpeg vad fft synchronization sync subtitle captions vlc vlc-media-player srt srt-subtitles voice-activity-detection fast-fourier-transform alignment caption

Python 7.27 k

11 天前

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-detection voice-recognition voice-commands PyTorch onnx voice-activity-detection voice-control onnx-runtime onnxruntime speech speech-processing vad

Python 6.39 k

2 个月前

CheshireCC / faster-whisper-GUI

faster_whisper GUI with PySide6

faster-whisper openai transcribe vad Whisper whisperx asr

Python 2.56 k

8 个月前

k2-fsa / sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, Lich...

Python speech-recognition C++asr C C#Go Kotlin vad voice-activity-detection

C++ 1.42 k

2 个月前

TEN-framework / ten-vad

Voice Activity Detector(VAD) from TEN: low-latency, high-performance and lightweight

conversational-ai real-time speech-processing vad voice-activity-detection voice-commands voice-recognition audio automatic-speech-recognition speech

C 1.04 k

10 天前

jtkim-kaist / VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

vad dnn lstm attention speech data voice-detection speech-recognition voice-activity-detection

MATLAB 862

4 年前

amsehili / auditok

An audio/acoustic activity detection and audio segmentation tool

voice-detection vad voice-activity-detection

Python 790

8 个月前

DmitryRyumin / ICASSP-2023-24-Papers

#人脸识别#ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...