GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

vad

Website
Wikipedia
https://static.github-zh.com/github_avatars/modelscope?size=40
modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformerPyTorchspeech-recognitionparaformerpunctuationspeaker-diarizationrnntaudio-visual-speech-recognitionpretrained-modelvoice-activity-detectionWhisperdfsmnvadspeechgptspeechllm
Python 12.58 k
5 天前
https://static.github-zh.com/github_avatars/smacke?size=40
smacke / ffsubsync

自动化同步视频字幕,提升字幕编辑效率

subtitlesVideoaudioFFmpegvadfftsynchronizationsyncsubtitlecaptionsvlcvlc-media-playersrtsrt-subtitlesvoice-activity-detectionfast-fourier-transformalignmentcaption
Python 7.34 k
13 天前
https://static.github-zh.com/github_avatars/snakers4?size=40
snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-detectionvoice-recognitionvoice-commandsPyTorchonnxvoice-activity-detectionvoice-controlonnx-runtimeonnxruntimespeechspeech-processingvad
Python 6.83 k
19 天前
https://static.github-zh.com/github_avatars/CheshireCC?size=40
CheshireCC / faster-whisper-GUI

faster_whisper GUI with PySide6

faster-whisperopenaitranscribevadWhisperwhisperxasr
Python 2.65 k
9 个月前
https://static.github-zh.com/github_avatars/k2-fsa?size=40
k2-fsa / sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, Lich...

Pythonspeech-recognitionC++asrCC#GoKotlinvadvoice-activity-detection
C++ 1.48 k
2 天前
https://static.github-zh.com/github_avatars/TEN-framework?size=40
TEN-framework / ten-vad

Voice Activity Detection (VAD) : low-latency, high-performance and lightweight

conversational-aireal-timespeech-processingvadvoice-activity-detectionvoice-commandsvoice-recognitionaudioautomatic-speech-recognitionspeechsilero-vad
C 1.42 k
12 天前
https://static.github-zh.com/github_avatars/jtkim-kaist?size=40
jtkim-kaist / VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

vaddnnlstmattentionspeechdatavoice-detectionspeech-recognitionvoice-activity-detection
MATLAB 863
4 年前
https://static.github-zh.com/github_avatars/amsehili?size=40
amsehili / auditok

An audio/acoustic activity detection and audio segmentation tool

voice-detectionvadvoice-activity-detection
Python 803
9 个月前
https://static.github-zh.com/github_avatars/FluidInference?size=40
FluidInference / FluidAudio

#IOS#Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

coremliOSmacOSspeaker-diarizationspeaker-identificationspeaker-recognitionSwiftaudioreal-timevadvoice-activity-detectionasrautomatic-speech-recognitionspeech-to-textaneNvidia
Swift 616
10 小时前
https://static.github-zh.com/github_avatars/DmitryRyumin?size=40
DmitryRyumin / ICASSP-2023-24-Papers

#人脸识别#ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processi...

asrdenoisingdomain-adaptationface-recognitionlanguage-modelingself-supervised-learningsemantic-segmentationsignal-processingspeech-recognitionvadgenerative-modelsimage-generationmusic-generationmultimodal-learning
Python 501
4 个月前
https://static.github-zh.com/github_avatars/shashikg?size=40
shashikg / WhisperS2T

#计算机科学#An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

asr深度学习speech-recognitionspeech-to-textWhispertensorrt-llmtensorrtvadvoice-activity-detection
Jupyter Notebook 465
1 年前
https://static.github-zh.com/github_avatars/gkonovalov?size=40
gkonovalov / android-vad

#安卓#Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

vadofflinereal-timeaudio-processingWebRTCAndroiddnnon-device-aisilero-vadneural-networksvoice-detection深度神经网络onnx-modelsvoice-activity-detection
C 408
2 个月前
https://static.github-zh.com/github_avatars/gtreshchev?size=40
gtreshchev / RuntimeAudioImporter

Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.

虚幻引擎audio-filesmp3blueprintsaudio插件ue4-pluginaudio-playerue5ue5-pluginunreal-engine-5vadvoice-activity-detection
C++ 393
7 个月前
https://static.github-zh.com/github_avatars/filippogiruzzi?size=40
filippogiruzzi / voice_activity_detection

#计算机科学#Voice Activity Detection based on Deep Learning & TensorFlow

voice-activity-detection深度学习speechTensorflowtime-seriestime-series-classificationresnetspeech-recognitionPython机器学习vad人工智能深度神经网络
Python 368
2 年前
https://static.github-zh.com/github_avatars/Baidu-AIP?size=40
Baidu-AIP / speech-vad-demo

集成Webrtc的VAD,用于切分音频文件

WebRTCvadspeech
C 343
5 年前
https://static.github-zh.com/github_avatars/EtienneAb3d?size=40
EtienneAb3d / WhisperHallu

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

asrsound-processingtext-to-speechvadWhisperaudio-processingvocals
Python 340
10 个月前
https://static.github-zh.com/github_avatars/Picovoice?size=40
Picovoice / cobra

On-device voice activity detection (VAD) powered by deep learning

voice-activity-detectionspeech-recognitionvadon-device
Python 228
1 个月前
https://static.github-zh.com/github_avatars/xiongyihui?size=40
xiongyihui / python-webrtc-audio-processing

Python bindings of WebRTC Audio Processing

Pythonvadagcns
C++ 197
4 个月前
https://static.github-zh.com/github_avatars/eesungkim?size=40
eesungkim / Voice_Activity_Detector

A statistical model-based Voice Activity Detection

vadvoice-detectionvoice-activity-detection
Jupyter Notebook 192
7 年前
https://static.github-zh.com/github_avatars/asiff00?size=40
asiff00 / On-Device-Speech-to-Speech-Conversational-AI

This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and na...

asraudio-processingconversational-aikokoro-ttsollamattsvadvoice-assistant
Python 192
5 个月前
loading...