GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

voice-activity-detection

Website
Wikipedia
https://static.github-zh.com/github_avatars/modelscope?size=40
modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformerPyTorchspeech-recognitionparaformerpunctuationspeaker-diarizationrnntaudio-visual-speech-recognitionpretrained-modelvoice-activity-detectionWhisperdfsmnvadspeechgptspeechllm
Python 12.58 k
5 天前
noisetorch/NoiseTorch
https://static.github-zh.com/github_avatars/noisetorch?size=40
noisetorch / NoiseTorch

Real-time microphone noise suppression on Linux.

noise-reductionnoise-suppressionvoicevoice-activity-detectionvoice-activatedpulseaudioLinuxHacktoberfesthacktoberfest2023
Go 9.84 k
8 个月前
https://static.github-zh.com/github_avatars/pyannote?size=40
pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

PyTorchspeech-processingspeaker-diarizationvoice-activity-detectionpretrained-modelsspeaker-recognitionspeaker-verification
Jupyter Notebook 8.28 k
1 天前
https://static.github-zh.com/github_avatars/smacke?size=40
smacke / ffsubsync

自动化同步视频字幕,提升字幕编辑效率

subtitlesVideoaudioFFmpegvadfftsynchronizationsyncsubtitlecaptionsvlcvlc-media-playersrtsrt-subtitlesvoice-activity-detectionfast-fourier-transformalignmentcaption
Python 7.34 k
13 天前
https://static.github-zh.com/github_avatars/snakers4?size=40
snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-detectionvoice-recognitionvoice-commandsPyTorchonnxvoice-activity-detectionvoice-controlonnx-runtimeonnxruntimespeechspeech-processingvad
Python 6.83 k
20 天前
https://static.github-zh.com/github_avatars/jim-schwoebel?size=40
jim-schwoebel / voice_datasets

#数据仓库#🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

数据集datasetvoicedatavoice-controlvoice-synthesisvoice-commandsvoice-assistantvoice-recognitionvoice-chatvoice-activity-detectionvoice-conversionnoise
2.01 k
1 年前
https://static.github-zh.com/github_avatars/ricky0123?size=40
ricky0123 / vad

Voice activity detector (VAD) for the browser with a simple API

onnxruntimesilero-vadspeech-to-textTypeScriptvoice-activity-detectionWebweb-audio-api
TypeScript 1.59 k
10 天前
https://static.github-zh.com/github_avatars/k2-fsa?size=40
k2-fsa / sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, Lich...

Pythonspeech-recognitionC++asrCC#GoKotlinvadvoice-activity-detection
C++ 1.48 k
2 天前
juanmc2005/diart
https://static.github-zh.com/github_avatars/juanmc2005?size=40
juanmc2005 / diart

#计算机科学#A python package to build AI-powered real-time audio applications

speaker-diarizationstreaming-audioreal-time深度学习transcriptionvoice-activity-detection
Python 1.46 k
7 个月前
https://static.github-zh.com/github_avatars/TEN-framework?size=40
TEN-framework / ten-vad

Voice Activity Detection (VAD) : low-latency, high-performance and lightweight

conversational-aireal-timespeech-processingvadvoice-activity-detectionvoice-commandsvoice-recognitionaudioautomatic-speech-recognitionspeechsilero-vad
C 1.42 k
13 天前
https://static.github-zh.com/github_avatars/coqui-ai?size=40
coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

ttssttspeech-to-texttext-to-speechspeech-recognitionspeech-synthesisspeech-processingvoice-recognitionvoice-activity-detection声音克隆speech-separation
1.36 k
1 年前
https://static.github-zh.com/github_avatars/ggeop?size=40
ggeop / Python-ai-assistant

#自然语言处理#Python AI assistant 🧠

Pythonvoice-recognitionvoice-assistantvoice-controlvoice-activity-detectionvoice-chat自然语言处理voice-commands人工智能scikit-learnnltkgoogle-speech-to-textMongoDBpymongo
Python 991
10 个月前
https://static.github-zh.com/github_avatars/jtkim-kaist?size=40
jtkim-kaist / VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

vaddnnlstmattentionspeechdatavoice-detectionspeech-recognitionvoice-activity-detection
MATLAB 863
4 年前
https://static.github-zh.com/github_avatars/ina-foss?size=40
ina-foss / inaSpeechSegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

audio-analysisspeechmusicvoice-activity-detectionnoisesegmentationTransgender
Python 836
8 个月前
https://static.github-zh.com/github_avatars/amsehili?size=40
amsehili / auditok

An audio/acoustic activity detection and audio segmentation tool

voice-detectionvadvoice-activity-detection
Python 803
9 个月前
https://static.github-zh.com/github_avatars/iamsrikanthnani?size=40
iamsrikanthnani / pluely

#大语言模型#The Open Source Alternative to Cluely - A lightning-fast, privacy-first AI assistant that works seamlessly during meetings, interviews, and conversations without anyone knowing. Built with Tauri for n...

ai-assistantclaudedesktop-appgeminigrok大语言模型openaiReactRustshadcnspeech-to-textstealth-gameTailwind CSSTauriTypeScriptundetectablevoice-activity-detection
TypeScript 687
11 小时前
https://static.github-zh.com/github_avatars/FluidInference?size=40
FluidInference / FluidAudio

#IOS#Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

coremliOSmacOSspeaker-diarizationspeaker-identificationspeaker-recognitionSwiftaudioreal-timevadvoice-activity-detectionasrautomatic-speech-recognitionspeech-to-textaneNvidia
Swift 616
15 小时前
https://static.github-zh.com/github_avatars/baxtree?size=40
baxtree / subaligner

Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/

subtitlescaptionsalignmentsubripvoice-activity-detectiontmptranscription
Python 483
1 个月前
https://static.github-zh.com/github_avatars/shashikg?size=40
shashikg / WhisperS2T

#计算机科学#An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

asr深度学习speech-recognitionspeech-to-textWhispertensorrt-llmtensorrtvadvoice-activity-detection
Jupyter Notebook 465
1 年前
https://static.github-zh.com/github_avatars/gkonovalov?size=40
gkonovalov / android-vad

#安卓#Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

vadofflinereal-timeaudio-processingWebRTCAndroiddnnon-device-aisilero-vadneural-networksvoice-detection深度神经网络onnx-modelsvoice-activity-detection
C 408
2 个月前
loading...