#speech-processing

speechbrain / speechbrain

#Computer Science# A PyTorch-based Speech Toolkit

speech-recognition, speech-toolkit, speaker-recognition, speech-to-text, speech-enhancement, speech-separation, audio, audio-processing, speech-processing, speechrecognition, asr, voice-recognition, speaker-diarization, speaker-verification, PyTorch, huggingface, transformers, language-model, deep-learning
Python 9.98 k
5 days ago
pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

PyTorch, speech-processing, speaker-diarization, voice-activity-detection, pretrained-models, speaker-recognition, speaker-verification
Jupyter Notebook 7.69 k
2 days ago
pliang279 / awesome-multimodal-ml

#Natural Language Processing# Reading list for research topics in multimodal machine learning

multimodal-learning, machine-learning, representation-learning, natural-language-processing, computer-vision, speech-processing, Robotics, healthcare, reading-list, deep-learning, reinforcement-learning
6.5 k
10 months ago
snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-detection, voice-recognition, voice-commands, PyTorch, onnx, voice-activity-detection, voice-control, onnx-runtime, onnxruntime, speech, speech-processing, vad
Python 6.05 k
5 days ago
microsoft / torchscale

#Natural Language Processing# Foundation Architecture for (M)LLMs

computer-vision, machine-learning, multimodal, natural-language-processing, pretrained-language-model, speech-processing, transformer, translation
Python 3.08 k
1 year ago
linto-ai / whisper-timestamped

#Computer Science# Multilingual Automatic Speech Recognition with word-level timestamps and confidence

deep-learning, speech, speech-recognition, speech-to-text, asr, machine-learning, Python, PyTorch, attention-is-all-you-need, attention-mechanism, attention-model, speaker-diarization, speech-processing, transformers, Whisper
Python 2.45 k
3 months ago
r9y9 / wavenet_vocoder

WaveNet vocoder

wavenet, speech-synthesis, speech-processing, PyTorch, Python, neural-vocoder, speech
Python 2.36 k
2 years ago
r9y9 / deepvoice3_pytorch

#Computer Science# PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

tts, speech-synthesis, end-to-end, speech-processing, machine-learning, PyTorch, Python, multi-speaker
Python 1.98 k
1 year ago
resemble-ai / resemble-enhance

AI powered speech denoising and enhancement

denoise, speech-denoising, speech-enhancement, speech-processing
Python 1.84 k
6 months ago
wq2012 / awesome-diarization

#Awesome# A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

speaker-diarization, Awesome Lists, machine-learning, speech-recognition, speech-processing, deep-learning
1.76 k
8 months ago
DigitalPhonetics / IMS-Toucan

#Computer Science# Controllable and fast Text-to-Speech for over 7000 languages!

text-to-speech, toolkit, speech-synthesis, deep-learning, speech-processing, tts, PyTorch, speech
Python 1.61 k
25 days ago
coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

tts, stt, speech-to-text, text-to-speech, speech-recognition, speech-synthesis, speech-processing, voice-recognition, voice-activity-detection, voice-cloning, speech-separation
1.34 k
1 year ago
mravanelli / SincNet

#Computer Science# SincNet is a neural architecture for efficiently processing raw audio samples.

deep-learning, audio, waveform, filtering, cnn, convolutional-neural-networks, speaker-recognition, speaker-verification, speaker-identification, speech-recognition, asr, audio-processing, speech-processing, digital-signal-processing, signal-processing, neural-networks, artificial-intelligence, timit, PyTorch, Python
Python 1.18 k
4 years ago
haoheliu / voicefixer

General Speech Restoration

speech-processing, speech-synthesis, speech-enhancement, speech-analysis, speech, tts, denoise, super-resolution, vocoder
Python 1.16 k
4 months ago
midas-research / audino

#Data Warehouse# Open source audio annotation tool for humans

audio-processing, speech-processing, machine-learning, annotation-tool, audio-annotation, Python, dataset
JavaScript 1.1 k
4 months ago
ictnlp / StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

speech, speech-recognition, speech-synthesis, speech-to-text, speech-translation, translation, all-in-one, machine-translation, streaming-audio, text-to-speech, asr, tts, voice, text-to-audio, non-autoregressive, speech-enhancement, audio-processing, speech-processing
Python 1.09 k
10 months ago
X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

audio-processing, large-language-model, multimodal-large-language-models, peft, speech-processing
Python 826
2 months ago
Ryuk17 / SpeechAlgorithms

You can find the speech algorithms you want here

speech-processing
C 809
5 months ago
nanahou / Awesome-Speech-Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

speech-enhancement, speech-processing, signal-processing, deep-neural-networks, machine-learning
MATLAB 770
5 years ago
nyrahealth / CrisperWhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

asr, audio, detection, recognition, speech, speech-recognition, transcription, Whisper, speech-processing
Python 734
12 days ago