GitHub Chinese Community

©2025 GitHub Chinese Community

#speech-recognition

huggingface / transformers

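#Natural Language Processing# State-of-the-art natural language processing for JAX, PyTorch, and TensorFlow

natural language processing, PyTorch, pytorch-transformers, transformer, model-hub, pretrained-models, speech-recognition, Hacktoberfest, Python, machine learning, deep learning, audio, deepseek, gemma, glm, large language models, qwen, vlm
Python 145.62 k
2 hours ago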
ggml-org / whisper.cpp

A C++ port of OpenAI's Whisper speech recognition model.

openai, speech-to-text, transformer, Whisper, inference, speech-recognition
C++ 40.79 k
2 days ago
mozilla / DeepSpeech

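#Computer Science# DeepSpeech is an open-source embedded (offline, on-device) speech recognition engine that can run on devices as small as a Raspberry Pi.

deep learning, machine learning, neural-networks, Tensorflow, speech-recognition, speech-to-text, deepspeech, embedded, on-device, offline
C++ 26.44 k
9 months ago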
SYSTRAN / faster-whisper

#Computer Science# Faster Whisper transcription with CTranslate2

deep learning, inference, quantization, speech-recognition, speech-to-text, transformer, Whisper, openai
Python 16.55 k
13 days ago
leon-ai / leon

🧠 Leon is your open-source personal assistant.

leon, personal-assistant, Node.js, Python, artificial intelligence, speech-to-text, text-to-speech, speech-recognition, speech-synthesis, flite, assistant, virtual-assistant, chatbot, Bot, voice-assistant, automation, offline, privacy, ai-assistant
TypeScript 16.37 k
21 days ago
m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asr, speech, speech-recognition, speech-to-text, Whisper
Python 16.26 k
7 days ago
kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

kaldi, C++, CUDA, Shell, speech-recognition, speech-to-text, speaker-verification, speaker-id, speech
Shell 14.9 k
2 months ago
NVIDIA / DeepLearningExamples

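#Natural Language Processing# Deep learning examples

computer vision, deep learning, drug-discovery, forecasting, large-language-models, mxnet, paddlepaddle, PyTorch, recommender-systems, speech-recognition, speech-synthesis, Tensorflow, tensorflow2, translation, natural language processing
Jupyter Notebook 14.33 k
10 months ago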
kmario23 / deep-learning-drizzle

#Natural Language Processing# Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

machine learning, deep learning, deep neural networks, pattern-recognition, computer vision, optimization, visual-recognition, reinforcement-learning, deep-reinforcement-learning, natural language processing, artificial-neural-networks, artificial-intelligence-algorithms, bayesian-statistics, speech-recognition, graph-neural-networks, Medical imaging, geometric-deep-learning, explainable-ai, probability
HTML 12.59 k
8 months ago
alphacep / vosk-api

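#Android# Vosk is an offline speech recognition toolkit. It supports Python, Java, Node.JS, C#, and C++, and recognizes 20+ languages, including Chinese, English, and French.

speech-recognition, asr, voice-recognition, speech-to-text, Android, iOS, Raspberry Pi, deep learning, deep neural networks, speech-to-text-android, speaker-identification, speaker-verification, Python, offline, privacy, kaldi, deepspeech, google-speech-to-text, vosk, stt
Jupyter Notebook 12.06 k
1 month ago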
PaddlePaddle / PaddleSpeech

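PaddleSpeech is an open-source speech model library built on PaddlePaddle for developing a range of key speech and audio tasks. Typical applications include speech recognition, speech translation, and speech synthesis.

transformer, conformer, speech-translation, streaming-asr, speech-alignment, punctuation-restoration, streaming-tts, speech-synthesis, tts, asr, speech-recognition, voice cloning, vocoder, voice-recognition, self-supervised-learning, Whisper
Python 11.99 k
5 days ago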
modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

conformer, PyTorch, speech-recognition, paraformer, punctuation, speaker-diarization, rnnt, audio-visual-speech-recognition, pretrained-model, voice-activity-detection, Whisper, dfsmn, vad, speechgpt, speechllm
Python 11 k
19 days ago
speechbrain / speechbrain

#Computer Science# A PyTorch-based Speech Toolkit

speech-recognition, speech-toolkit, speaker-recognition, speech-to-text, speech-enhancement, speech-separation, audio, audio-processing, speech-processing, speechrecognition, asr, voice-recognition, speaker-diarization, speaker-verification, PyTorch, huggingface, transformers, language-model, deep learning
Python 9.98 k
5 days ago
espnet / espnet

#Computer Science# End-to-End Speech Processing Toolkit

deep learning, end-to-end, chainer, PyTorch, kaldi, speech-recognition, speech-synthesis, speech-translation, machine-translation, voice-conversion, speech-enhancement, speech-separation, singing-voice-synthesis, speaker-diarization, text-to-speech
Python 9.2 k
1 day ago
Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python, audio, speech-recognition, speech-to-text
Python 8.76 k
1 month ago
openvinotoolkit / openvino

#Natural Language Processing# OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

inference, deep learning, openvino, artificial intelligence, computer vision, diffusion-models, generative-ai, llm-inference, natural language processing, performance-boost, speech-recognition, stable-diffusion, deploy-ai, optimize-ai, transformers, yolo, recommendation-system, good-first-issue
C++ 8.43 k
18 hours ago
nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

Tensorflow, cnn, ctc, Python, Keras, speech-recognition, speech-to-text, chinese-speech-recognition, asrt
Python 8.15 k
9 months ago
TalAter / annyang

💬 Speech recognition for your site

speech-recognition, speech, speech-to-text, voice
JavaScript 6.66 k
10 months ago
flashlight / wav2letter

#Computer Science# Facebook AI Research's Automatic Speech Recognition Toolkit

wav2letter, speech-recognition, end-to-end, deep learning, C++
C++ 6.43 k
7 months ago
FunAudioLLM / SenseVoice

#Large Language Models# Multilingual Voice Understanding Model

artificial intelligence, asr, gpt-4o, speech-recognition, speech-to-text, aigc, cross-lingual, large language models, Python, PyTorch, multilingual
Python 5.9 k
3 months ago