GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

speech-to-text

Website
Wikipedia
ggml-org/whisper.cpp
https://static.github-zh.com/github_avatars/ggml-org?size=40
ggml-org / whisper.cpp

OpenAI Whisper语音识别模型,C++移植版本。

openaispeech-to-texttransformerWhisperinferencespeech-recognition
C++ 40.79 k
2 天前
https://static.github-zh.com/github_avatars/mozilla?size=40
mozilla / DeepSpeech

#计算机科学#DeepSpeech 是一款开源嵌入式(离线、设备上)语音识别引擎,最低可以在树莓派上运行

深度学习机器学习neural-networksTensorflowspeech-recognitionspeech-to-textdeepspeechembeddedon-deviceoffline
C++ 26.44 k
9 个月前
https://static.github-zh.com/github_avatars/SYSTRAN?size=40
SYSTRAN / faster-whisper

#计算机科学#Faster Whisper transcription with CTranslate2

深度学习inferencequantizationspeech-recognitionspeech-to-texttransformerWhisperopenai
Python 16.55 k
13 天前
leon-ai/leon
https://static.github-zh.com/github_avatars/leon-ai?size=40
leon-ai / leon

🧠 Leon is your open-source personal assistant.

leonpersonal-assistantNode.jsPython人工智能speech-to-texttext-to-speechspeech-recognitionspeech-synthesisfliteassistantvirtual-assistant聊天机器人Botvoice-assistant自动化offline隐私ai-assistant
TypeScript 16.37 k
21 天前
https://static.github-zh.com/github_avatars/m-bain?size=40
m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asrspeechspeech-recognitionspeech-to-textWhisper
Python 16.26 k
7 天前
https://static.github-zh.com/github_avatars/kaldi-asr?size=40
kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

kaldiC++CUDAShellspeech-recognitionspeech-to-textspeaker-verificationspeaker-idspeech
Shell 14.9 k
2 个月前
https://static.github-zh.com/github_avatars/jianchang512?size=40
jianchang512 / pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。

text-to-speechvideo-transitionspeech-to-text
Python 13.03 k
15 小时前
https://static.github-zh.com/github_avatars/alphacep?size=40
alphacep / vosk-api

#安卓#Vosk 是一个离线的语言识别工具。支持 Python, Java, Node.JS, C#, C++ ,能识别20+种语言,包括中文、英语、法语等。

speech-recognitionasrvoice-recognitionspeech-to-textAndroidiOS树莓派深度学习深度神经网络speech-to-text-androidspeaker-identificationspeaker-verificationPythonoffline隐私kaldideepspeechgoogle-speech-to-textvoskstt
Jupyter Notebook 12.08 k
1 个月前
https://static.github-zh.com/github_avatars/speechbrain?size=40
speechbrain / speechbrain

#计算机科学#A PyTorch-based Speech Toolkit

speech-recognitionspeech-toolkitspeaker-recognitionspeech-to-textspeech-enhancementspeech-separationaudioaudio-processingspeech-processingspeechrecognitionasrvoice-recognitionspeaker-diarizationspeaker-verificationPyTorchhuggingfacetransformerslanguage-model深度学习
Python 9.98 k
5 天前
https://static.github-zh.com/github_avatars/Uberi?size=40
Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Pythonaudiospeech-recognitionspeech-to-text
Python 8.76 k
1 个月前
https://static.github-zh.com/github_avatars/nl8590687?size=40
nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

TensorflowcnnctcPythonKerasspeech-recognitionspeech-to-textchinese-speech-recognitionasrt
Python 8.15 k
9 个月前
https://static.github-zh.com/github_avatars/KoljaB?size=40
KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Pythonrealtimespeech-to-text
Python 7.52 k
1 个月前
https://static.github-zh.com/github_avatars/TalAter?size=40
TalAter / annyang

💬 Speech recognition for your site

speech-recognitionspeechspeech-to-textvoice
JavaScript 6.66 k
10 个月前
https://static.github-zh.com/github_avatars/k2-fsa?size=40
k2-fsa / sherpa-onnx

#安卓#Sherpa-ONNX 是一个轻量级语音识别框架, 基于 Kaldi 和 onnxruntime,无需联网即可实现语音转文本、文本转语音、说话人分离以及语音活动检测(VAD)。支持嵌入式系统、安卓、iOS、鸿蒙系统、树莓派、RISC-V、x86_64 服务器、WebSocket 服务器 / 客户端,以及 C/C++、Python、Kotlin、C#、Go、NodeJS、Java、Swift、Dart、JavaScript、Flutter、Object Pascal、Lazarus、Rust 等编程语言。

asronnxWindowsLinuxmacOSC++AndroidiOS树莓派aarch64arm32C#.NETmfcspeech-to-texttext-to-speechvitsRISC-Vlazarusobject-pascal
C++ 6.36 k
7 天前
https://static.github-zh.com/github_avatars/FunAudioLLM?size=40
FunAudioLLM / SenseVoice

#大语言模型#Multilingual Voice Understanding Model

人工智能asrgpt-4ospeech-recognitionspeech-to-textaigccross-lingual大语言模型PythonPyTorchmultilingual
Python 5.9 k
3 个月前
https://static.github-zh.com/github_avatars/snakers4?size=40
snakers4 / silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

speech-recognitionspeech-to-textsttasrpretrained-modelsenglishgermanspanishstt-benchmarkPyTorchcolabonnxtext-to-speechspeechspeech-synthesistts
Jupyter Notebook 5.34 k
2 年前
https://static.github-zh.com/github_avatars/modelscope?size=40
modelscope / FunClip

#大语言模型#Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

speech-recognitionvideo-clipvideo-subtitlessubtitles-generatorspeech-to-textgradiogradio-python-llm大语言模型
Python 4.67 k
3 个月前
https://static.github-zh.com/github_avatars/MahmoudAshraf97?size=40
MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

asrspeaker-diarizationspeechspeech-recognitionspeech-to-textWhisper
Jupyter Notebook 4.63 k
2 个月前
https://static.github-zh.com/github_avatars/sanchit-gandhi?size=40
sanchit-gandhi / whisper-jax

#计算机科学#JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

深度学习jaxspeech-recognitionspeech-to-textWhisper
Jupyter Notebook 4.6 k
1 年前
https://static.github-zh.com/github_avatars/huggingface?size=40
huggingface / speech-to-speech

#计算机科学#Speech To Speech: an effort for an open-sourced and modular GPT4-o

人工智能assistantlanguage-model机器学习Pythonspeechspeech-synthesisspeech-to-textspeech-translation
Python 4.06 k
2 个月前
loading...