GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

automatic-speech-recognition

Website
Wikipedia
https://static.github-zh.com/github_avatars/wenet-e2e?size=40
wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

e2e-modelsPyTorchasrtransformerconformerproduction-readyautomatic-speech-recognitionspeech-recognitionWhisper
Python 4.56 k
5 天前
https://static.github-zh.com/github_avatars/zzw922cn?size=40
zzw922cn / awesome-speech-recognition-speech-synthesis-papers

#新手入门#Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

automatic-speech-recognitionpapers路线图rnncnndnnattention-mechanismseq2seqtimit-datasetttslanguage-modelspeaker-verificationspeech-recognitionspeech-synthesis神经网络diffusion-modelssinging-voice-synthesisvoice-conversion
3.05 k
2 年前
https://static.github-zh.com/github_avatars/zzw922cn?size=40
zzw922cn / Automatic_Speech_Recognition

#计算机科学#End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

automatic-speech-recognitionTensorflowtimit-datasetfeature-vectorphonemesdata-preprocessingrnnaudio深度学习lstmend-to-endcnnevaluationBukkitspeech-recognitionchinese-speech-recognition
Python 2.84 k
2 年前
https://static.github-zh.com/github_avatars/ahmetoner?size=40
ahmetoner / whisper-asr-webservice

OpenAI Whisper ASR Webservice API

automatic-speech-recognitionspeech-recognitionspeech-to-textopenai-whisperDockerasrspeech
Python 2.66 k
4 个月前
https://static.github-zh.com/github_avatars/coqui-ai?size=40
coqui-ai / STT

#计算机科学#🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

sttspeech-to-textTensorflow深度学习automatic-speech-recognitionasrvoice-recognitionspeech-recognition
C++ 2.45 k
1 年前
https://static.github-zh.com/github_avatars/kakaobrain?size=40
kakaobrain / pororo

#自然语言处理#PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

深度学习自然语言处理automatic-speech-recognitionspeech-synthesisneural-models
Python 1.3 k
3 年前
https://static.github-zh.com/github_avatars/FireRedTeam?size=40
FireRedTeam / FireRedASR

#大语言模型#Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recogn...

asr大语言模型Open Sourcespeech-recognitionautomatic-speech-recognitionconformerspeechllmtransformer
Python 1.05 k
3 个月前
https://static.github-zh.com/github_avatars/TensorSpeech?size=40
TensorSpeech / TensorFlowASR

⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

automatic-speech-recognitionspeech-recognitionspeech-to-texttensorflow2rnn-transducerconformertflitectcTensorflow
Python 980
22 天前
https://static.github-zh.com/github_avatars/snakers4?size=40
snakers4 / open_stt

Open STT

speech-to-textrussiandatasetsttasrautomatic-speech-recognition
Python 798
3 年前
https://static.github-zh.com/github_avatars/jitsi?size=40
jitsi / jiwer

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

automatic-speech-recognitionPythonspeech-to-textevaluation-metrics
Python 735
4 个月前
https://static.github-zh.com/github_avatars/shirayu?size=40
shirayu / whispering

Streaming transcriber with whisper

automatic-speech-recognitionWhisper
Python 687
2 年前
https://static.github-zh.com/github_avatars/EmulationAI?size=40
EmulationAI / awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

audio-processingfoundational-modelslarge-language-modelsspeech-to-textmusic-information-retrievalautomatic-speech-recognition
678
10 个月前
https://static.github-zh.com/github_avatars/Picovoice?size=40
Picovoice / cheetah

On-device streaming speech-to-text engine powered by deep learning

speech-to-textasrautomatic-speech-recognitionspeech-recognitionstttranscriptionvoice-recognition
Python 631
4 天前
https://static.github-zh.com/github_avatars/hirofumi0810?size=40
hirofumi0810 / neural_sp

End-to-end ASR/LM implementation with PyTorch

PyTorchspeech-recognitionautomatic-speech-recognitionasrctcattention-mechanismattentionseq2seqsequence-to-sequencespeechlanguage-modeltransformerlanguage-modelingrnn-transducertransformer-xlstreaming
Python 596
4 年前
https://static.github-zh.com/github_avatars/YoavRamon?size=40
YoavRamon / awesome-kaldi

#Awesome#This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

kaldiautomatic-speech-recognitionAwesome Listsspeech-to-textspeechspeech-recognition
535
3 年前
https://static.github-zh.com/github_avatars/TEN-framework?size=40
TEN-framework / ten-vad

TEN VAD: low-latency high-performance Voice Activity Detector

conversational-aireal-timespeech-processingvadvoice-activity-detectionvoice-commandsvoice-recognitionaudioautomatic-speech-recognitionspeech
C 508
10 天前
https://static.github-zh.com/github_avatars/Z-yq?size=40
Z-yq / TensorflowASR

一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1

transformerberttensorflow2automatic-speech-recognitionstate-of-the-artctctransducersC++
Python 475
3 个月前
https://static.github-zh.com/github_avatars/Picovoice?size=40
Picovoice / leopard

On-device speech-to-text engine powered by deep learning

sttspeech-to-textasrautomatic-speech-recognitionon-devicespeech-recognitiontranscriptionvoice-recognition
Python 457
5 天前
https://static.github-zh.com/github_avatars/jonatasgrosman?size=40
jonatasgrosman / huggingsound

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

transformersaudiospeechspeech-recognitionasrautomatic-speech-recognitionspeech-to-text
Python 457
2 年前
https://static.github-zh.com/github_avatars/vilassn?size=40
vilassn / whisper_android

#安卓#Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

asropenaitexttospeechttsWhispertext-to-speechspeech-recognitionTensorflowtfliteofflinetensorflowliteAndroidautomatic-speech-recognitiontranscriptiontranscribeembedded移动translation
C++ 456
4 个月前
loading...