GitHub 中文社区

回车: Github搜索 Shift+回车: Google搜索

©2025 GitHub中文社区论坛 GitHub官网网站地图 GitHub官方翻译

GitHub on X
GitHub on Facebook
GitHub on LinkedIn
GitHub on YouTube
GitHub on Twitch
GitHub on TikTok
GitHub’s organization on GitHub

集合主题趋势排行榜

#

automatic-speech-recognition

Website
Wikipedia

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

e2e-models PyTorch asr transformer conformer production-ready automatic-speech-recognition speech-recognition Whisper

Python 4.71 k

20 天前

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

#新手入门#Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

automatic-speech-recognition papers 路线图 rnn cnn dnn attention-mechanism seq2seq timit-dataset tts language-model speaker-verification speech-recognition speech-synthesis 神经网络 diffusion-models singing-voice-synthesis voice-conversion

3.06 k

2 年前

zzw922cn / Automatic_Speech_Recognition

#计算机科学#End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

automatic-speech-recognition Tensorflow timit-dataset feature-vector phonemes data-preprocessing rnn audio 深度学习 lstm end-to-end cnn evaluation Bukkit speech-recognition chinese-speech-recognition

Python 2.84 k

2 年前

ahmetoner / whisper-asr-webservice

OpenAI Whisper ASR Webservice API

automatic-speech-recognition speech-recognition speech-to-text openai-whisper Docker asr speech

Python 2.79 k

1 个月前

#计算机科学#🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

stt speech-to-text Tensorflow 深度学习 automatic-speech-recognition asr voice-recognition speech-recognition

C++ 2.49 k

1 年前

kakaobrain / pororo

#自然语言处理#PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

深度学习自然语言处理 automatic-speech-recognition speech-synthesis neural-models

Python 1.3 k

3 年前

FireRedTeam / FireRedASR

#大语言模型#Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recogn...

asr 大语言模型 Open Source speech-recognition automatic-speech-recognition conformer speechllm transformer

Python 1.21 k

4 个月前

TEN-framework / ten-vad

Voice Activity Detector(VAD) from TEN: low-latency, high-performance and lightweight

conversational-ai real-time speech-processing vad voice-activity-detection voice-commands voice-recognition audio automatic-speech-recognition speech

C 1.06 k

10 天前

TensorSpeech / TensorFlowASR

⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

automatic-speech-recognition speech-recognition speech-to-text tensorflow2 rnn-transducer conformer tflite ctc Tensorflow

Python 987

2 个月前

snakers4 / open_stt

Open STT

speech-to-text russian dataset stt asr automatic-speech-recognition

Python 801

3 年前

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

automatic-speech-recognition Python speech-to-text evaluation-metrics

Python 762

5 个月前

shirayu / whispering

Streaming transcriber with whisper

automatic-speech-recognition Whisper

Python 690

2 年前

EmulationAI / awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

audio-processing foundational-models large-language-models speech-to-text music-information-retrieval automatic-speech-recognition

685

1 年前

Picovoice / cheetah

On-device streaming speech-to-text engine powered by deep learning

speech-to-text asr automatic-speech-recognition speech-recognition stt transcription voice-recognition

Python 634

20 小时前

hirofumi0810 / neural_sp

End-to-end ASR/LM implementation with PyTorch

PyTorch speech-recognition automatic-speech-recognition asr ctc attention-mechanism attention seq2seq sequence-to-sequence speech language-model transformer language-modeling rnn-transducer transformer-xl streaming

Python 597

4 年前

YoavRamon / awesome-kaldi

#Awesome#This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

kaldi automatic-speech-recognition Awesome Lists speech-to-text speech speech-recognition

537

3 年前

vilassn / whisper_android

#安卓#Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

asr openai texttospeech tts Whisper text-to-speech speech-recognition Tensorflow tflite offline tensorflowlite Android automatic-speech-recognition transcription transcribe embedded 移动 translation

C++ 495

6 个月前

Z-yq / TensorflowASR

一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目，CPU上的实时率(RTF)小于0.1

transformer bert tensorflow2 automatic-speech-recognition state-of-the-art ctc transducers C++

Python 474

5 个月前

Picovoice / leopard

On-device speech-to-text engine powered by deep learning

stt speech-to-text asr automatic-speech-recognition on-device speech-recognition transcription voice-recognition

Python 458

2 天前

jonatasgrosman / huggingsound

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

transformers audio speech speech-recognition asr automatic-speech-recognition speech-to-text

Python 458

2 年前

loading...