
speech-to-text

ggml-org / whisper.cpp

A C++ port of OpenAI's Whisper speech recognition model.

openai speech-to-text transformer Whisper inference speech-recognition

C++ 43.29 k

10 days ago

mozilla / DeepSpeech

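#Computer Science# DeepSpeech is an open-source embedded (offline, on-device) speech recognition engine that can run on hardware as small as a Raspberry Pi.

deep-learning machine-learning neural-networks Tensorflow speech-recognition speech-to-text deepspeech embedded on-device offline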

C++ 26.6 k

3 months ago

SYSTRAN / faster-whisper

#Computer Science# Faster Whisper transcription with CTranslate2

deep-learning inference quantization speech-recognition speech-to-text transformer Whisper openai

Python 18.16 k

1 month ago

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asr speech speech-recognition speech-to-text Whisper

Python 17.78 k

3 months ago

leon-ai / leon

🧠 Leon is your open-source personal assistant.

leon personal-assistant Node.js Python artificial-intelligence speech-to-text text-to-speech speech-recognition speech-synthesis flite assistant virtual-assistant chatbot Bot voice-assistant automation offline privacy ai-assistant

TypeScript 16.65 k

4 days ago

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

kaldi C++ CUDA Shell speech-recognition speech-to-text speaker-verification speaker-id speech

Shell 15.12 k

2 months ago

jianchang512 / pyvideotrans

Translate a video from one language to another and add dubbing. Also supports speech-recognition transcription, speech synthesis, and subtitle translation.

text-to-speech video-transition speech-to-text

Python 14.24 k

5 days ago

alphacep / vosk-api

#Android# Vosk is an offline speech recognition toolkit. It supports Python, Java, Node.js, C#, and C++, and recognizes 20+ languages, including Chinese, English, and French.

Jupyter Notebook 13.22 k

8 days ago

Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python audio speech-recognition speech-to-text

Python 8.86 k

4 days ago

KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python realtime speech-to-text

Python 8.6 k

2 months ago

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

Tensorflow cnn ctc Python Keras speech-recognition speech-to-text chinese-speech-recognition asrt

Python 8.23 k

12 days ago

k2-fsa / sherpa-onnx

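#Android# Sherpa-ONNX is a lightweight speech recognition framework based on Kaldi and onnxruntime. It provides speech-to-text, text-to-speech, speaker diarization, and voice activity detection (VAD) without an internet connection. It supports embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, and WebSocket servers/clients, as well as programming languages including C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, and Rust.

asr onnx Windows Linux macOS C++ Android iOS raspberry-pi aarch64 arm32 C# .NET mfc speech-to-text text-to-speech vits RISC-V lazarus object-pascal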

C++ 7.45 k

3 hours ago

TalAter / annyang

💬 Speech recognition for your site

speech-recognition speech speech-to-text voice

JavaScript 6.66 k

1 year ago

FunAudioLLM / SenseVoice

#Large Language Model# Multilingual Voice Understanding Model

artificial-intelligence asr gpt-4o speech-recognition speech-to-text aigc cross-lingual large-language-model Python PyTorch multilingual

Python 6.63 k

1 month ago

snakers4 / silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

speech-recognition speech-to-text stt asr pretrained-models english german spanish stt-benchmark PyTorch colab onnx text-to-speech speech speech-synthesis tts

Jupyter Notebook 5.49 k

2 years ago

modelscope / FunClip

#Large Language Model# Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.

speech-recognition video-clip video-subtitles subtitles-generator speech-to-text gradio gradio-python-llm large-language-model

Python 4.97 k

2 months ago

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

asr speaker-diarization speech speech-recognition speech-to-text Whisper

Jupyter Notebook 4.96 k

1 month ago

abus-aikorea / voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...

faster-whisper tts Whisper gradio subtitles transcription translator webui speech-recognition speech-synthesis speech-to-text text-to-speech yt-dlp voice-cloning podcasts audiobook voice-conversion karaoke whisperx

Python 4.81 k

2 months ago

sanchit-gandhi / whisper-jax

#Computer Science# JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

deep-learning jax speech-recognition speech-to-text Whisper

Jupyter Notebook 4.63 k

1 year ago
