stt

#Large Language Models#Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI ...

semantic-search Emacs Obsidian chat ChatGPT artificial-intelligence large-language-models productivity agent self-hosted rag whatsapp-ai offline-llm llamacpp llama3 image-generation stt assistant research

Python 31.1 k

2 days ago

alphacep / vosk-api

#Android#Vosk is an offline speech recognition toolkit. It supports Python, Java, Node.JS, C#, and C++, and can recognize 20+ languages, including Chinese, English, and French.

Jupyter Notebook 13.22 k

8 days ago

snakers4 / silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

speech-recognition speech-to-text stt asr pretrained-models english german spanish stt-benchmark PyTorch colab onnx text-to-speech speech speech-synthesis tts

Jupyter Notebook 5.49 k

2 years ago

jianchang512 / stt

Voice Recognition to Text Tool / A local tool that runs offline and converts audio and video to subtitles, with output in JSON, SRT subtitle, and plain-text formats

speech speech-recognition speech-to-text stt

Python 3.83 k

20 days ago

pluja / whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

artificial-intelligence audio-to-text Go subtitles sveltekit transcription Whisper ui Web app speech-recognition speech-to-text stt Web

Svelte 2.66 k

1 month ago

coqui-ai / STT

#Computer Science#🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

stt speech-to-text Tensorflow deep-learning automatic-speech-recognition asr voice-recognition speech-recognition

C++ 2.52 k

2 years ago

pannous / tensorflow-speech-recognition

#Computer Science#🎙Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks

Tensorflow speech-recognition neural-networks deep-learning stt speech-to-text

Python 2.17 k

2 years ago

neural-maze / ava-whatsapp-agent-course

Meet Ava, the WhatsApp Agent

agent agentic-workflow agents stt tts vector-database

Python 1.5 k

5 months ago

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

tts stt speech-to-text text-to-speech speech-recognition speech-synthesis speech-processing voice-recognition voice-activity-detection voice-cloning speech-separation

1.36 k

1 year ago

lenML / Speech-AI-Forge

#Large Language Models#🍦 Speech-AI-Forge is a project built around TTS generation models that implements an API server and a Gradio-based WebUI.

chattts tts agent gpt large-language-models text-to-speech colab llama chinese english cosyvoice asr stt Whisper

Python 1.34 k

2 days ago

Robitx / gp.nvim

#Editors#Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..]

copilot Neovim speech-to-text Whisper Vim codeium Lua voice large-language-models ollama claude gpt4o gpt-4o sonnet gemini mistral perplexity stt parrot

Lua 1.26 k

1 month ago

R3gm / SoniTranslate

Synchronized Translation for Videos. Video dubbing

audio-processing diarization translation translate-audio translate-video video-dubbing asr automatic-dubbing document-translator dubbing speech-to-text stt text-to-speech tts

Python 1.23 k

1 month ago

mkiol / dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

asr stt tts Linux nmt offline translator machine-translation speech-recognition speech-synthesis speech-to-text text-to-speech translation

C++ 1.13 k

21 days ago

joey-zhou / xiaozhi-esp32-server-java

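An enterprise-grade Java management platform for Xiaozhi ESP32, providing an integrated front-end, back-end, and server solution for device monitoring, voice customization, role switching, and conversation-record management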

ESP32 Java mcp mcp-client mcp-server spring-ai stt tts xiaozhi xiaozhi-ai xiaozhi-esp32 xiaozhi-server

Java 865

4 hours ago

snakers4 / open_stt

Open STT

speech-to-text russian dataset stt asr automatic-speech-recognition

Python 803

4 years ago

VRCWizard / TTS-Voice-Wizard

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

tts speech-to-text speech-recognition VRChat osc Discord free voice vtuber chatbox Spotify stt text-to-speech

C# 712

25 days ago