#

stt

khoj-ai/khoj
https://static.github-zh.com/github_avatars/khoj-ai?size=40

#大语言模型#Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI ...

Python 31.1 k
2 天前
https://static.github-zh.com/github_avatars/alphacep?size=40
Jupyter Notebook 13.22 k
8 天前
https://static.github-zh.com/github_avatars/snakers4?size=40

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook 5.49 k
2 年前
https://static.github-zh.com/github_avatars/jianchang512?size=40

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式

Python 3.83 k
20 天前
https://static.github-zh.com/github_avatars/pluja?size=40

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Svelte 2.66 k
1 个月前
https://static.github-zh.com/github_avatars/coqui-ai?size=40

#计算机科学#🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

C++ 2.52 k
2 年前
https://static.github-zh.com/github_avatars/pannous?size=40

#计算机科学#🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Python 2.17 k
2 年前
lenML/Speech-AI-Forge
https://static.github-zh.com/github_avatars/lenML?size=40

#大语言模型#🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Python 1.34 k
2 天前
Robitx/gp.nvim
https://static.github-zh.com/github_avatars/Robitx?size=40

#编辑器#Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..]

Lua 1.26 k
1 个月前
https://static.github-zh.com/github_avatars/mkiol?size=40

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

C++ 1.13 k
21 天前
https://static.github-zh.com/github_avatars/joey-zhou?size=40

小智ESP32的Java企业级管理平台,提供设备监控、音色定制、角色切换和对话记录管理的前后端及服务端一体化解决方案

Java 865
4 小时前
https://static.github-zh.com/github_avatars/VRCWizard?size=40

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

C# 712
25 天前
https://static.github-zh.com/github_avatars/lobehub?size=40

🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser

TypeScript 673
4 个月前
https://static.github-zh.com/github_avatars/evancohen?size=40

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

JavaScript 639
1 年前
https://static.github-zh.com/github_avatars/Picovoice?size=40
Python 637
6 天前
https://static.github-zh.com/github_avatars/Macoron?size=40

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

C# 619
5 个月前
loading...
Website
Wikipedia