GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

text-to-speech

Website
Wikipedia
https://static.github-zh.com/github_avatars/RVC-Boss?size=40
RVC-Boss / GPT-SoVITS

强大的少样本语音转换与语音合成Web用户界面。

text-to-speechttsvitsvoice-clonevoice-cloneai声音克隆
Python 47.62 k
2 天前
https://static.github-zh.com/github_avatars/coqui-ai?size=40
coqui-ai / TTS

#计算机科学#🐸💬 - 一个深度学习的 TTS 语言合成库

Pythontext-to-speech深度学习speechPyTorchttsvocodertacotronglow-ttsmelganspeaker-encoderhifiganspeaker-encodingsmulti-speaker-ttstts-modelspeech-synthesis声音克隆voice-synthesisvoice-conversion
Python 40.73 k
10 个月前
unslothai/unsloth
https://static.github-zh.com/github_avatars/unslothai?size=40
unslothai / unsloth

#大语言模型#Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

finetuningfine-tuningllama大语言模型loramistralqloragemmallama3unslothdeepseekdeepseek-r1gemma3llama-4llama4text-to-speechttsqwenqwen3
Python 40.54 k
3 天前
https://static.github-zh.com/github_avatars/2noise?size=40
2noise / ChatTTS

#大语言模型#ChatTTS是专门为对话场景设计的文本转语音模型,例如LLM助手对话任务。它支持英文和中文两种语言

agenttext-to-speechchatChatGPTchattts中文chinese-languageenglishenglish-languagegpt大语言模型llm-agentnatural-language-inferencePythontorchtts
Python 36.8 k
23 天前
babysor/MockingBird
https://static.github-zh.com/github_avatars/babysor?size=40
babysor / MockingBird

#计算机科学#🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

人工智能speechPyTorch深度学习text-to-speechtts
Python 36.33 k
7 个月前
https://static.github-zh.com/github_avatars/myshell-ai?size=40
myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

text-to-speechttsvoice-clonezero-shot-tts
Python 32.62 k
2 个月前
nari-labs/dia
https://static.github-zh.com/github_avatars/nari-labs?size=40
nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

人工智能text-to-speech
Python 16.93 k
18 天前
leon-ai/leon
https://static.github-zh.com/github_avatars/leon-ai?size=40
leon-ai / leon

🧠 Leon is your open-source personal assistant.

leonpersonal-assistantNode.jsPython人工智能speech-to-texttext-to-speechspeech-recognitionspeech-synthesisfliteassistantvirtual-assistant聊天机器人Botvoice-assistant自动化offline隐私ai-assistant
TypeScript 16.37 k
21 天前
https://static.github-zh.com/github_avatars/FunAudioLLM?size=40
FunAudioLLM / CosyVoice

#大语言模型#Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

audio-generationgpt-4otext-to-speechttscantonese聊天机器人ChatGPT中文englishfine-grainedfine-tuningjapanesekoreanmulti-lingualnatural-language-generationPythoncosyvoicecross-lingual声音克隆
Python 14.54 k
3 天前
https://static.github-zh.com/github_avatars/jianchang512?size=40
jianchang512 / pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。

text-to-speechvideo-transitionspeech-to-text
Python 13.03 k
9 小时前
https://static.github-zh.com/github_avatars/mozilla?size=40
mozilla / TTS

#计算机科学#:robot: 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

深度学习text-to-speechPythonPyTorchtacotronttsspeaker-encoderdataset-analysistacotron2tensorflow2vocodermelganglow-ttsspeech
Jupyter Notebook 9.88 k
2 年前
https://static.github-zh.com/github_avatars/rhasspy?size=40
rhasspy / piper

A fast, local neural text to speech system

speech-synthesistext-to-speechtts
C++ 9.31 k
1 个月前
https://static.github-zh.com/github_avatars/espnet?size=40
espnet / espnet

#计算机科学#End-to-End Speech Processing Toolkit

深度学习end-to-endchainerPyTorchkaldispeech-recognitionspeech-synthesisspeech-translationmachine-translationvoice-conversionspeech-enhancementspeech-separationsinging-voice-synthesisspeaker-diarizationtext-to-speech
Python 9.2 k
17 小时前
open-mmlab/Amphion
https://static.github-zh.com/github_avatars/open-mmlab?size=40
open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

audio-generationaudio-synthesisaudioldmmusic-generationnaturalspeech2singing-voice-conversionspeech-synthesistext-to-audiotext-to-speechvall-evoice-conversionauditfastspeech2vitsemiliamaskgctvocoder
Python 9.15 k
19 天前
https://static.github-zh.com/github_avatars/rany2?size=40
rany2 / edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

ttsspeech-synthesistext-to-speech
Python 8.44 k
1 个月前
https://static.github-zh.com/github_avatars/netease-youdao?size=40
netease-youdao / EmotiVoice

#计算机科学#EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

PyTorchspeechspeech-synthesisttsmulti-speakertext-to-speech深度学习promptemotivoice人工智能Pythonemotionstyle
Python 8.03 k
10 个月前
https://static.github-zh.com/github_avatars/Plachtaa?size=40
Plachtaa / VALL-E-X

微软VALL-E X 零样本语音合成模型的开源实现

emotional-speechgpttext-to-speechvoice-clonetransformer-architecturettsvall-e
Python 7.88 k
1 年前
https://static.github-zh.com/github_avatars/jaywalnut310?size=40
jaywalnut310 / vits

#计算机科学#VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

ttstext-to-speechPyTorch深度学习speech-synthesis
Python 7.49 k
2 年前
https://static.github-zh.com/github_avatars/k2-fsa?size=40
k2-fsa / sherpa-onnx

#安卓#Sherpa-ONNX 是一个轻量级语音识别框架, 基于 Kaldi 和 onnxruntime,无需联网即可实现语音转文本、文本转语音、说话人分离以及语音活动检测(VAD)。支持嵌入式系统、安卓、iOS、鸿蒙系统、树莓派、RISC-V、x86_64 服务器、WebSocket 服务器 / 客户端,以及 C/C++、Python、Kotlin、C#、Go、NodeJS、Java、Swift、Dart、JavaScript、Flutter、Object Pascal、Lazarus、Rust 等编程语言。

asronnxWindowsLinuxmacOSC++AndroidiOS树莓派aarch64arm32C#.NETmfcspeech-to-texttext-to-speechvitsRISC-Vlazarusobject-pascal
C++ 6.34 k
6 天前
https://static.github-zh.com/github_avatars/myshell-ai?size=40
myshell-ai / MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

text-to-speechtts中文englishfrenchjapanesekoreanmultilingualspanish
Python 6.16 k
6 个月前
loading...