text-to-speech · GitHub Topics

RVC-Boss / GPT-SoVITS

强大的少样本语音转换与语音合成Web用户界面。

text-to-speech tts vits voice-clone voice-cloneai 声音克隆

Python 50.84 k

3 天前

unslothai / unsloth

#大语言模型#Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

fine-tuning llama 大语言模型 lora mistral gemma llama3 unsloth deepseek deepseek-r1 gemma3 text-to-speech tts qwen qwen3 agent 人工智能 openai gpt-oss

Python 45.43 k

9 小时前

coqui-ai / TTS

#计算机科学#🐸💬 - 一个深度学习的 TTS 语言合成库

Python text-to-speech 深度学习 speech PyTorch tts vocoder tacotron glow-tts melgan speaker-encoder hifigan speaker-encodings multi-speaker-tts tts-model speech-synthesis 声音克隆 voice-synthesis voice-conversion

Python 42.57 k

1 年前

2noise / ChatTTS

#大语言模型#ChatTTS是专门为对话场景设计的文本转语音模型，例如LLM助手对话任务。它支持英文和中文两种语言

agent text-to-speech chat ChatGPT chattts 中文 chinese-language english english-language gpt 大语言模型 llm-agent natural-language-inference Python torch tts

Python 37.79 k

2 个月前

babysor / MockingBird

#计算机科学#🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

人工智能 speech PyTorch 深度学习 text-to-speech tts

Python 36.62 k

10 个月前

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

text-to-speech tts voice-clone zero-shot-tts

Python 34.39 k

5 个月前

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

人工智能 text-to-speech

Python 18.34 k

2 个月前

leon-ai / leon

🧠 Leon is your open-source personal assistant.

leon personal-assistant Node.js Python 人工智能 speech-to-text text-to-speech speech-recognition speech-synthesis flite assistant virtual-assistant 聊天机器人 Bot voice-assistant 自动化 offline 隐私 ai-assistant

TypeScript 16.64 k

14 小时前

FunAudioLLM / CosyVoice

#大语言模型#Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

audio-generation gpt-4o text-to-speech tts cantonese 聊天机器人 ChatGPT 中文 english fine-grained fine-tuning japanese korean multi-lingual natural-language-generation Python cosyvoice cross-lingual 声音克隆

Python 16.37 k

2 天前

jianchang512 / pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，同时支持语音识别转录、语音合成、字幕翻译。

text-to-speech video-transition speech-to-text

Python 14.19 k

2 小时前

mozilla / TTS

#计算机科学#🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

深度学习 text-to-speech Python PyTorch tacotron tts speaker-encoder dataset-analysis tacotron2 tensorflow2 vocoder melgan glow-tts speech

Jupyter Notebook 10 k

2 年前

rhasspy / piper

A fast, local neural text to speech system

speech-synthesis text-to-speech tts

C++ 9.99 k

18 天前

espnet / espnet

#计算机科学#End-to-End Speech Processing Toolkit

深度学习 end-to-end chainer PyTorch kaldi speech-recognition speech-synthesis speech-translation machine-translation voice-conversion speech-enhancement speech-separation singing-voice-synthesis speaker-diarization text-to-speech

Python 9.45 k

8 小时前

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

audio-generation audio-synthesis audioldm music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e voice-conversion audit fastspeech2 vits emilia maskgct vocoder

Python 9.38 k

4 个月前

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

cross-lingual text-to-speech tts voice-clone zero-shot-tts

Python 9.21 k

1 天前

rany2 / edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

tts speech-synthesis text-to-speech

Python 9.04 k

16 天前

netease-youdao / EmotiVoice

#计算机科学#EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

PyTorch speech speech-synthesis tts multi-speaker text-to-speech 深度学习 prompt emotivoice 人工智能 Python emotion style

Python 8.32 k

1 年前

Plachtaa / VALL-E-X

微软VALL-E X 零样本语音合成模型的开源实现

emotional-speech gpt text-to-speech voice-clone transformer-architecture tts vall-e

Python 7.92 k

2 年前

jaywalnut310 / vits

#计算机科学#VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

tts text-to-speech PyTorch 深度学习 speech-synthesis

Python 7.67 k

2 年前

k2-fsa / sherpa-onnx

#安卓#Sherpa-ONNX 是一个轻量级语音识别框架，基于 Kaldi 和 onnxruntime，无需联网即可实现语音转文本、文本转语音、说话人分离以及语音活动检测(VAD)。支持嵌入式系统、安卓、iOS、鸿蒙系统、树莓派、RISC-V、x86_64 服务器、WebSocket 服务器 / 客户端，以及 C/C++、Python、Kotlin、C#、Go、NodeJS、Java、Swift、Dart、JavaScript、Flutter、Object Pascal、Lazarus、Rust 等编程语言。

asr onnx Windows Linux macOS C++Android iOS 树莓派 aarch64 arm32 C#.NET mfc speech-to-text text-to-speech vits RISC-V lazarus object-pascal

C++ 7.4 k

1 天前