voice-cloning · GitHub Topics

CorentinJ / Real-Time-Voice-Cloning

#计算机科学#Real-Time-Voice-Cloning 是一个基于深度学习的语音合成工具，5秒内即可克隆一个声音。

深度学习 PyTorch Tensorflow tts 声音克隆 Python

Python 54.78 k

2 个月前

RVC-Boss / GPT-SoVITS

强大的少样本语音转换与语音合成Web用户界面。

text-to-speech tts vits voice-clone voice-cloneai 声音克隆

Python 49.42 k

12 天前

coqui-ai / TTS

#计算机科学#🐸💬 - 一个深度学习的 TTS 语言合成库

Python text-to-speech 深度学习 speech PyTorch tts vocoder tacotron glow-tts melgan speaker-encoder hifigan speaker-encodings multi-speaker-tts tts-model speech-synthesis 声音克隆 voice-synthesis voice-conversion

Python 41.72 k

1 年前

FunAudioLLM / CosyVoice

#大语言模型#Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

audio-generation gpt-4o text-to-speech tts cantonese 聊天机器人 ChatGPT 中文 english fine-grained fine-tuning japanese korean multi-lingual natural-language-generation Python cosyvoice cross-lingual 声音克隆

Python 15.48 k

15 天前

Huanshere / VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音，一键全自动视频搬运AI字幕组

ai-translation dubbing Localization (l10n)video-translation 声音克隆

Python 14.05 k

2 个月前

PaddlePaddle / PaddleSpeech

PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源模型库，用于语音和音频中的各种关键任务的开发，典型的应用包括：语音识别、语音翻译、语音合成等

transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr speech-recognition 声音克隆 vocoder voice-recognition self-supervised-learning Whisper

Python 12.11 k

9 天前

DrewThomasson / ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1107+ languages!

audiobooks Docker epub Linux macOS tts Windows xtts 声音克隆 gradio 中文 english multilingual colab-notebook kaggle audiobook

Python 10.95 k

3 天前

multimodal-art-projection / YuE

#计算机科学#YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

foundation-models music-generation huggingface llama audio-generation 声音克隆大语言模型人工智能深度学习 gpt

Python 5.26 k

2 个月前

abus-aikorea / voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isola...

faster-whisper tts Whisper gradio subtitles transcription translator webui speech-recognition speech-synthesis speech-to-text text-to-speech yt-dlp 声音克隆 podcasts audiobook voice-conversion karaoke whisperx

Python 4.35 k

11 天前

Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

speech speech-synthesis text-to-speech voice-cloneai 声音克隆

Jupyter Notebook 2.78 k

1 年前

IAHispano / Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

rvc vc vits voice 人工智能声音克隆 voice-conversion applio voice-clone PyTorch speech text-to-speech tts

Python 2.5 k

3 天前

voice-cloning-app / Voice-Cloning-App

#计算机科学#A Python/Pytorch app for easily synthesising human voices

Python tts text-to-speech PyTorch 深度学习声音克隆 tacotron2

Python 1.45 k

8 个月前

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

tts stt speech-to-text text-to-speech speech-recognition speech-synthesis speech-processing voice-recognition voice-activity-detection 声音克隆 speech-separation

1.35 k

1 年前

gitmylo / audio-webui

A webui for different audio related Neural Networks

人工智能 audioldm bark rvc text-to-audio text-to-speech 声音克隆 audiocraft music generative-music tts aio all-in-one

Python 1.19 k

2 个月前

panyanyany / Twocast

AI Podcast Generator for bilingual episodes, Multi Languages, Alternative to NotebookLLM；真人对话AI播客生成器，多语言，多音色

podcast podcast-generator 声音克隆 voice-synthesis

TypeScript 936

1 个月前

MiniMax-AI / MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.

image-generation mcp mcp-server mcp-tools text-to-speech video-generation image-to-video text-to-image text-to-video 声音克隆

Python 874

23 天前