微软VALL-E X 零样本语音合成模型的开源实现

关于

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

emotional-speech gpt text-to-speech voice-clone transformer-architecture tts vall-e

创建时间

2023-07-29

是否国产

否

语言

Python100.0%

该仓库已收录但尚未编辑。项目介绍及使用教程请前往 GitHub 阅读 README

0 条讨论

登录后发表评论

Plachtaa 的其他开源项目

VITS-fast-fine-tuning

@Plachtaa

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python4.93 k

5 个月前

seed-vc

@Plachtaa

zero-shot voice conversion & singing voice conversion, with real-time support

voice-conversion singing-voice-conversion

Python2.65 k

2 个月前

Plachtaa.github.io

@Plachtaa

HTML0

9 个月前

您可能感兴趣的

GPT-SoVITS

@RVC-Boss

强大的少样本语音转换与语音合成Web用户界面。

text-to-speech tts vits voice-clone voice-cloneai

Python47.85 k

1 天前

grok-1

@xai-org

大模型Grok-1开源

Python50.29 k

10 个月前

Open-Sora

@hpcaitech

Open-Sora：完全开源的高效复现类Sora视频生成方案

Python26.7 k

2 个月前

TTS

@coqui-ai

#计算机科学#🐸💬 - 一个深度学习的 TTS 语言合成库

Python text-to-speech 深度学习 speech PyTorch

Python40.83 k

10 个月前

OpenVoice

@myshell-ai

Instant voice cloning by MIT and MyShell. Audio foundation model.

text-to-speech tts voice-clone zero-shot-tts

Python32.67 k

2 个月前

VoiceCraft

@jasonppy

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook8.3 k

3 个月前

bark

@suno-ai

Bark 是一个文本提示的生成式语音模型。支持英语、中文、德语、日语等多国语言

Jupyter Notebook38.04 k

10 个月前

whisper

OpenAI@openai

whisper 是一个通用语音识别模型

Python83.56 k

1 个月前

vall-e

@lifeiteng

#大语言模型#PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

in-context-learning large-language-models text-to-speech tts ChatGPT

Python2.14 k

13 天前

edge-tts

@rany2

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

tts speech-synthesis text-to-speech

Python8.48 k

2 个月前

ollama

@ollama

#大语言模型#本地化搭建和运行 Llama2 和其他大模型

llama 大语言模型 llama2 Go

Go144.23 k

1 小时前

MeloTTS

@myshell-ai

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

text-to-speech tts 中文 english french

Python6.18 k

6 个月前

Retrieval-based-Voice-Conversion-WebUI

@RVC-Project

一个基于VITS的简单易用的语音转换（变声器）框架

change sovits vits voice voice-conversion

Python30.19 k

7 个月前

AniPortrait

@Zejun-Yang

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python4.96 k

1 年前

IDArling存档

@IDArlingTeam

Collaborative Reverse Engineering plugin for IDA Pro & Hex-Rays

ida ida-pro ida-plugin idapython idapython-plugin

Python662

4 年前

StyleTTS2

@yl4579

#计算机科学#StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

深度学习 PyTorch speaker-adaptation speech-synthesis text-to-speech

Python5.8 k

10 个月前

MoneyPrinterTurbo

@harry0703

#大语言模型#利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

shortvideo 自动化 ChatGPT moviepy Python

Python36.89 k

9 天前

Open-Sora-Plan

@PKU-YuanGroup

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python11.99 k

17 小时前

Real-Time-Voice-Cloning

Corentin Jemine@CorentinJ

#计算机科学#Real-Time-Voice-Cloning 是一个基于深度学习的语音合成工具，5秒内即可克隆一个声音。

深度学习 PyTorch Tensorflow tts 声音克隆

Python54.54 k

21 天前

MockingBird

@babysor

#计算机科学#🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

人工智能 speech PyTorch 深度学习 text-to-speech

Python36.35 k

7 个月前