This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
zero-shot voice conversion & singing voice conversion, with real-time support
Open-Sora: 完全开源的高效复现类Sora视频生成方案
Instant voice cloning by MIT and MyShell. Audio foundation model.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
whisper 是一个通用语音识别模型
#大语言模型#PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
一个基于VITS的简单易用的语音转换(变声器)框架
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Collaborative Reverse Engineering plugin for IDA Pro & Hex-Rays
#计算机科学#StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
#大语言模型#利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
#计算机科学#Real-Time-Voice-Cloning 是一个基于深度学习的语音合成工具,5秒内即可克隆一个声音。
#计算机科学#🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time