AI Vtuber for Streaming on Youtube/Twitch
All in One Version : Youtube WAV Download, Separating Vocal, Splitting Audio, Training, and Inference Using Google Colab
Run FaceFusion Gradio Using Google Colab
Combination of Edge TTS and Sovits Voice Converter Using Google Colab
Open-Sora: 完全开源的高效复现类Sora视频生成方案
一个基于VITS的简单易用的语音转换(变声器)框架
A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
A Hands on Guide on MLOps Practices to take your model from Laptop to Production. Created by the Author - Nachiketh Murthy
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
#计算机科学#Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
An online IDE for rapid web development
0 条讨论