Loading

该仓库已收录但尚未编辑。项目介绍及使用教程请前往 GitHub 阅读 README


0 条讨论

登录后发表评论

关于

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

创建时间
是否国产

  修改时间

2025-03-19T06:51:33Z


语言

  • Python100.0%

MoonInTheRiver 的其他开源项目

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

Python443
2 年前

您可能感兴趣的

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Python2.9 k
3 个月前

大模型Grok-1开源

Python50.52 k
1 年前

强大的少样本语音转换与语音合成Web用户界面。

Python51.26 k
21 天前
openinterpreter/01

The #1 open-source voice interface for desktop, mobile, and ESP32 chips.

Python5.09 k
1 年前
Python63.91 k
3 小时前
open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, ...

Python9.42 k
4 个月前

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable mu...

Jupyter Notebook22.51 k
7 个月前

一个基于VITS的简单易用的语音转换(变声器)框架

Python32.25 k
10 个月前

Magenta: Music and Art Generation with Machine Intelligence

Python19.67 k
3 个月前

Devika is now Opcode

Python19.5 k
6 天前

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Python1.06 k
1 年前

上传截图通过GPT生成HTML/Tailwind/JavaScript代码

Python70.95 k
2 个月前

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python2.64 k
1 年前

#大语言模型#利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python44.69 k
4 个月前

#计算机科学#VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python7.7 k
2 年前

Open-Sora: 完全开源的高效复现类Sora视频生成方案

Python27.27 k
5 个月前

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Python2.3 k
2 个月前

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python13.24 k
1 年前

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python12.03 k
3 天前