Loading

该仓库已收录但尚未编辑。项目介绍及使用教程请前往 GitHub 阅读 README


0 条讨论

登录后发表评论

关于

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

创建时间
是否国产

  修改时间

2023-12-28T06:21:05Z


语言

  • Python100.0%

modelscope 的其他开源项目

Python12.94 k
3 天前

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python12.91 k
6 天前

Enjoy the magic of Diffusion models!

Python10.25 k
7 天前

#大语言模型#Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4,...

Python10.2 k
1 天前

您可能感兴趣的

强大的少样本语音转换与语音合成Web用户界面。

Python51.34 k
1 个月前
Python8.59 k
3 天前

Open-Sora: 完全开源的高效复现类Sora视频生成方案

Python27.31 k
5 个月前

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python12.03 k
8 天前

#大语言模型#本地化搭建和运行 Llama2 和其他大模型

Go153.62 k
1 天前

大模型Grok-1开源

Python50.52 k
1 年前

#计算机科学#The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.

Python3.58 k
4 天前

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

C175
2 年前

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python12.91 k
6 天前

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook8.38 k
7 天前

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7.65 k
1 年前

[CVPR 2024] Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.

Python352
8 个月前

#大语言模型#利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python45.74 k
4 个月前

UT-Sarulab MOS prediction system using SSL models

Python270
1 年前
Jupyter Notebook99.77 k
16 小时前

[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python4.23 k
1 年前

#大语言模型#Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.

Python18.65 k
3 天前