该仓库已收录但尚未编辑。项目介绍及使用教程请前往 GitHub 阅读 README
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
2022-02-13
否
2023-04-02T00:55:24Z
Official implementation of the paper “Inversion-Based Style Transfer with Diffusion Models” (CVPR 2023)
Gradio UI for running Meta AI's Segment Anything on own hardware
强大的少样本语音转换与语音合成Web用户界面。
A simple Segment Anything WebUI based on Gradio.
[ICCV 2023] BlendFace: Re-designing Identity Encoders for Face-Swapping https://arxiv.org/abs/2307.10854
Mora: More like Sora for Generalist Video Generation
Add bisenetv2. My implementation of BiSeNet
#计算机科学#[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
#大语言模型#Code examples and resources for DBRX, a large language model developed by Databricks
[CVPR 2022] RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value Pairs
[ICLR 2025] HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Official PyTorch implementation of BigVGAN (ICLR 2023)
实时目标检测 - YOLOv9 论文实现:Learning What You Want to Learn Using Programmable Gradient Information
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
TinyRAG
[ICLR 2024] Code for FreeNoise based on VideoCrafter
0 条讨论