[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Open-Sora: 完全开源的高效复现类Sora视频生成方案
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Generative Models by Stability AI
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
A curated list of recent diffusion models for video generation, editing, and various other applications.
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Mora: More like Sora for Generalist Video Generation
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
DUSt3R: Geometric 3D Vision Made Easy
TripoSR: Fast 3D Object Reconstruction from a Single Image
Zero-Shot Speech Editing and Text-to-Speech in the Wild
0 条讨论