# video-synthesis

https://static.github-zh.com/github_avatars/ali-vilab?size=40

Official repo for VGen: a holistic video generation ecosystem built on diffusion models

Python 3.13k
8 months ago
https://static.github-zh.com/github_avatars/DmitryRyumin?size=40

#Face recognition# ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...

Python 955
1 year ago
https://static.github-zh.com/github_avatars/TIGER-AI-Lab?size=40

#Computer science# Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]

Jupyter Notebook 614
1 year ago
https://static.github-zh.com/github_avatars/DmitryRyumin?size=40
Python 452
1 year ago
https://static.github-zh.com/github_avatars/TIGER-AI-Lab?size=40

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation [TMLR 2024]

Python 251
1 year ago
https://static.github-zh.com/github_avatars/guyyariv?size=40

#Computer science# This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

Python 128
7 months ago
https://static.github-zh.com/github_avatars/Bomou-AI?size=40

AI Talking Head: create video from plain text or an audio file in minutes, with support for 100+ languages and 350+ voice models.

36
3 years ago
https://static.github-zh.com/github_avatars/zhengqia?size=40

Video synthesis (videosyn) is a multi-material synthesis plugin developed officially by PlugLink, mainly used for publishing across matrix accounts, freeing up your hands.

HTML 6
1 year ago
https://static.github-zh.com/github_avatars/adrianSRoman?size=40

Generating Diverse Audio-Visual 360° Soundscapes for Sound Event Localization and Detection

Python 6
1 month ago
https://static.github-zh.com/github_avatars/vonqo?size=40

devola2 is an open-source real-time audio visualizer (video-synth) tool, purpose-built for live music performances of B.L.M.D and Even Tide.

JavaScript 6
4 months ago
https://static.github-zh.com/github_avatars/tasinislam21?size=40

This model synthesises high-fidelity fashion videos from single images featuring spontaneous and believable movements.

Python 5
1 year ago
https://static.github-zh.com/github_avatars/cskonopka?size=40
Makefile 4
1 year ago
https://static.github-zh.com/github_avatars/Yazdi9?size=40
Jupyter Notebook 1
2 years ago
https://static.github-zh.com/github_avatars/madebynanditaaa?size=40

#Computer science# LipGANs is a text-to-viseme GAN framework that generates realistic mouth movements directly from text, without requiring audio. It maps phonemes → visemes, predicts phoneme durations, and uses per-vi...

Python 0
9 days ago
https://static.github-zh.com/github_avatars/DeboJp?size=40

#Large language models# An automated video storytelling pipeline that turns online articles into narrated clips with custom scripts, titles, and visuals. Combines scraping, summarization, TTS, and video generation—ideal for ...

HTML 0
24 days ago