集合主题趋势排行榜

video-synthesis

ali-vilab / VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

diffusion-models video-synthesis

Python 3.13 k

8 个月前

DmitryRyumin / ICCV-2023-Papers

#人脸识别#ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support ...

Python 955

1 年前

TIGER-AI-Lab / AnyV2V

#计算机科学#Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]

深度学习 generative-ai image-editing image-to-video-generation PyTorch video-editing video-synthesis

Jupyter Notebook 614

1 年前

DmitryRyumin / CVPR-2023-24-Papers

#人脸识别#CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included...

action-recognition autonomous-driving biometrics 机器视觉 cvpr cvpr2023 数据集深度学习 face-recognition gesture-recognition image-synthesis medical-image-processing multi-modal-learning pattern-recognition segmentation self-supervised-learning video-synthesis cvpr2024

Python 452

1 年前

TIGER-AI-Lab / ConsistI2V

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation [TMLR 2024]

diffusion-models image-to-video-generation video-generation video-synthesis

Python 251

1 年前

guyyariv / TempoTokens

#计算机科学#This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

ai-art 深度学习 diffusion-models generative-ai video-synthesis modelscope PyTorch

Python 128

7 个月前

Bomou-AI / Talking-Head

AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.

talking-head lip-sync talking-face talking-heads text-to-video video-synthesis wav2lip talking-head-videos

3 年前

brandon-rezko / HeyGem

#计算机科学#HeyGem — Your AI face, made free

deepfake 机器学习 speech-synthesis text-to-video video-synthesis 声音克隆

C 35

15 天前

shgaurav1 / DVG

#计算机科学#Diverse Video Generation using a Gaussian Process Trigger

FFmpeg gaussian-processes Video video-generation video-generator video-processing video-synthesis convolutional-neural-networks 深度学习深度神经网络机器学习

Python 18

3 年前

zhengqia / PlugLink_videosyn

视频合成（videosyn）是由PlugLink官方开发的多素材合成插件，主要用于矩阵号发布，解放双手。Video synthesis (videosyn) is a multi-material synthesis plugin developed by PlugLink, mainly used for matrix number publishing, freeing up your han...

Video video-generation video-synthesis

HTML 6

1 年前

adrianSRoman / SELDVisualSynth

Generating Diverse Audio-Visual 360º Soundscapes for Sound Event Localization and Detection

audio-visual-learning video-synthesis

Python 6

1 个月前

vonqo / devola2

devola2 is an open source real-time audio visualizer (video-synth) tool. Purposely designed for live-music performance of B.L.M.D and Even Tide.

audio-visualizer glsl-shaders p5js Three.js webgl video-synthesis

JavaScript 6

4 个月前

tasinislam21 / FashionFlow

This model synthesises high-fidelity fashion videos from single images featuring spontaneous and believable movements.

人工智能 attention-mechanism cross-attention diffusion-models fashion latent-diffusion video-synthesis

Python 5

1 年前

cskonopka / ofx-supplemental

Collection of openFrameworks video synthesis examples

fragment-shader glsl openframeworks shaders video-synthesis Xcode

Makefile 4

1 年前

Ramtin-ma / VideoSynopsis-FGS

机器视觉 object-detection surveillance-systems tracking video-analysis video-synthesis yolov8

Python 3

7 个月前

Yazdi9 / Nerf-Video-Synthesis

NeRF- Real-time View Synthesis

3D 机器视觉 nerf video-synthesis

Jupyter Notebook 1

2 年前

madebynanditaaa / lipgans

#计算机科学#LipGANs is a text-to-viseme GAN framework that generates realistic mouth movements directly from text, without requiring audio. It maps phonemes → visemes, predicts phoneme durations, and uses per-vi...

Web Accessibility (a11y)机器视觉深度学习 Generative Adversarial Network Keras lip-sync Python speech-synthesis Tensorflow video-synthesis

Python 0

9 天前

DeboJp / StoryTeller

#大语言模型#An automated video storytelling pipeline that turns online articles into narrated clips with custom scripts, titles, and visuals. Combines scraping, summarization, TTS, and video generation—ideal for ...

API 自动化大语言模型 prompt-engineering ranking tts video-synthesis webscraping moviepy pil

HTML 0

24 天前

Website
Wikipedia