# world-models

https://static.github-zh.com/github_avatars/danijar?size=40
Python 2.13 k
5 months ago
https://static.github-zh.com/github_avatars/Tencent-Hunyuan?size=40

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2.13 k
13 days ago
https://static.github-zh.com/github_avatars/eloialonso?size=40

#Computer Science# DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1.87 k
9 months ago
https://static.github-zh.com/github_avatars/Tencent-Hunyuan?size=40

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1.1 k
5 days ago
https://static.github-zh.com/github_avatars/eloialonso?size=40
Python 840
1 year ago
https://static.github-zh.com/github_avatars/OpenDriveLab?size=40

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving

Python 753
2 months ago
https://static.github-zh.com/github_avatars/danijar?size=40
Python 552
4 years ago
https://static.github-zh.com/github_avatars/SkyworkAI?size=40

Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.

Python 477
13 days ago
https://static.github-zh.com/github_avatars/zli12321?size=40

A collection and survey of frontier vision-language model papers and models, maintained as a GitHub repository. Continuously updated.

364
10 days ago
https://static.github-zh.com/github_avatars/runjiali-rl?size=40

[ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

Python 363
2 months ago
https://static.github-zh.com/github_avatars/danijar?size=40

DayDreamer: World Models for Physical Robot Learning

Jupyter Notebook 348
3 years ago
https://static.github-zh.com/github_avatars/UMass-Embodied-AGI?size=40

ICCV 2025 | TesserAct: Learning 4D Embodied World Models

Python 328
1 month ago
https://static.github-zh.com/github_avatars/SenseTime-FVG?size=40

An open-source code repository of driving world models, with training, inference, and evaluation tools, and pretrained checkpoints.

Python 284
3 months ago
https://static.github-zh.com/github_avatars/zhanghm1995?size=40

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

266
1 year ago
https://static.github-zh.com/github_avatars/HCPLab-SYSU?size=40

"Multimodal Large Models: A New-Generation Paradigm of Artificial Intelligence Technology", by Liu Yang and Lin Liang

HTML 239
9 months ago
https://static.github-zh.com/github_avatars/YvanYin?size=40

Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"

Python 208
8 months ago
https://static.github-zh.com/github_avatars/johan-gras?size=40
Python 205
3 years ago