Mastering Diverse Domains through World Models
#计算机科学#DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
#计算机科学#Mastering Atari with Discrete World Models
#计算机科学#Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving
#计算机科学#Dream to Control: Learning Behaviors by Latent Imagination
#计算机科学#A curated list of world models for autonomous driving. Keep updated.
DayDreamer: World Models for Physical Robot Learning
A most Frontend Collection and survey of vision-language model papers, and models GitHub repository. Continuous updates.
[ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.
A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.
World Model based Autonomous Driving Platform in CARLA 🚗
《多模态大模型:新一代人工智能技术范式》作者:刘阳,林倞
A structured implementation of MuZero
Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"
[ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"
#计算机科学#Implementation of a framework for Genie2 in Pytorch