Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model
#计算机科学#Collect some World Models for Autonomous Driving (and Robotic) papers.
Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
Build, evaluate and train General Multi-Agent Assistance with ease
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
[ICCV 2025] Aether: Geometric-Aware Unified World Modeling
#计算机科学#A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related website...
[CVPR 2024 Highlight] Visual Point Cloud Forecasting
A skill-based platform for ROS v.2 with knowledge representating, planning and reasoning
Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"
DeepVerse: 4D Autoregressive Video Generation as a World Model
[ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"
[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".
#自然语言处理#[NeurIPS 2024] Agent Planning with World Knowledge Model
Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223
#大语言模型#A general AI agent framework that can be adapted to various tasks and environments.
#大语言模型#A general AI agent framework that can be adapted to various tasks and environments.
Official repository for "RLVR-World: Training World Models with Reinforcement Learning", https://arxiv.org/abs/2505.13934