world-models.io — The Knowledge Hub for AI World Models
world-models.io is a structured knowledge platform covering AI world models — from model-based reinforcement learning to foundation world models, embodied AI, and simulation engines.
Explore world models like DreamerV3, Genie 3, V-JEPA 2, NVIDIA Cosmos, MuZero, Sora, DIAMOND, OASIS, LeWorldModel, PixVerse R1, and more. Compare architectures, read research, and discover the labs building the future of AI.
Featured World Models
Genie 3 — Google DeepMind's real-time interactive 3D world generation from text at 24fps
V-JEPA 2 — Meta FAIR's self-supervised world model for visual understanding and zero-shot robot control
DreamerV3 — General algorithm mastering diverse domains through world model learning
NVIDIA Cosmos — World foundation model platform for physical AI
Genie 2 — Interactive 3D environment generation from single images
Sora — OpenAI's video generation model with emergent world understanding
MuZero — Superhuman game play through learned planning
LeWorldModel — Compact 15M-parameter JEPA that learns physics on a single GPU
V-JEPA — Self-supervised visual world model by Meta FAIR
DIAMOND — Diffusion-based world model for reinforcement learning
OASIS — Open-source real-time neural game engine
PixVerse R1 — First multiplayer real-time world model with shared worlds
TD-MPC2 — Scalable world model agent for 104+ tasks
IRIS — Autoregressive world model bridging language modeling and RL
GAIA-1 — Generative world model for autonomous driving
Copilot4D — 4D LiDAR world model for self-driving
I-JEPA — Image-based Joint Embedding Predictive Architecture
GameNGen — Neural game engine simulating DOOM in real-time
3D-VLA — 3D vision-language-action model for embodied AI
Pandora — Generative world model for open-ended environments
PlaNet — Deep Planning Network for latent space control
RSSM — Foundational architecture behind the Dreamer family
DreamerV2 — First model-based agent to achieve human-level Atari
World Models (Ha & Schmidhuber) — The seminal 2018 paper that popularized world models
Predictron — End-to-end learning and planning via abstract world models
UniSim — Universal simulator for real-world interactions
I2A — Imagination-augmented agents for deep RL
VPN — Value Prediction Network for abstract planning
Emu Video — Factorized video generation via image conditioning
AMI World Model — Multimodal foundation model for embodied AI
RT-2 — Vision-Language-Action model for robotic control
Gen-3 Alpha — Advanced video generation by Runway
Stable Video Diffusion — Open-source video generation foundation model
MILE — Model-based imitation learning for autonomous driving
LWM — Large World Model processing 1M+ tokens of video and text
Genie — First generative interactive environment from unlabeled video
STEVE-1 — Instruction-following agent for Minecraft
Explore
Research Topics
Comparisons
Guides
Labs & Organizations
Categories
Knowledge Hub