world-models.io - The Knowledge Hub for AI World Models
world-models.io is a structured knowledge platform covering AI world models, from model-based reinforcement learning to foundation world models, embodied AI, and simulation engines.
Explore world models like DreamerV3, Genie 3, V-JEPA 2, NVIDIA Cosmos, MuZero, Sora, DIAMOND, OASIS, LeWorldModel, PixVerse R1, and more. Compare architectures, read research, and discover the labs building the future of AI.
Featured World Models
Genie 3 - Google DeepMind's real-time interactive 3D world generation from text at 24fps
V-JEPA 2 - Meta FAIR's self-supervised world model for visual understanding and zero-shot robot control
DreamerV3 - General algorithm mastering diverse domains through world model learning
NVIDIA Cosmos - World foundation model platform for physical AI
Genie 2 - Interactive 3D environment generation from single images
Sora - OpenAI's video generation model with emergent world understanding
MuZero - Superhuman gameplay through learned planning
LeWorldModel - Compact 15M-parameter JEPA that learns physics on a single GPU
V-JEPA - Self-supervised visual world model by Meta FAIR
DIAMOND - Diffusion-based world model for reinforcement learning
OASIS - Open-source real-time neural game engine
PixVerse R1 - First multiplayer real-time world model with shared worlds
TD-MPC2 - Scalable world model agent for 104+ tasks
IRIS - Autoregressive world model bridging language modeling and RL
GAIA-1 - Generative world model for autonomous driving
Copilot4D - 4D LiDAR world model for self-driving
I-JEPA - Image-based Joint Embedding Predictive Architecture
GameNGen - Neural game engine simulating DOOM in real time
3D-VLA - 3D vision-language-action model for embodied AI
Pandora - Generative world model for open-ended environments
PlaNet - Deep Planning Network for latent space control
RSSM - Foundational architecture behind the Dreamer family
DreamerV2 - First model-based agent to achieve human-level performance on Atari
World Models (Ha & Schmidhuber) - The seminal 2018 paper that popularized world models
Predictron - End-to-end learning and planning via abstract world models
UniSim - Universal simulator for real-world interactions
I2A - Imagination-augmented agents for deep RL
VPN - Value Prediction Network for abstract planning
Emu Video - Factorized video generation via image conditioning
AMI World Model - Multimodal foundation model for embodied AI
RT-2 - Vision-Language-Action model for robotic control
Gen-3 Alpha - Advanced video generation by Runway
Stable Video Diffusion - Open-source video generation foundation model
MILE - Model-based imitation learning for autonomous driving
LWM - Large World Model processing 1M+ tokens of video and text
Genie - First generative interactive environment from unlabeled video
STEVE-1 - Instruction-following agent for Minecraft