Language-conditioned World Model Improves Policy Generalization By Reading Environmental Descriptions
2025 Β· Anh Nguyen, Stefan Lee
Abstract
To interact effectively with humans in the real world, it is important for agents to understand language that describes the dynamics of the environment--that is, how the environment behaves--rather than just task instructions specifying "what to do". Understanding this dynamics-descriptive language is important for human-agent interaction and agent behavior. Recent work address this problem using a model-based approach: language is incorporated into a world model, which is then used to learn a behavior policy. However, these existing methods either do not demonstrate policy generalization to unseen games or rely on limiting assumptions. For instance, assuming that the latency induced by inference-time planning is tolerable for the target task or expert demonstrations are available. Expanding on this line of research, we focus on improving policy generalization from a language-conditioned world model while dropping these assumptions. We propose a model-based reinforcement learning appro
Authors
(none)
Tags
Stats
Related papers
- Mental Modeling Of Reinforcement Learning Agents By Language Models (2024)0.00
- General Agents Contain World Models (2025)0.00
- World Models As An Intermediary Between Agents And The Real World (2026)0.00
- Co-evolution Of Policy And Internal Reward For Language Agents (2026)0.00
- Enhancing Vision-language Model Training With Reinforcement Learning In Synthetic Worlds For Real-world Success (2025)0.00
- From Laws To Motivation: Guiding Exploration Through Law-based Reasoning And Rewards (2024)0.00
- Procedural Generalization By Planning With Self-supervised World Models (2021)0.00
- Rlzero: Direct Policy Inference From Language Without In-domain Supervision (2024)0.00