Embedded Universal Predictive Intelligence: A Coherent Framework For Multi-agent Learning
2025 · Alexander Meulemans, Rajai Nasser, MacIej Wołczyk, et al.
Abstract
The standard theory of model-free reinforcement learning assumes that the environment dynamics are stationary and that agents are decoupled from their environment, such that policies are treated as being separate from the world they inhabit. This leads to theoretical challenges in the multi-agent setting where the non-stationarity induced by the learning of other agents demands prospective learning based on prediction models. To accurately model other agents, an agent must account for the fact that those other agents are, in turn, forming beliefs about it to predict its future behavior, motivating agents to model themselves as part of the environment. Here, building upon foundational work on universal artificial intelligence (AIXI), we introduce a mathematical framework for prospective learning and embedded agency centered on self-prediction, where Bayesian RL agents predict both future perceptual inputs and their own actions, and must therefore resolve epistemic uncertainty about them
Authors
(none)
Tags
Stats
Related papers
- Learning Policy Representations In Multiagent Systems (2018)0.00
- A Unifying Framework For Action-conditional Self-predictive Reinforcement Learning (2024)0.00
- Innate-values-driven Reinforcement Learning Based Cooperative Multi-agent Cognitive Modeling (2024)0.00
- Emergent Social Learning Via Multi-agent Reinforcement Learning (2020)0.00
- Genai-based Multi-agent Reinforcement Learning Towards Distributed Agent Intelligence: A Generative-rl Agent Perspective (2025)0.00
- Theory Of Mind As Intrinsic Motivation For Multi-agent Reinforcement Learning (2023)0.00
- Discovering Latent States For Model Learning: Applying Sensorimotor Contingencies Theory And Predictive Processing To Model Context (2016)0.00
- Multi-task Multi-agent Shared Layers Are Universal Cognition Of Multi-agent Coordination (2023)0.00