Iterated Reasoning With Mutual Information In Cooperative And Byzantine Decentralized Teaming
2022 · Sachin Konan, Esmaeil Seraj, Matthew Gombolay
Abstract
Information sharing is key in building team cognition and enables coordination and cooperation. High-performing human teams also benefit from acting strategically with hierarchical levels of iterated communication and rationalizability, meaning a human agent can reason about the actions of their teammates in their decision-making. Yet the majority of prior work in Multi-Agent Reinforcement Learning (MARL) does not support iterated rationalizability and only encourages inter-agent communication, resulting in a suboptimal equilibrium cooperation strategy. In this work, we show that reformulating an agent's policy to be conditional on the policies of its neighboring teammates inherently maximizes a Mutual Information (MI) lower bound when optimizing under Policy Gradient (PG). Building on the idea of decision-making under bounded rationality and cognitive hierarchy theory, we show that our modified PG approach not only maximizes local agent rewards but also implicitly reasons about MI betwe
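To make the quantity in the abstract concrete, the sketch below computes the mutual information between two agents' discrete actions. It compares a policy for agent i that ignores agent j's action with one that is conditioned on it. This is only an illustration of the MI quantity, not the paper's algorithm or its lower bound. The function names, the two-action setup, and the probability values are all assumptions chosen for the example.

```python
import math


def mutual_information(joint):
    """Mutual information (in nats) of a joint table joint[a_i][a_j]."""
    p_i = [sum(row) for row in joint]          # marginal over agent i's actions
    p_j = [sum(col) for col in zip(*joint)]    # marginal over agent j's actions
    mi = 0.0
    for a, row in enumerate(joint):
        for b, p in enumerate(row):
            if p > 0.0:                        # zero-probability cells contribute nothing
                mi += p * math.log(p / (p_i[a] * p_j[b]))
    return mi


def joint_from_conditional(p_j, pi_i_given_j):
    """Build joint[a_i][a_j] = p_j[a_j] * pi_i_given_j[a_j][a_i]."""
    n_i = len(pi_i_given_j[0])
    return [
        [p_j[b] * pi_i_given_j[b][a] for b in range(len(p_j))]
        for a in range(n_i)
    ]


# Hypothetical numbers: agent j picks one of two actions uniformly.
p_j = [0.5, 0.5]

# Agent i ignores agent j: every row of the conditional is identical.
independent_policy = [[0.5, 0.5], [0.5, 0.5]]

# Agent i conditions on agent j: it mostly mirrors j's action.
conditioned_policy = [[0.9, 0.1], [0.1, 0.9]]

mi_independent = mutual_information(joint_from_conditional(p_j, independent_policy))
mi_conditioned = mutual_information(joint_from_conditional(p_j, conditioned_policy))

print(f"MI, independent policy: {mi_independent:.4f} nats")
print(f"MI, conditioned policy: {mi_conditioned:.4f} nats")
```

With these assumed numbers, the independent policy carries no information about the teammate's action, while the conditioned policy yields strictly positive MI. This dependence between teammates' actions is what the abstract says the modified PG objective implicitly encourages.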
Related papers
- PMIC: Improving Multi-agent Reinforcement Learning With Progressive Mutual Information Collaboration (2022)
- Emergent Cooperation Through Mutual Information Maximization (2020)
- Inducing Cooperation Via Team Regret Minimization Based Multi-agent Deep Reinforcement Learning (2019)
- Cautiously-optimistic Knowledge Sharing For Cooperative Multi-agent Reinforcement Learning (2023)
- Modelling Bounded Rationality In Multi-agent Interactions By Generalized Recursive Reasoning (2019)
- A Variational Approach To Mutual Information-based Coordination For Multi-agent Reinforcement Learning (2023)
- Tacit Learning With Adaptive Information Selection For Cooperative Multi-agent Reinforcement Learning (2024)
- Probing Dec-POMDP Reasoning In Cooperative MARL (2026)