Networked Agents In The Dark: Team Value Learning Under Partial Observability
2025 · Guilherme S. Varela, Alberto Sardinha, Francisco S. Melo
Abstract
We propose a novel cooperative multi-agent reinforcement learning (MARL) approach for networked agents. In contrast to previous methods that rely on complete state information or joint observations, our agents must learn how to reach shared objectives under partial observability. During training, they collect individual rewards and approximate a team value function through local communication, which results in cooperative behavior. To describe our problem, we introduce the networked dynamic partially observable Markov game framework, in which agents communicate over a communication network with a switching topology. Our distributed method, DNA-MARL, uses a consensus mechanism for local communication and gradient descent for local computation. DNA-MARL broadens the range of possible applications of networked agents, being well suited to real-world domains that impose privacy constraints and where messages may not reach their recipients. We evaluate DNA-MARL across benchmark MARL scenarios. Our results
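The consensus mechanism mentioned in the abstract can be illustrated with a small sketch. This is not the DNA-MARL algorithm itself: it only shows how agents that each hold a private scalar (here, an individual reward) could approach the team average by repeatedly mixing values with their current neighbors while the communication graph switches. The function names, the Metropolis weighting rule, and the four-agent example are assumptions chosen for illustration and do not come from the paper.

```python
import numpy as np


def metropolis_weights(adj):
    """Build a symmetric, doubly stochastic mixing matrix for an undirected graph.

    adj is an (n, n) 0/1 adjacency matrix without self-loops.
    The Metropolis rule is an assumed choice, not taken from the paper.
    """
    n = adj.shape[0]
    deg = adj.sum(axis=1)
    weights = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            if i != j and adj[i, j]:
                weights[i, j] = 1.0 / (1.0 + max(deg[i], deg[j]))
        # The self-weight absorbs the remainder so each row sums to one.
        weights[i, i] = 1.0 - weights[i].sum()
    return weights


def run_consensus(local_values, topologies):
    """Mix local values over a sequence of (possibly different) graphs.

    Each agent only combines its own value with those of its current neighbors.
    """
    values = np.asarray(local_values, dtype=float).copy()
    for adj in topologies:
        values = metropolis_weights(adj) @ values
    return values


# Hypothetical example: four agents, two alternating topologies.
# Neither graph is connected on its own, but their union is.
graph_a = np.array([[0, 1, 0, 0],
                    [1, 0, 0, 0],
                    [0, 0, 0, 1],
                    [0, 0, 1, 0]])
graph_b = np.array([[0, 0, 0, 1],
                    [0, 0, 1, 0],
                    [0, 1, 0, 0],
                    [1, 0, 0, 0]])

individual_rewards = [1.0, 2.0, 3.0, 4.0]
team_estimates = run_consensus(individual_rewards, [graph_a, graph_b] * 10)
```

Because the mixing matrices are doubly stochastic, the average of the local values is preserved at every round, so each agent's estimate can only move toward the team mean without any agent ever seeing all rewards at once.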
Related papers
- Fully Decentralized Multi-agent Reinforcement Learning With Networked Agents (2018)
- Learning To Share In Multi-agent Reinforcement Learning (2021)
- Value Propagation For Decentralized Networked Deep Multi-agent Reinforcement Learning (2019)
- Local Advantage Networks For Cooperative Multi-agent Reinforcement Learning (2021)
- Inducing Cooperation Via Team Regret Minimization Based Multi-agent Deep Reinforcement Learning (2019)
- Mean-field Multi-agent Reinforcement Learning: A Decentralized Network Approach (2021)
- Multi-agent Reinforcement Learning In Stochastic Networked Systems (2020)
- Cooperative Multi-agent Reinforcement Learning With Partial Observations (2020)