Deep Decentralized Multi-task Multi-agent Reinforcement Learning Under Partial Observability
2017 · Shayegan Omidshafiei, Jason Pazis, Christopher Amato, et al.
Abstract
Many real-world tasks involve multiple agents with partial observability and limited communication. Learning is challenging in these settings due to local viewpoints of agents, which perceive the world as non-stationary due to concurrently-exploring teammates. Approaches that learn specialized policies for individual tasks face problems when applied to the real world: not only do agents have to learn and store distinct policies for each task, but in practice identities of tasks are often non-observable, making these approaches inapplicable. This paper formalizes and addresses the problem of multi-task multi-agent reinforcement learning under partial observability. We introduce a decentralized single-task learning approach that is robust to concurrent interactions of teammates, and present an approach for distilling single-task policies into a unified policy that performs well across multiple related tasks, without explicit provision of task identity.
Related papers
- Belief States For Cooperative Multi-agent Reinforcement Learning Under Partial Observability (2025) 0.00
- Macro-action-based Multi-agent/robot Deep Reinforcement Learning Under Partial Observability (2022) 5.84
- Centralized Model And Exploration Policy For Multi-agent RL (2021) 0.00
- Multi-agent Off-policy Actor-critic Reinforcement Learning For Partially Observable Environments (2024) 2.26
- Policy Distillation And Value Matching In Multiagent Reinforcement Learning (2019) 10.48
- Agent Modelling Under Partial Observability For Deep Reinforcement Learning (2020) 0.00
- Fully Decentralized Cooperative Multi-agent Reinforcement Learning: A Survey (2024) 0.00
- Individual Specialization In Multi-task Environments With Multiagent Reinforcement Learners (2019) 0.00