Individual Specialization In Multi-task Environments With Multiagent Reinforcement Learners
2019 · Marco Jerome Gasparrini, Ricard Solé, Martí Sánchez-Fibla
Abstract
There is a growing interest in Multi-Agent Reinforcement Learning (MARL) as the first steps towards building general intelligent agents that learn to make low and high-level decisions in non-stationary complex environments in the presence of other agents. Previous results point us towards increased conditions for coordination, efficiency/fairness, and common-pool resource sharing. We further study coordination in multi-task environments where several rewarding tasks can be performed and thus agents don't necessarily need to perform well in all tasks, but under certain conditions may specialize. An observation derived from the study is that epsilon greedy exploration of value-based reinforcement learning methods is not adequate for multi-agent independent learners because the epsilon parameter that controls the probability of selecting a random action synchronizes the agents artificially and forces them to have deterministic policies at the same time. By using policy-based methods with
Authors
(none)
Tags
Stats
Related papers
- Policy Distillation And Value Matching In Multiagent Reinforcement Learning (2019)10.48
- Attention-driven Multi-agent Reinforcement Learning: Enhancing Decisions With Expertise-informed Tasks (2024)4.52
- Benchmarking Multi-agent Deep Reinforcement Learning Algorithms In Cooperative Tasks (2020)0.00
- Hypermarl: Adaptive Hypernetworks For Multi-agent RL (2024)0.00
- Ensemble Value Functions For Efficient Exploration In Multi-agent Reinforcement Learning (2023)0.00
- Coordinated Exploration Via Intrinsic Rewards For Multi-agent Reinforcement Learning (2019)0.00
- Prioritized Guidance For Efficient Multi-agent Reinforcement Learning Exploration (2019)0.00
- Multi-agent Reinforcement Learning In Stochastic Networked Systems (2020)0.00