Actor-critic Algorithms For Constrained Multi-agent Reinforcement Learning
2019 Β· Raghuram Bharadwaj Diddigi, Sai Koti Reddy Danda, Prabuchandran K. J., et al.
Abstract
In cooperative stochastic games multiple agents work towards learning joint optimal actions in an unknown environment to achieve a common goal. In many real-world applications, however, constraints are often imposed on the actions that can be jointly taken by the agents. In such scenarios the agents aim to learn joint actions to achieve a common goal (minimizing a specified cost function) while meeting the given constraints (specified via certain penalty functions). In this paper, we consider the relaxation of the constrained optimization problem by constructing the Lagrangian of the cost and penalty functions. We propose a nested actor-critic solution approach to solve this relaxed problem. In this approach, an actor-critic scheme is employed to improve the policy for a given Lagrange parameter update on a faster timescale as in the classical actor-critic architecture. A meta actor-critic scheme using this faster timescale policy updates is then employed to improve the Lagrange parame
Authors
(none)
Tags
Stats
Related papers
- Attention Actor-critic Algorithm For Multi-agent Constrained Co-operative Reinforcement Learning (2021)0.00
- Actor-attention-critic For Multi-agent Reinforcement Learning (2018)0.00
- Actor-critic Policy Optimization In Partially Observable Multiagent Environments (2018)0.00
- Multi-agent Actor-critic For Mixed Cooperative-competitive Environments (2017)0.00
- Natural Policy Gradient And Actor Critic Methods For Constrained Multi-task Reinforcement Learning (2024)0.00
- Bi-level Actor-critic For Multi-agent Coordination (2019)0.00
- Multi-agent Natural Actor-critic Reinforcement Learning Algorithms (2021)3.58
- Context-aware Bayesian Network Actor-critic Methods For Cooperative Multi-agent Reinforcement Learning (2023)0.00