Risk-sensitive Multi-agent Reinforcement Learning In Network Aggregative Markov Games
2024 · Hafez Ghaemi, Hamed Kebriaei, Alireza Ramezani Moghaddam, et al.
Abstract
Classical multi-agent reinforcement learning (MARL) assumes risk neutrality and complete objectivity for agents. However, in settings where agents need to consider or model human economic or social preferences, a notion of risk must be incorporated into the RL optimization problem. This will be of greater importance in MARL where other human or non-human agents are involved, possibly with their own risk-sensitive policies. In this work, we consider risk-sensitive and non-cooperative MARL with cumulative prospect theory (CPT), a non-convex risk measure and a generalization of coherent measures of risk. CPT is capable of explaining loss aversion in humans and their tendency to overestimate/underestimate small/large probabilities. We propose a distributed sampling-based actor-critic (AC) algorithm with CPT risk for network aggregative Markov games (NAMGs), which we call Distributed Nested CPT-AC. Under a set of assumptions, we prove the convergence of the algorithm to a subjective notion
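As a rough illustration of the CPT risk measure named in the abstract, the sketch below estimates the CPT value of a return distribution from samples. It is not the paper's Distributed Nested CPT-AC algorithm. The power utility, the inverse-S probability weighting function, and the default parameters (`alpha`, `beta`, `lam`, `gamma_gain`, `gamma_loss`) are the commonly cited Tversky-Kahneman choices and are assumptions here; the function names `weight` and `cpt_value` are hypothetical.

```python
def weight(p, gamma):
    """Inverse-S probability weighting (assumed form): overweights small
    probabilities and underweights large ones when gamma < 1."""
    return p**gamma / (p**gamma + (1.0 - p) ** gamma) ** (1.0 / gamma)


def cpt_value(samples, alpha=0.88, beta=0.88, lam=2.25,
              gamma_gain=0.61, gamma_loss=0.69, reference=0.0):
    """Order-statistics estimate of the CPT value of a sampled return.

    Outcomes above `reference` are gains, outcomes below it are losses.
    Losses are scaled by `lam` > 1 to model loss aversion.
    All default parameter values are illustrative assumptions.
    """
    x = sorted(s - reference for s in samples)  # ascending order statistics
    n = len(x)
    total = 0.0
    for i, xi in enumerate(x, start=1):
        if xi > 0:
            # Gains are weighted by the distorted probability of doing at least this well.
            dw = weight((n + 1 - i) / n, gamma_gain) - weight((n - i) / n, gamma_gain)
            total += xi**alpha * dw
        elif xi < 0:
            # Losses are weighted by the distorted probability of doing at least this badly.
            dw = weight(i / n, gamma_loss) - weight((i - 1) / n, gamma_loss)
            total -= lam * (-xi) ** beta * dw
    return total


if __name__ == "__main__":
    # A fair +/-1 gamble has zero mean but a negative CPT value under loss aversion.
    print(cpt_value([-1.0, 1.0]))
```

Setting `alpha = beta = lam = gamma_gain = gamma_loss = 1` makes the utility and weighting functions linear, so the estimate reduces to the sample mean, which corresponds to the risk-neutral case of classical MARL.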
Related papers
- Breaking The Curse Of Multiagency In Robust Multi-agent Reinforcement Learning (2024)
- Incentivize Without Bonus: Provably Efficient Model-based Online Multi-agent RL For Markov Games (2025)
- Multi-agent Reinforcement Learning In Stochastic Networked Systems (2020)
- Risk-aware Distributed Multi-agent Reinforcement Learning (2023)
- Taming Equilibrium Bias In Risk-sensitive Multi-agent Reinforcement Learning (2024)
- Optimism As Risk-seeking In Multi-agent Reinforcement Learning (2025)
- Toward Risk-based Optimistic Exploration For Cooperative Multi-agent Reinforcement Learning (2023)
- Minimax-optimal Multi-agent RL In Markov Games With A Generative Model (2022)