Achieving Fairness In Multi-agent Markov Decision Processes Using Reinforcement Learning
2023 Β· Peizhong Ju, Arnob Ghosh, Ness B. Shroff
Abstract
Fairness plays a crucial role in various multi-agent systems (e.g., communication networks, financial markets, etc.). Many multi-agent dynamical interactions can be cast as Markov Decision Processes (MDPs). While existing research has focused on studying fairness in known environments, the exploration of fairness in such systems for unknown environments remains open. In this paper, we propose a Reinforcement Learning (RL) approach to achieve fairness in multi-agent finite-horizon episodic MDPs. Instead of maximizing the sum of individual agents' value functions, we introduce a fairness function that ensures equitable rewards across agents. Since the classical Bellman's equation does not hold when the sum of individual value functions is not maximized, we cannot use traditional approaches. Instead, in order to explore, we maintain a confidence bound of the unknown environment and then propose an online convex optimization based approach to obtain a policy constrained to this confidence
Authors
(none)
Tags
Stats
Related papers
- Striking A Balance In Fairness For Dynamic Systems Through Reinforcement Learning (2024)2.26
- What Hides Behind Unfairness? Exploring Dynamics Fairness In Reinforcement Learning (2024)0.95
- Learning Fair Policies In Multiobjective (deep) Reinforcement Learning With Average And Discounted Rewards (2020)0.00
- Optimal Decision-making In Mixed-agent Partially Observable Stochastic Environments Via Reinforcement Learning (2019)0.00
- Socially Fair Reinforcement Learning (2022)0.00
- Incentivize Without Bonus: Provably Efficient Model-based Online Multi-agent RL For Markov Games (2025)0.00
- Robust Cooperative Multi-agent Reinforcement Learning:a Mean-field Type Game Perspective (2024)0.00
- Past-discounting Is Key For Learning Markovian Fairness With Long Horizons (2025)0.00