Distributional Reward Estimation For Effective Multi-agent Deep Reinforcement Learning
2022 Β· Jifeng Hu, Yanchao Sun, Hechang Chen, et al.
Abstract
Multi-agent reinforcement learning has drawn increasing attention in practice, e.g., robotics and automatic driving, as it can explore optimal policies using samples generated by interacting with the environment. However, high reward uncertainty still remains a problem when we want to train a satisfactory model, because obtaining high-quality reward feedback is usually expensive and even infeasible. To handle this issue, previous methods mainly focus on passive reward correction. At the same time, recent active reward estimation methods have proven to be a recipe for reducing the effect of reward uncertainty. In this paper, we propose a novel Distributional Reward Estimation framework for effective Multi-Agent Reinforcement Learning (DRE-MARL). Our main idea is to design the multi-action-branch reward estimation and policy-weighted reward aggregation for stabilized training. Specifically, we design the multi-action-branch reward estimation to model reward distributions on all action br
Authors
(none)
Tags
Stats
Related papers
- Noise Distribution Decomposition Based Multi-agent Distributional Reinforcement Learning (2023)0.00
- Distributional Reinforcement Learning For Multi-dimensional Reward Functions (2021)0.00
- Toward Risk-based Optimistic Exploration For Cooperative Multi-agent Reinforcement Learning (2023)0.00
- DIFFER: Decomposing Individual Reward For Fair Experience Replay In Multi-agent Reinforcement Learning (2023)2.26
- GOV-REK: Governed Reward Engineering Kernels For Designing Robust Multi-agent Reinforcement Learning Systems (2024)0.00
- Weighted Double Deep Multiagent Reinforcement Learning In Stochastic Cooperative Environments (2018)0.00
- Deep Multiagent Reinforcement Learning: Challenges And Directions (2021)0.00
- Incentivize Without Bonus: Provably Efficient Model-based Online Multi-agent RL For Markov Games (2025)0.00