Research On Multi-agent Communication And Collaborative Decision-making Based On Deep Reinforcement Learning
2023 Β· Zeng da
Abstract
In a multi-agent environment, In order to overcome and alleviate the non-stationarity of the multi-agent environment, the mainstream method is to adopt the framework of Centralized Training Decentralized Execution (CTDE). This thesis is based on the framework of CTDE, and studies the cooperative decision-making of multi-agent based on the Multi-Agent Proximal Policy Optimization (MAPPO) algorithm for multi-agent proximal policy optimization. In order to alleviate the non-stationarity of the multi-agent environment, a multi-agent communication mechanism based on weight scheduling and attention module is introduced. Different agents can alleviate the non-stationarity caused by local observations through information exchange between agents, assisting in the collaborative decision-making of agents. The specific method is to introduce a communication module in the policy network part. The communication module is composed of a weight generator, a weight scheduler, a message encoder, a messag
Authors
(none)
Tags
Stats
Related papers
- Exploring Task-oriented Communication In Multi-agent System: A Deep Reinforcement Learning Approach (2022)0.00
- Learning Multi-agent Coordination Through Connectivity-driven Communication (2020)8.60
- Improving Coordination In Small-scale Multi-agent Deep Reinforcement Learning Through Memory-driven Communication (2019)12.25
- Fully Decentralized Cooperative Multi-agent Reinforcement Learning: A Survey (2024)0.00
- A New Framework For Multi-agent Reinforcement Learning -- Centralized Training And Exploration With Decentralized Execution Via Policy Distillation (2019)0.00
- DSDF: An Approach To Handle Stochastic Agents In Collaborative Multi-agent Reinforcement Learning (2021)0.00
- Delay-aware Multi-agent Reinforcement Learning For Cooperative And Competitive Environments (2020)0.00
- Contextual Knowledge Sharing In Multi-agent Reinforcement Learning With Decentralized Communication And Coordination (2025)0.00