Voting-based Multi-agent Reinforcement Learning For Intelligent IoT
2019 · Yue Xu, Zengde Deng, Mengdi Wang, et al.
Abstract
The recent success of single-agent reinforcement learning (RL) in Internet of Things (IoT) systems motivates the study of multi-agent reinforcement learning (MARL), which is more challenging but more useful in large-scale IoT. In this paper, we consider a voting-based MARL problem, in which the agents vote to make group decisions and the goal is to maximize the globally averaged returns. To this end, we formulate the MARL problem based on the linear programming form of the policy optimization problem and propose a distributed primal-dual algorithm to obtain the optimal solution. We also propose a voting mechanism through which the distributed learning achieves the same sublinear convergence rate as centralized learning. In other words, the distributed decision making does not slow down the process of achieving global consensus on optimality. Lastly, we verify the convergence of our proposed algorithm with numerical simulations and conduct case studies in practical multi-agent IoT systems.
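For orientation, the linear programming form of policy optimization mentioned in the abstract can be sketched as follows. This is the standard textbook LP for an average-reward MDP, not necessarily the paper's exact formulation. All notation here is an assumption introduced for illustration: $N$ agents, state space $\mathcal{S}$, action space $\mathcal{A}$, transition kernel $P$, per-agent rewards $r_i$, stationary state-action distribution $\mu$, and dual variable $v$.

```latex
% Assumed notation (not taken from the source): N agents, rewards r_i,
% transition kernel P, stationary state-action distribution \mu.
\begin{align*}
\bar r(s,a) &= \frac{1}{N}\sum_{i=1}^{N} r_i(s,a)
  && \text{(globally averaged reward)} \\[4pt]
\max_{\mu \ge 0}\quad & \sum_{s\in\mathcal{S}}\sum_{a\in\mathcal{A}} \mu(s,a)\,\bar r(s,a) \\
\text{s.t.}\quad & \sum_{a\in\mathcal{A}} \mu(s',a)
  = \sum_{s\in\mathcal{S}}\sum_{a\in\mathcal{A}} P(s' \mid s,a)\,\mu(s,a)
  && \forall\, s' \in \mathcal{S}, \\
& \sum_{s\in\mathcal{S}}\sum_{a\in\mathcal{A}} \mu(s,a) = 1 . \\[4pt]
% Lagrangian saddle point that a primal-dual method would iterate on:
\min_{v}\ \max_{\mu \in \Delta}\quad &
  \sum_{s,a} \mu(s,a)\Big[\bar r(s,a) + \sum_{s'} P(s' \mid s,a)\,v(s') - v(s)\Big] \\[4pt]
\pi(a \mid s) &= \frac{\mu(s,a)}{\sum_{a'\in\mathcal{A}} \mu(s,a')}
  && \text{(policy recovered from } \mu\text{)}
\end{align*}
```

Here $\Delta$ denotes the probability simplex over state-action pairs. Under this reading, each agent only observes its own reward $r_i$, so a distributed method has to estimate the averaged objective collectively; how the paper's voting mechanism does this is not specified in the abstract.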
Related papers
- Decentralized Multi-agent Reinforcement Learning With Networked Agents: Recent Advances (2019)
- Applications Of Multi-agent Reinforcement Learning In Future Internet: A Comprehensive Survey (2021)
- Fully Decentralized Multi-agent Reinforcement Learning With Networked Agents (2018)
- Multi-agent Reinforcement Learning Via Double Averaging Primal-dual Optimization (2018)
- A Review Of Cooperative Multi-agent Deep Reinforcement Learning (2019)
- Mean-field Multi-agent Reinforcement Learning: A Decentralized Network Approach (2021)
- Multi-agent Reinforcement Learning For Resources Allocation Optimization: A Survey (2025)
- Multi-agent Reinforcement Learning In Stochastic Networked Systems (2020)