Multi-agent Reinforcement Learning Via Adaptive Kalman Temporal Difference And Successor Representation
2021 Β· Mohammad Salimibeni, Arash Mohammadi, Parvin Malekzadeh, et al.
Abstract
Distributed Multi-Agent Reinforcement Learning (MARL) algorithms has attracted a surge of interest lately mainly due to the recent advancements of Deep Neural Networks (DNNs). Conventional Model-Based (MB) or Model-Free (MF) RL algorithms are not directly applicable to the MARL problems due to utilization of a fixed reward model for learning the underlying value function. While DNN-based solutions perform utterly well when a single agent is involved, such methods fail to fully generalize to the complexities of MARL problems. In other words, although recently developed approaches based on DNNs for multi-agent environments have achieved superior performance, they are still prone to overfiting, high sensitivity to parameter selection, and sample inefficiency. The paper proposes the Multi-Agent Adaptive Kalman Temporal Difference (MAK-TD) framework and its Successor Representation-based variant, referred to as the MAK-SR. Intuitively speaking, the main objective is to capitalize on unique
Authors
(none)
Tags
Stats
Related papers
- Hierarchical Deep Multiagent Reinforcement Learning With Temporal Abstraction (2018)0.00
- Decentralized Multi-agent Reinforcement Learning With Networked Agents: Recent Advances (2019)0.00
- Model-based Multi-agent Reinforcement Learning: Recent Progress And Prospects (2022)0.00
- Multi-agent Reinforcement Learning In Stochastic Networked Systems (2020)0.00
- A Review Of Cooperative Multi-agent Deep Reinforcement Learning (2019)19.08
- An Initial Introduction To Cooperative Multi-agent Reinforcement Learning (2024)0.00
- MM-KTD: Multiple Model Kalman Temporal Differences For Reinforcement Learning (2020)0.00
- Adaptability In Multi-agent Reinforcement Learning: A Framework And Unified Review (2025)0.00