Multi-agent Reinforcement Learning In Bayesian Stackelberg Markov Games For Adaptive Moving Target Defense
2020 Β· Sailik Sengupta, Subbarao Kambhampati
Abstract
The field of cybersecurity has mostly been a cat-and-mouse game with the discovery of new attacks leading the way. To take away an attacker's advantage of reconnaissance, researchers have proposed proactive defense methods such as Moving Target Defense (MTD). To find good movement strategies, researchers have modeled MTD as leader-follower games between the defender and a cyber-adversary. We argue that existing models are inadequate in sequential settings when there is incomplete information about a rational adversary and yield sub-optimal movement strategies. Further, while there exists an array of work on learning defense policies in sequential settings for cyber-security, they are either unpopular due to scalability issues arising out of incomplete information or tend to ignore the strategic nature of the adversary simplifying the scenario to use single-agent reinforcement learning techniques. To address these concerns, we propose (1) a unifying game-theoretic model, called the Baye
Authors
(none)
Tags
Stats
Related papers
- Model-free Reinforcement Learning For Stochastic Stackelberg Security Games (2020)5.24
- Learning To Communicate In Multi-agent Reinforcement Learning For Autonomous Cyber Defence (2025)0.00
- Stackelberg Games For Learning Emergent Behaviors During Competitive Autocurricula (2023)5.84
- Imitation Learning Of Correlated Policies In Stackelberg Games (2025)0.00
- SUB-PLAY: Adversarial Policies Against Partially Observed Multi-agent Reinforcement Learning Systems (2024)0.00
- Delay-aware Multi-agent Reinforcement Learning For Cooperative And Competitive Environments (2020)0.00
- Camouflage Adversarial Attacks On Multiple Agent Systems (2024)0.00
- Learning In Markov Games With Adaptive Adversaries: Policy Regret, Fundamental Barriers, And Efficient Algorithms (2024)0.00