Learning To Communicate In Multi-agent Reinforcement Learning For Autonomous Cyber Defence
2025 Β· Faizan Contractor, Li Li, Ranwa Al Mallah
Abstract
Popular methods in cooperative Multi-Agent Reinforcement Learning with partially observable environments typically allow agents to act independently during execution, which may limit the coordinated effect of the trained policies. However, by sharing information such as known or suspected ongoing threats, effective communication can lead to improved decision-making in the cyber battle space. We propose a game design where defender agents learn to communicate and defend against imminent cyber threats by playing training games in the Cyber Operations Research Gym, using the Differentiable Inter Agent Learning algorithm adapted to the cyber operational environment. The tactical policies learned by these autonomous agents are akin to those of human experts during incident responses to avert cyber threats. In addition, the agents simultaneously learn minimal cost communication messages while learning their defence tactical policies.
Authors
(none)
Tags
Stats
Related papers
- Beyond Rewards In Reinforcement Learning For Cyber Defence (2026)0.00
- Robust Communicative Multi-agent Reinforcement Learning With Active Defense (2023)0.00
- Mixed Cooperative-competitive Communication Using Multi-agent Reinforcement Learning (2021)5.84
- Delay-aware Multi-agent Reinforcement Learning For Cooperative And Competitive Environments (2020)0.00
- Learning Practical Communication Strategies In Cooperative Multi-agent Reinforcement Learning (2022)0.00
- Constrained Black-box Attacks Against Cooperative Multi-agent Reinforcement Learning (2025)0.00
- Improved Reinforcement Learning In Cooperative Multi-agent Environments Using Knowledge Transfer (2021)0.00
- Robust Multi-agent Communication Based On Decentralization-oriented Adversarial Training (2025)0.00