A Spatiotemporal Stealthy Backdoor Attack Against Cooperative Multi-agent Deep Reinforcement Learning
2024 Β· Yinbo Yu, Saihao Yan, Jiajia Liu
Abstract
Recent studies have shown that cooperative multi-agent deep reinforcement learning (c-MADRL) is under the threat of backdoor attacks. Once a backdoor trigger is observed, it will perform abnormal actions leading to failures or malicious goals. However, existing proposed backdoors suffer from several issues, e.g., fixed visual trigger patterns lack stealthiness, the backdoor is trained or activated by an additional network, or all agents are backdoored. To this end, in this paper, we propose a novel backdoor attack against c-MADRL, which attacks the entire multi-agent team by embedding the backdoor only in a single agent. Firstly, we introduce adversary spatiotemporal behavior patterns as the backdoor trigger rather than manual-injected fixed visual patterns or instant status and control the attack duration. This method can guarantee the stealthiness and practicality of injected backdoors. Secondly, we hack the original reward function of the backdoored agent via reward reverse and unil
Authors
(none)
Tags
Stats
Related papers
- Backdoor Attacks On Multiagent Collaborative Systems (2022)0.00
- Cooperative Backdoor Attack In Decentralized Reinforcement Learning With Theoretical Guarantee (2024)0.00
- Adversarial Inception Backdoor Attacks Against Reinforcement Learning (2024)0.00
- Constrained Black-box Attacks Against Cooperative Multi-agent Reinforcement Learning (2025)0.00
- SAJA: A State-action Joint Attack Framework On Multi-agent Deep Reinforcement Learning (2025)0.00
- Cuda2: An Approach For Incorporating Traitor Agents Into Cooperative Multi-agent Systems (2024)0.00
- Attacking Cooperative Multi-agent Reinforcement Learning By Adversarial Minority Influence (2023)0.00
- Policycleanse: Backdoor Detection And Mitigation In Reinforcement Learning (2022)0.00