Cooperative Backdoor Attack In Decentralized Reinforcement Learning With Theoretical Guarantee
2024 · Mengtong Gao, Yifei Zou, Zuyuan Zhang, et al.
Abstract
The safety of decentralized reinforcement learning (RL) is a challenging problem, since malicious agents can share their poisoned policies with benign agents. This paper investigates a cooperative backdoor attack in a decentralized RL scenario. Unlike existing methods, which hide a whole backdoor attack behind a single shared policy, our method decomposes the backdoor behavior into multiple components according to the state space of RL. Each malicious agent hides one component in its policy and shares that policy with the benign agents. Once a benign agent has learned all the poisoned policies, the backdoor attack is assembled in its own policy. A theoretical proof is given to show that our cooperative method can successfully inject the backdoor into the RL policies of benign agents. Compared with existing backdoor attacks, our cooperative method is more covert, since the policy from each attacker contains only one component of the backdoor attack and is harder to d
Related papers
- A Spatiotemporal Stealthy Backdoor Attack Against Cooperative Multi-agent Deep Reinforcement Learning (2024)
- SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning Agents (2024)
- PolicyCleanse: Backdoor Detection And Mitigation In Reinforcement Learning (2022)
- Backdoor Attacks On Multiagent Collaborative Systems (2022)
- Efficient Reward Poisoning Attacks On Online Deep Reinforcement Learning (2022)
- Adversarial Inception Backdoor Attacks Against Reinforcement Learning (2024)
- Beyond Training-time Poisoning: Component-level And Post-training Backdoors In Deep Reinforcement Learning (2025)
- Constrained Black-box Attacks Against Cooperative Multi-agent Reinforcement Learning (2025)