Tactics Of Adversarial Attack On Deep Reinforcement Learning Agents
2017 · Yen-Chen Lin, Zhang-Wei Hong, Yuan-Hong Liao, et al.
Abstract
We introduce two tactics to attack agents trained by deep reinforcement learning algorithms using adversarial examples, namely the strategically-timed attack and the enchanting attack. In the strategically-timed attack, the adversary aims at minimizing the agent's reward by only attacking the agent at a small subset of time steps in an episode. Limiting the attack activity to this subset helps prevent detection of the attack by the agent. We propose a novel method to determine when an adversarial example should be crafted and applied. In the enchanting attack, the adversary aims at luring the agent to a designated target state. This is achieved by combining a generative model and a planning algorithm: while the generative model predicts the future states, the planning algorithm generates a preferred sequence of actions for luring the agent. A sequence of adversarial examples is then crafted to lure the agent to take the preferred sequence of actions. We apply the two tactics to the age
Related papers
- Adversarial Inception Backdoor Attacks Against Reinforcement Learning (2024)
- Adversarial Policies: Attacking Deep Reinforcement Learning (2019)
- RAT: Adversarial Attacks On Deep Reinforcement Agents For Targeted Behaviors (2024)
- Snooping Attacks On Deep Reinforcement Learning (2019)
- Understanding Adversarial Attacks On Observations In Deep Reinforcement Learning (2021)
- Robust Deep Reinforcement Learning Against Adversarial Behavior Manipulation (2024)
- TrojDRL: Trojan Attacks On Deep Reinforcement Learning Agents (2019)
- Optimal Attack And Defense For Reinforcement Learning (2023)