Snooping Attacks On Deep Reinforcement Learning
2019 · Matthew Inkawhich, Yiran Chen, Hai Li
Abstract
Adversarial attacks have exposed a significant security vulnerability in state-of-the-art machine learning models. These models include deep reinforcement learning agents. The existing methods for attacking reinforcement learning agents assume the adversary has access to either the target agent's learned parameters or the environment that the agent interacts with. In this work, we propose a new class of threat models, called snooping threat models, that are unique to reinforcement learning. In these snooping threat models, the adversary does not have the ability to interact with the target agent's environment, and can only eavesdrop on the action and reward signals being exchanged between agent and environment. We show that adversaries operating in these highly constrained threat models can still launch devastating attacks against the target agent by training proxy models on related tasks and leveraging the transferability of adversarial examples.
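The transfer step mentioned at the end of the abstract can be illustrated with a toy sketch. Everything here is an assumption made for illustration and is not taken from the paper: the policies are linear softmax models, the proxy is simulated as a noisy copy of the target rather than trained on a related task, and the perturbation is a single fast gradient sign method (FGSM) step. The point is only that the perturbation is computed from the proxy's gradients and then applied to an input the target sees.

```python
import numpy as np

def softmax(z):
    z = z - z.max()              # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def cross_entropy(W, x, action):
    """Loss of a linear softmax policy with weights W for choosing `action` on input x."""
    return -np.log(softmax(W @ x)[action])

def fgsm_on_proxy(W_proxy, x, eps):
    """Craft an L-infinity bounded perturbation using only the proxy's gradients."""
    probs = softmax(W_proxy @ x)
    action = int(np.argmax(probs))           # action the proxy currently prefers
    onehot = np.zeros_like(probs)
    onehot[action] = 1.0
    grad_x = W_proxy.T @ (probs - onehot)    # gradient of the cross-entropy w.r.t. x
    return x + eps * np.sign(grad_x), action

rng = np.random.default_rng(0)
W_target = rng.normal(size=(4, 8))                   # hypothetical target policy (unseen by the attacker)
W_proxy = W_target + 0.1 * rng.normal(size=(4, 8))   # toy stand-in for a proxy model
x = rng.normal(size=8)                               # hypothetical observation

x_adv, a = fgsm_on_proxy(W_proxy, x, eps=0.5)
proxy_loss_before = cross_entropy(W_proxy, x, a)
proxy_loss_after = cross_entropy(W_proxy, x_adv, a)

# Whether the attack transfers is an empirical question: compare the target's choices.
target_action_before = int(np.argmax(W_target @ x))
target_action_after = int(np.argmax(W_target @ x_adv))
```

In this toy setting the proxy's loss cannot decrease after the step, because the loss is convex in the input. Whether `target_action_after` differs from `target_action_before` depends on how closely the proxy resembles the target, which is the question the snooping threat models raise.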
Related papers
- TrojDRL: Trojan Attacks On Deep Reinforcement Learning Agents (2019)
- Adversarial Inception Backdoor Attacks Against Reinforcement Learning (2024)
- Observed Adversaries In Deep Reinforcement Learning (2022)
- Understanding Adversarial Attacks On Observations In Deep Reinforcement Learning (2021)
- SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning Agents (2024)
- Adversarial Policies: Attacking Deep Reinforcement Learning (2019)
- Robust Deep Reinforcement Learning Against Adversarial Behavior Manipulation (2024)
- Tactics Of Adversarial Attack On Deep Reinforcement Learning Agents (2017)