Visualizing And Understanding Atari Agents
2017 Β· Sam Greydanus, Anurag Koul, Jonathan Dodge, et al.
Abstract
While deep reinforcement learning (deep RL) agents are effective at maximizing rewards, it is often unclear what strategies they use to do so. In this paper, we take a step toward explaining deep RL agents through a case study using Atari 2600 environments. In particular, we focus on using saliency maps to understand how an agent learns and executes a policy. We introduce a method for generating useful saliency maps and use it to show 1) what strong agents attend to, 2) whether agents are making decisions for the right or wrong reasons, and 3) how agents evolve during learning. We also test our method on non-expert human subjects and find that it improves their ability to reason about these agents. Overall, our results show that saliency information can provide significant insight into an RL agent's decisions and learning behavior.
Authors
(none)
Tags
Stats
Related papers
- Learn To Interpret Atari Agents (2018)0.00
- Exploratory Not Explanatory: Counterfactual Analysis Of Saliency Maps For Deep Reinforcement Learning (2019)0.00
- Benchmarking Perturbation-based Saliency Maps For Explaining Atari Agents (2021)0.00
- Explain Your Move: Understanding Agent Actions Using Specific And Relevant Feature Attribution (2019)0.00
- Machine Versus Human Attention In Deep Reinforcement Learning Tasks (2020)0.00
- Explaining Deep Reinforcement Learning Agents In The Atari Domain Through A Surrogate Model (2021)0.00
- Local And Global Explanations Of Agent Behavior: Integrating Strategy Summaries With Saliency Maps (2020)11.85
- Counterfactual States For Atari Agents Via Generative Deep Learning (2019)0.00