Benchmarking Perturbation-based Saliency Maps For Explaining Atari Agents
2021 Β· Tobias Huber, Benedikt Limmer, Elisabeth AndrΓ©
Abstract
One of the most prominent methods for explaining the behavior of Deep Reinforcement Learning (DRL) agents is the generation of saliency maps that show how much each pixel attributed to the agents' decision. However, there is no work that computationally evaluates and compares the fidelity of different saliency map approaches specifically for DRL agents. It is particularly challenging to computationally evaluate saliency maps for DRL agents since their decisions are part of an overarching policy. For instance, the output neurons of value-based DRL algorithms encode both the value of the current state as well as the value of doing each action in this state. This ambiguity should be considered when evaluating saliency maps for such agents. In this paper, we compare five popular perturbation-based approaches to create saliency maps for DRL agents trained on four different Atari 2600 games. The approaches are compared using two computational metrics: dependence on the learned parameters of
Authors
(none)
Tags
Stats
Related papers
- Exploratory Not Explanatory: Counterfactual Analysis Of Saliency Maps For Deep Reinforcement Learning (2019)0.00
- Visualizing And Understanding Atari Agents (2017)0.00
- Explain Your Move: Understanding Agent Actions Using Specific And Relevant Feature Attribution (2019)0.00
- Machine Versus Human Attention In Deep Reinforcement Learning Tasks (2020)0.00
- Explaining Deep Reinforcement Learning Agents In The Atari Domain Through A Surrogate Model (2021)0.00
- Importance Of Using Appropriate Baselines For Evaluation Of Data-efficiency In Deep Reinforcement Learning For Atari (2020)0.00
- Are Gradient-based Saliency Maps Useful In Deep Reinforcement Learning? (2020)0.00
- Is Deep Reinforcement Learning Really Superhuman On Atari? Leveling The Playing Field (2019)0.00