Learn To Interpret Atari Agents
2018 Β· Zhao Yang, Song Bai, Li Zhang, et al.
Abstract
Deep reinforcement learning (DeepRL) agents surpass human-level performance in many tasks. However, the direct mapping from states to actions makes it hard to interpret the rationale behind the decision-making of the agents. In contrast to previous a-posteriori methods for visualizing DeepRL policies, in this work, we propose to equip the DeepRL model with an innate visualization ability. Our proposed agent, named region-sensitive Rainbow (RS-Rainbow), is an end-to-end trainable network based on the original Rainbow, a powerful deep Q-network agent. It learns important regions in the input domain via an attention module. At inference time, after each forward pass, we can visualize regions that are most important to decision-making by backpropagating gradients from the attention module to the input frames. The incorporation of our proposed module not only improves model interpretability, but leads to performance improvement. Extensive experiments on games from the Atari 2600 suite demon
Authors
(none)
Tags
Stats
Related papers
- Visualizing And Understanding Atari Agents (2017)0.00
- Explaining Deep Reinforcement Learning Agents In The Atari Domain Through A Surrogate Model (2021)0.00
- Playing Atari With Six Neurons (2018)0.00
- Is Deep Reinforcement Learning Really Superhuman On Atari? Leveling The Playing Field (2019)0.00
- Explainable Deep Reinforcement Learning Using Introspection In A Non-episodic Task (2021)0.00
- Machine Versus Human Attention In Deep Reinforcement Learning Tasks (2020)0.00
- Revisiting Rainbow: Promoting More Insightful And Inclusive Deep Reinforcement Learning Research (2020)0.00
- "so, Tell Me About Your Policy...": Distillation Of Interpretable Policies From Deep Reinforcement Learning Agents (2025)0.00