Ganterfactual-rl: Understanding Reinforcement Learning Agents' Strategies Through Visual Counterfactual Explanations
2023 Β· Tobias Huber, Maximilian Demmler, Silvan Mertes, et al.
Abstract
Counterfactual explanations are a common tool to explain artificial intelligence models. For Reinforcement Learning (RL) agents, they answer "Why not?" or "What if?" questions by illustrating what minimal change to a state is needed such that an agent chooses a different action. Generating counterfactual explanations for RL agents with visual input is especially challenging because of their large state spaces and because their decisions are part of an overarching policy, which includes long-term decision-making. However, research focusing on counterfactual explanations, specifically for RL agents with visual input, is scarce and does not go beyond identifying defective agents. It is unclear whether counterfactual explanations are still helpful for more complex tasks like analyzing the learned strategies of different agents or choosing a fitting agent for a specific task. We propose a novel but simple method to generate counterfactual explanations for RL agents by formulating the proble
Authors
(none)
Tags
Stats
Related papers
- Counterfactual State Explanations For Reinforcement Learning Agents Via Generative Deep Learning (2021)13.23
- Redefining Counterfactual Explanations For Reinforcement Learning: Overview, Challenges And Opportunities (2022)0.00
- Explaining Reinforcement Learning Agents Through Counterfactual Action Outcomes (2023)5.84
- Experiential Explanations For Reinforcement Learning (2022)2.26
- Why The Agent Made That Decision: Contrastive Explanation Learning For Reinforcement Learning (2024)0.00
- RACCER: Towards Reachable And Certain Counterfactual Explanations For Reinforcement Learning (2023)0.00
- Learning Impartial Policies For Sequential Counterfactual Explanations Using Deep Reinforcement Learning (2023)0.00
- Counterfactual States For Atari Agents Via Generative Deep Learning (2019)0.00