Explainable Deep Reinforcement Learning: State Of The Art And Challenges
2023 Β· George A. Vouros
Abstract
Interpretability, explainability and transparency are key issues to introducing Artificial Intelligence methods in many critical domains: This is important due to ethical concerns and trust issues strongly connected to reliability, robustness, auditability and fairness, and has important consequences towards keeping the human in the loop in high levels of automation, especially in critical cases for decision making, where both (human and the machine) play important roles. While the research community has given much attention to explainability of closed (or black) prediction boxes, there are tremendous needs for explainability of closed-box methods that support agents to act autonomously in the real world. Reinforcement learning methods, and especially their deep versions, are such closed-box methods. In this article we aim to provide a review of state of the art methods for explainable deep reinforcement learning methods, taking also into account the needs of human operators - i.e., of
Authors
(none)
Tags
Stats
Related papers
- Explainability In Deep Reinforcement Learning, A Review Into Current Methods And Applications (2022)12.33
- Explainability In Deep Reinforcement Learning (2020)0.00
- A Survey On Explainable Reinforcement Learning: Concepts, Algorithms, Challenges (2022)0.00
- Domain-level Explainability -- A Challenge For Creating Trust In Superhuman AI Strategies (2020)0.00
- A Survey Of Explainable Reinforcement Learning: Targets, Methods And Needs (2025)0.00
- Redefining Counterfactual Explanations For Reinforcement Learning: Overview, Challenges And Opportunities (2022)0.00
- Explainable Reinforcement Learning: A Survey (2020)0.00
- Causal State Distillation For Explainable Reinforcement Learning (2023)0.00