A Survey On Explainable Reinforcement Learning: Concepts, Algorithms, Challenges
2022 · Yunpeng Qing, Shunyu Liu, Jie Song, et al.
Abstract
Reinforcement Learning (RL) is a popular machine learning paradigm in which intelligent agents interact with an environment to fulfill a long-term goal. Driven by the resurgence of deep learning, Deep RL (DRL) has achieved great success across a wide spectrum of complex control tasks. Despite these encouraging results, the deep neural network backbone is widely regarded as a black box, which keeps practitioners from trusting and deploying trained agents in realistic scenarios where high security and reliability are essential. To alleviate this issue, a large body of literature has sought to shed light on the inner workings of intelligent agents, either by constructing intrinsic interpretability or by providing post-hoc explainability. In this survey, we provide a comprehensive review of existing works on eXplainable RL (XRL) and introduce a new taxonomy in which prior works are categorized into model-explaining, reward-explaining, state-explaining, and task-explaining methods.
Related papers
- A Survey Of Explainable Reinforcement Learning (2022)
- Explainable Reinforcement Learning: A Survey (2020)
- A Survey Of Explainable Reinforcement Learning: Targets, Methods And Needs (2025)
- Explainability In Deep Reinforcement Learning (2020)
- Explainability In Deep Reinforcement Learning, A Review Into Current Methods And Applications (2022)
- XRL-Bench: A Benchmark For Evaluating And Comparing Explainable Reinforcement Learning Techniques (2024)
- A Survey On Interpretable Reinforcement Learning (2021)
- Explainable Reinforcement Learning For Broad-XAI: A Conceptual Framework And Survey (2021)