Global And Local Analysis Of Interestingness For Competency-aware Deep Reinforcement Learning
2022 Β· Pedro Sequeira, Jesse Hostetler, Melinda Gervasio
Abstract
In recent years, advances in deep learning have resulted in a plethora of successes in the use of reinforcement learning (RL) to solve complex sequential decision tasks with high-dimensional inputs. However, existing systems lack the necessary mechanisms to provide humans with a holistic view of their competence, presenting an impediment to their adoption, particularly in critical applications where the decisions an agent makes can have significant consequences. Yet, existing RL-based systems are essentially competency-unaware in that they lack the necessary interpretation mechanisms to allow human operators to have an insightful, holistic view of their competency. In this paper, we extend a recently-proposed framework for explainable RL that is based on analyses of "interestingness." Our new framework provides various measures of RL agent competence stemming from interestingness analysis and is applicable to a wide range of RL algorithms. We also propose novel mechanisms for assessing
Authors
(none)
Tags
Stats
Related papers
- Ixdrl: A Novel Explainable Deep Reinforcement Learning Toolkit Based On Analyses Of Interestingness (2023)5.24
- Interestingness Elements For Explainable Reinforcement Learning: Understanding Agents' Capabilities And Limitations (2019)14.55
- Explainable Deep Reinforcement Learning: State Of The Art And Challenges (2023)15.80
- Local And Global Explanations Of Agent Behavior: Integrating Strategy Summaries With Saliency Maps (2020)11.85
- Explainability In Deep Reinforcement Learning (2020)0.00
- Redefining Counterfactual Explanations For Reinforcement Learning: Overview, Challenges And Opportunities (2022)0.00
- Evaluating The Progress Of Deep Reinforcement Learning In The Real World: Aligning Domain-agnostic And Domain-specific Research (2021)0.00
- A Survey On Enhancing Reinforcement Learning In Complex Environments: Insights From Human And LLM Feedback (2024)0.00