Interpretable Local Tree Surrogate Policies
2021 Β· John Mern, Sidhart Krishnan, Anil Yildiz, et al.
Abstract
High-dimensional policies, such as those represented by neural networks, cannot be reasonably interpreted by humans. This lack of interpretability reduces the trust users have in policy behavior, limiting their use to low-impact tasks such as video games. Unfortunately, many methods rely on neural network representations for effective learning. In this work, we propose a method to build predictable policy trees as surrogates for policies such as neural networks. The policy trees are easily human interpretable and provide quantitative predictions of future behavior. We demonstrate the performance of this approach on several simulated tasks.
Authors
(none)
Tags
Stats
Related papers
- Mitigating Information Loss In Tree-based Reinforcement Learning Via Direct Optimization (2024)0.00
- Optimizing Interpretable Decision Tree Policies For Reinforcement Learning (2024)0.00
- Three Pathways To Neurosymbolic Reinforcement Learning With Interpretable Model And Policy Networks (2024)0.00
- Iterative Bounding Mdps: Learning Interpretable Policies Via Non-interpretable Methods (2021)0.00
- "so, Tell Me About Your Policy...": Distillation Of Interpretable Policies From Deep Reinforcement Learning Agents (2025)0.00
- Softtreemax: Policy Gradient With Tree Search (2022)0.00
- Evaluating Interpretable Reinforcement Learning By Distilling Policies Into Programs (2025)0.00
- S-REINFORCE: A Neuro-symbolic Policy Gradient Approach For Interpretable Reinforcement Learning (2023)0.00