Interpretability By Design For Efficient Multi-objective Reinforcement Learning
2025 · Qiyue Xia, Tianwei Wang, J. Michael Herrmann
Abstract
Multi-objective reinforcement learning (MORL) aims to optimise several, often conflicting, goals in order to improve the flexibility and reliability of RL in practical tasks. This is typically achieved by finding a set of diverse, non-dominated policies that form a Pareto front in the performance space. We introduce LLE-MORL, an approach that achieves interpretability by design through a training scheme based on the local relationship between the parameter space and the performance space. By exploiting a locally linear map between these spaces, our method provides an interpretation of policy parameters in terms of the objectives. This structured representation enables an efficient search within contiguous solution domains, allowing for the rapid generation of high-quality solutions without extensive retraining. Experiments across diverse continuous control domains demonstrate that LLE-MORL consistently achieves higher Pareto front quality and efficiency than state-of-the-art approaches.
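To make the notion of "non-dominated policies" concrete, the following is a minimal sketch of Pareto filtering over per-policy return vectors, assuming every objective is to be maximised. It is not the LLE-MORL algorithm or any part of it; the names `dominates`, `pareto_front`, and `policy_returns`, as well as the numbers, are hypothetical and chosen only for illustration.

```python
def dominates(a, b):
    """Return True if return vector a Pareto-dominates b (maximisation).

    a dominates b when a is at least as good on every objective
    and strictly better on at least one.
    """
    at_least_as_good = all(x >= y for x, y in zip(a, b))
    strictly_better = any(x > y for x, y in zip(a, b))
    return at_least_as_good and strictly_better


def pareto_front(points):
    """Keep only the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical two-objective returns, one tuple per policy (illustrative only).
policy_returns = [
    (1.0, 5.0),
    (2.0, 4.0),
    (1.5, 3.0),  # dominated by (2.0, 4.0)
    (3.0, 1.0),
    (0.5, 0.5),  # dominated by every other point
]

front = pareto_front(policy_returns)
```

Here `front` holds the three mutually non-dominated return vectors; each one represents a different trade-off between the two objectives. The pairwise check is quadratic in the number of policies, which is acceptable for small candidate sets such as this one.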
Related papers
- Navigating Trade-offs: Policy Summarization For Multi-objective Reinforcement Learning (2024) · 2.26
- Using Logical Specifications Of Objectives In Multi-objective Reinforcement Learning (2019) · 0.00
- Provable Multi-objective Reinforcement Learning With Generative Models (2020) · 0.00
- On Generalization Across Environments In Multi-objective Reinforcement Learning (2025) · 0.00
- Addressing The Issue Of Stochastic Environments And Local Decision-making In Multi-objective Reinforcement Learning (2022) · 0.00
- A Generalized Algorithm For Multi-objective Reinforcement Learning And Policy Adaptation (2019) · 0.00
- Sample-efficient Multi-objective Learning Via Generalized Policy Improvement Prioritization (2023) · 5.24
- Multi-objective Reinforcement Learning Based On Decomposition: A Taxonomy And Framework (2023) · 9.92