Limitations Of Scalarisation In MORL: A Comparative Study In Discrete Environments
2025 Β· Muhammad Sa'Ood Shah, Asad Jeewa
Abstract
Scalarisation functions are widely employed in MORL algorithms to enable intelligent decision-making. However, these functions often struggle to approximate the Pareto front accurately, rendering them unideal in complex, uncertain environments. This study examines selected Multi-Objective Reinforcement Learning (MORL) algorithms across MORL environments with discrete action and observation spaces. We aim to investigate further the limitations associated with scalarisation approaches for decision-making in multi-objective settings. Specifically, we use an outer-loop multi-policy methodology to assess the performance of a seminal single-policy MORL algorithm, MO Q-Learning implemented with linear scalarisation and Chebyshev scalarisation functions. In addition, we explore a pioneering inner-loop multi-policy algorithm, Pareto Q-Learning, which offers a more robust alternative. Our findings reveal that the performance of the scalarisation functions is highly dependent on the environment a
Authors
(none)
Tags
Stats
Related papers
- Addressing The Issue Of Stochastic Environments And Local Decision-making In Multi-objective Reinforcement Learning (2022)0.00
- An Empirical Investigation Of Value-based Multi-objective Reinforcement Learning For Stochastic Environments (2024)0.00
- Issues With Value-based Multi-objective Reinforcement Learning: Value Function Interference And Overestimation Sensitivity (2024)0.00
- On Generalization Across Environments In Multi-objective Reinforcement Learning (2025)0.00
- Interpretability By Design For Efficient Multi-objective Reinforcement Learning (2025)0.00
- A Generalized Algorithm For Multi-objective Reinforcement Learning And Policy Adaptation (2019)0.00
- Navigating Trade-offs: Policy Summarization For Multi-objective Reinforcement Learning (2024)2.26
- Provable Multi-objective Reinforcement Learning With Generative Models (2020)0.00