Learning Robust Control For LQR Systems With Multiplicative Noise Via Policy Gradient
2019 Β· Benjamin Gravell, Peyman Mohajerin Esfahani, Tyler Summers
Abstract
The linear quadratic regulator (LQR) problem has reemerged as an important theoretical benchmark for reinforcement learning-based control of complex dynamical systems with continuous state and action spaces. In contrast with nearly all recent work in this area, we consider multiplicative noise models, which are increasingly relevant because they explicitly incorporate inherent uncertainty and variation in the system dynamics and thereby improve robustness properties of the controller. Robustness is a critical and poorly understood issue in reinforcement learning; existing methods which do not account for uncertainty can converge to fragile policies or fail to converge at all. Additionally, intentional injection of multiplicative noise into learning algorithms can enhance robustness of policies, as observed in ad hoc work on domain randomization. Although policy gradient algorithms require optimization of a non-convex cost function, we show that the multiplicative noise LQR cost has a s
Authors
(none)
Tags
Stats
Related papers
- Robust Reinforcement Learning: A Case Study In Linear Quadratic Regulation (2020)11.19
- Revisiting LQR Control From The Perspective Of Receding-horizon Policy Gradient (2023)8.60
- Full Error Analysis Of Policy Gradient Learning Algorithms For Exploratory Linear Quadratic Mean-field Control Problem In Continuous Time With Common Noise (2024)0.00
- Some Remarks On Gradient Dominance And LQR Policy Optimization (2025)0.00
- Online Policy Gradient For Model Free Learning Of Linear Quadratic Regulators With \(\sqrt{t}\) Regret (2021)0.00
- Meta-learning Linear Quadratic Regulators: A Policy Gradient MAML Approach For Model-free LQR (2024)0.00
- Policy Gradient For LQR With Domain Randomization (2025)2.26
- Fast Policy Learning For Linear Quadratic Control With Entropy Regularization (2023)0.00