Optimal Exploration For Model-based RL In Nonlinear Systems
2023 Β· Andrew Wagenmaker, Guanya Shi, Kevin Jamieson
Abstract
Learning to control unknown nonlinear dynamical systems is a fundamental problem in reinforcement learning and control theory. A commonly applied approach is to first explore the environment (exploration), learn an accurate model of it (system identification), and then compute an optimal controller with the minimum cost on this estimated system (policy optimization). While existing work has shown that it is possible to learn a uniformly good model of the system~\citep\{mania2020active\}, in practice, if we aim to learn a good controller with a low cost on the actual system, certain system parameters may be significantly more critical than others, and we therefore ought to focus our exploration on learning such parameters. In this work, we consider the setting of nonlinear dynamical systems and seek to formally quantify, in such settings, (a) which parameters are most relevant to learning a good controller, and (b) how we can best explore so as to minimize uncertainty in such paramete
Authors
(none)
Tags
Stats
Related papers
- Task-optimal Exploration In Linear Dynamical Systems (2021)0.00
- An Optimal Policy For Learning Controllable Dynamics By Exploration (2025)0.00
- Actively Learning Reinforcement Learning: A Stochastic Optimal Control Approach (2023)0.00
- Optimistic Active Exploration Of Dynamical Systems (2023)0.00
- RLOC: Neurobiologically Inspired Hierarchical Reinforcement Learning Algorithm For Continuous Control Of Nonlinear Dynamical Systems (2019)0.00
- Exploration Versus Exploitation In Reinforcement Learning: A Stochastic Control Approach (2018)9.76
- PC-MLP: Model-based Reinforcement Learning With Policy Cover Guided Exploration (2021)0.00
- Smart Exploration In Reinforcement Learning Using Bounded Uncertainty Models (2025)0.00