Time-series-informed Closed-loop Learning For Sequential Decision Making And Control
2024 Β· Sebastian Hirt, Lukas Theiner, Rolf Findeisen
Abstract
Closed-loop performance of sequential decision making algorithms, such as model predictive control, depends strongly on the choice of controller parameters. Bayesian optimization allows learning of parameters from closed-loop experiments, but standard Bayesian optimization treats this as a black-box problem and ignores the temporal structure of closed-loop trajectories, leading to slow convergence and inefficient use of experimental resources. We propose a time-series-informed multi-fidelity Bayesian optimization framework that aligns the fidelity dimension with closed-loop time, enabling intermediate performance evaluations within a closed-loop experiment to be incorporated as lower-fidelity observations. Additionally, we derive probabilistic early stopping criteria to terminate unpromising closed-loop experiments based on the surrogate model's posterior belief, avoiding full episodes for poor parameterizations and thereby reducing resource usage. Simulation results on a nonlinear con
Authors
(none)
Tags
Stats
Related papers
- Model-based Reinforcement Learning For Control Under Time-varying Dynamics (2026)0.00
- Logarithmic Regret For Nonlinear Control (2025)0.00
- Towards An Adaptable And Generalizable Optimization Engine In Decision And Control: A Meta Reinforcement Learning Approach (2024)0.00
- Inverse Rational Control With Partially Observable Continuous Nonlinear Dynamics (2019)0.00
- From Reinforcement Learning To Optimal Control: A Unified Framework For Sequential Decisions (2019)0.00
- Follow The Clairvoyant: An Imitation Learning Approach To Optimal Control (2022)4.52
- Probabilistic Inverse Optimal Control For Non-linear Partially Observable Systems Disentangles Perceptual Uncertainty And Behavioral Costs (2023)0.00
- Alternating Optimisation And Quadrature For Robust Control (2016)7.16