Online Model Selection For Reinforcement Learning With Function Approximation
2020 Β· Jonathan N. Lee, Aldo Pacchiano, Vidya Muthukumar, et al.
Abstract
Deep reinforcement learning has achieved impressive successes yet often requires a very large amount of interaction data. This result is perhaps unsurprising, as using complicated function approximation often requires more data to fit, and early theoretical results on linear Markov decision processes provide regret bounds that scale with the dimension of the linear approximation. Ideally, we would like to automatically identify the minimal dimension of the approximation that is sufficient to encode an optimal policy. Towards this end, we consider the problem of model selection in RL with function approximation, given a set of candidate RL algorithms with known regret guarantees. The learner's goal is to adapt to the complexity of the optimal algorithm without knowing it \textit\{a priori\}. We present a meta-algorithm that successively rejects increasingly complex models using a simple statistical test. Given at least one candidate that satisfies realizability, we prove the meta-algori
Authors
(none)
Tags
Stats
Related papers
- On The Model-misspecification In Reinforcement Learning (2023)0.00
- Provably Efficient Reinforcement Learning With Linear Function Approximation (2019)11.76
- Online Sub-sampling For Reinforcement Learning With General Function Approximation (2021)0.00
- Minimax Optimal And Computationally Efficient Algorithms For Distributionally Robust Offline Reinforcement Learning (2024)0.00
- Distributionally Robust Offline Reinforcement Learning With Linear Function Approximation (2022)0.00
- A Nearly Optimal And Low-switching Algorithm For Reinforcement Learning With General Function Approximation (2023)0.00
- Optimal Conservative Offline RL With General Function Approximation Via Augmented Lagrangian (2022)0.00
- Deciding What To Model: Value-equivalent Sampling For Reinforcement Learning (2022)0.00