Trust The Model Where It Trusts Itself -- Model-based Actor-critic With Uncertainty-aware Rollout Adaption
2024 Β· Bernd Frauenknecht, Artur Eisele, Devdutt Subhasish, et al.
Abstract
Dyna-style model-based reinforcement learning (MBRL) combines model-free agents with predictive transition models through model-based rollouts. This combination raises a critical question: 'When to trust your model?'; i.e., which rollout length results in the model providing useful data? Janner et al. (2019) address this question by gradually increasing rollout lengths throughout the training. While theoretically tempting, uniform model accuracy is a fallacy that collapses at the latest when extrapolating. Instead, we propose asking the question 'Where to trust your model?'. Using inherent model uncertainty to consider local accuracy, we obtain the Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption (MACURA) algorithm. We propose an easy-to-tune rollout mechanism and demonstrate substantial improvements in data efficiency and performance compared to state-of-the-art deep MBRL methods on the MuJoCo benchmark.
Authors
(none)
Tags
Stats
Related papers
- Acting Upon Imagination: When To Trust Imagined Trajectories In Model Based Reinforcement Learning (2021)0.00
- Plan To Predict: Learning An Uncertainty-foreseeing Model For Model-based Reinforcement Learning (2023)0.00
- Multi-agent Uncertainty-aware Pessimistic Model-based Reinforcement Learning For Connected Autonomous Vehicles (2025)0.00
- Self-correcting Models For Model-based Reinforcement Learning (2016)0.00
- Double Horizon Model-based Policy Optimization (2025)0.00
- Bayes-adaptive Deep Model-based Policy Optimisation (2020)0.00
- Coplanner: Plan To Roll Out Conservatively But To Explore Optimistically For Model-based RL (2023)0.00
- Deep Model-based Reinforcement Learning Via Estimated Uncertainty And Conservative Policy Optimization (2019)0.00