Generalized Bayesian Deep Reinforcement Learning
2024 Β· Shreya Sinha Roy, Richard G. Everitt, Christian P. Robert, et al.
Abstract
Bayesian reinforcement learning (BRL) is a method that merges principles from Bayesian statistics and reinforcement learning to make optimal decisions in uncertain environments. As a model-based RL method, it has two key components: (1) inferring the posterior distribution of the model for the data-generating process (DGP) and (2) policy learning using the learned posterior. We propose to model the dynamics of the unknown environment through deep generative models, assuming Markov dependence. In the absence of likelihood functions for these models, we train them by learning a generalized predictive-sequential (or prequential) scoring rule (SR) posterior. We used sequential Monte Carlo (SMC) samplers to draw samples from this generalized Bayesian posterior distribution. In conjunction, to achieve scalability in the high-dimensional parameter space of the neural networks, we use the gradient-based Markov kernels within SMC. To justify the use of the prequential scoring rule posterior, we
Authors
(none)
Tags
Stats
Related papers
- Inferential Induction: A Novel Framework For Bayesian Reinforcement Learning (2020)0.00
- Online Bayesian Risk-averse Reinforcement Learning (2025)0.00
- Bayesian Exploration Networks (2023)0.00
- Combining Bayesian Inference And Reinforcement Learning For Agent Decision Making: A Review (2025)0.00
- On The Reliability And Generalizability Of Brain-inspired Reinforcement Learning Algorithms (2020)0.00
- Bayesian Reparameterization Of Reward-conditioned Reinforcement Learning With Energy-based Models (2023)0.00
- Asymptotically Optimal Reinforcement Learning In Block Markov Decision Processes (2025)0.00
- Robust Bayesian Dynamic Programming For On-policy Risk-sensitive Reinforcement Learning (2025)0.00