Distributional Reinforcement Learning With Linear Function Approximation
2019 Β· Marc G. Bellemare, Nicolas Le Roux, Pablo Samuel Castro, et al.
Abstract
Despite many algorithmic advances, our theoretical understanding of practical distributional reinforcement learning methods remains limited. One exception is Rowland et al. (2018)'s analysis of the C51 algorithm in terms of the Cram\'er distance, but their results only apply to the tabular setting and ignore C51's use of a softmax to produce normalized distributions. In this paper we adapt the Cram\'er distance to deal with arbitrary vectors. From it we derive a new distributional algorithm which is fully Cram\'er-based and can be combined to linear function approximation, with formal guarantees in the context of policy evaluation. In allowing the model's prediction to be any real vector, we lose the probabilistic interpretation behind the method, but otherwise maintain the appealing properties of distributional approaches. To the best of our knowledge, ours is the first proof of convergence of a distributional algorithm combined with function approximation. Perhaps surprisingly, our r
Authors
(none)
Tags
Stats
Related papers
- Distributionally Robust Offline Reinforcement Learning With Linear Function Approximation (2022)0.00
- Provably Efficient Reinforcement Learning With Linear Function Approximation (2019)11.76
- Conjugated Discrete Distributions For Distributional Reinforcement Learning (2021)0.00
- Uniform-pac Bounds For Reinforcement Learning With Linear Function Approximation (2021)0.00
- Distributionally Robust Online Markov Game With Linear Function Approximation (2025)0.00
- Distributionally Robust Off-dynamics Reinforcement Learning: Provable Efficiency With Linear Function Approximation (2024)0.00
- Accelerated Distributional Temporal Difference Learning With Linear Function Approximation (2025)0.00
- Reward-free Model-based Reinforcement Learning With Linear Function Approximation (2021)0.00