Deepmdp: Learning Continuous Latent Space Models For Representation Learning
2019 Β· Carles Gelada, Saurabh Kumar, Jacob Buckman, et al.
Abstract
Many reinforcement learning (RL) tasks provide the agent with high-dimensional observations that can be simplified into low-dimensional continuous states. To formalize this process, we introduce the concept of a DeepMDP, a parameterized latent space model that is trained via the minimization of two tractable losses: prediction of rewards and prediction of the distribution over next latent states. We show that the optimization of these objectives guarantees (1) the quality of the latent space as a representation of the state space and (2) the quality of the DeepMDP as a model of the environment. We connect these results to prior work in the bisimulation literature, and explore the use of a variety of metrics. Our theoretical findings are substantiated by the experimental result that a trained DeepMDP recovers the latent structure underlying high-dimensional observations on a synthetic environment. Finally, we show that learning a DeepMDP as an auxiliary task in the Atari 2600 domain lea
Authors
(none)
Tags
Stats
Related papers
- Low-dimensional State And Action Representation Learning With MDP Homomorphism Metrics (2021)0.00
- Simplifying Model-based RL: Learning Representations, Latent-space Models, And Policies With One Objective (2022)0.00
- Learning Markov State Abstractions For Deep Reinforcement Learning (2021)0.00
- Latent Variable Representation For Reinforcement Learning (2022)0.00
- Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity (2021)0.00
- Representation Learning For Efficient Deep Multi-agent Reinforcement Learning (2024)0.00
- Projection By Convolution: Optimal Sample Complexity For Reinforcement Learning In Continuous-space Mdps (2024)0.00
- Deep Active Inference For Partially Observable Mdps (2020)9.59