Visual Processing In Context Of Reinforcement Learning
2022 Β· Hlynur DavΓΓ° Hlynsson
Abstract
Although deep reinforcement learning (RL) has recently enjoyed many successes, its methods are still data inefficient, which makes solving numerous problems prohibitively expensive in terms of data. We aim to remedy this by taking advantage of the rich supervisory signal in unlabeled data for learning state representations. This thesis introduces three different representation learning algorithms that have access to different subsets of the data sources that traditional RL algorithms use: (i) GRICA is inspired by independent component analysis (ICA) and trains a deep neural network to output statistically independent features of the input. GrICA does so by minimizing the mutual information between each feature and the other features. Additionally, GrICA only requires an unsorted collection of environment states. (ii) Latent Representation Prediction (LARP) requires more context: in addition to requiring a state as an input, it also needs the previous state and an action that connec
Authors
(none)
Tags
Stats
Related papers
- Value-consistent Representation Learning For Data-efficient Reinforcement Learning (2022)0.00
- Data-efficient Reinforcement Learning With Self-predictive Representations (2020)0.00
- Learning Temporally-consistent Representations For Data-efficient Reinforcement Learning (2021)0.00
- Unsupervised Representation Learning In Deep Reinforcement Learning: A Review (2022)9.59
- A Survey Of State Representation Learning For Deep Reinforcement Learning (2025)0.00
- Learning To Identify Critical States For Reinforcement Learning From Videos (2023)8.76
- Improving Reinforcement Learning Efficiency With Auxiliary Tasks In Non-visual Environments: A Comparison (2023)2.26
- Contrastive Learning As Goal-conditioned Reinforcement Learning (2022)0.00