Learning Sparse Representations In Reinforcement Learning
2019 · Jacob Rafati, David C. Noelle
Abstract
Reinforcement learning (RL) algorithms allow artificial agents to improve their selection of actions to increase rewarding experiences in their environments. Temporal Difference (TD) Learning -- a model-free RL method -- is a leading account of the midbrain dopamine system and the basal ganglia in reinforcement learning. These algorithms typically learn a mapping from the agent's current sensed state to a selected action (known as a policy function) via learning a value function (expected future rewards). TD Learning methods have been very successful on a broad range of control tasks, but learning can become intractably slow as the state space of the environment grows. This has motivated methods that learn internal representations of the agent's state, effectively reducing the size of the state space and restructuring state representations in order to support generalization. However, TD Learning coupled with an artificial neural network, as a function approximator, has been shown to fa
Related papers
- Discerning Temporal Difference Learning (2023)
- Temporal Difference Models: Model-free Deep RL For Model-based Control (2018)
- TD Or Not TD: Analyzing The Role Of Temporal Differencing In Deep Reinforcement Learning (2018)
- Temporal-difference Learning Using Distributed Error Signals (2024)
- Learning Symbolic Representations For Reinforcement Learning Of Non-Markovian Behavior (2023)
- Approximate Temporal Difference Learning Is A Gradient Descent For Reversible Policies (2018)
- Prediction And Control In Continual Reinforcement Learning (2023)
- Towards A Better Understanding Of Representation Dynamics Under TD-learning (2023)