Exploration In Feature Space For Reinforcement Learning
2017 · Suraj Narayanan Sasikumar
Abstract
The infamous exploration-exploitation dilemma is one of the oldest and most important problems in reinforcement learning (RL). Deliberate and effective exploration is necessary for RL agents to succeed in most environments. However, until very recently even very sophisticated RL algorithms employed simple, undirected exploration strategies in large-scale RL tasks. We introduce a new optimistic count-based exploration algorithm for RL that is feasible in high-dimensional MDPs. The success of RL algorithms in these domains depends crucially on generalization from limited training experience. Function approximation techniques enable RL agents to generalize in order to estimate the value of unvisited states, but at present few methods have achieved generalization about the agent's uncertainty regarding unvisited states. We present a new method for computing a generalized state visit-count, which allows the agent to estimate the uncertainty associated with any state. In contrast to exis
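The idea of a generalized visit-count that yields an optimistic exploration bonus can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the abstract: the class name `FeaturePseudoCount`, the binary feature vectors, the factorized density model with a 0.5 smoothing term, the pseudo-count formula borrowed from the density-based pseudo-count literature, and the constants `beta` and `0.01`. It should not be read as the paper's exact algorithm.

```python
import math


class FeaturePseudoCount:
    """Illustrative pseudo-count over binary feature vectors (hypothetical helper)."""

    def __init__(self, n_features, beta=0.05):
        self.beta = beta                 # bonus scale (assumed value)
        self.t = 0                       # number of feature vectors observed so far
        self.ones = [0] * n_features     # how often each feature was active

    def _prob(self, phi, extra):
        # Factorized density: product of smoothed per-feature frequencies.
        # extra=0 gives the current density; extra=1 gives the density the
        # model would assign after observing phi one more time.
        log_p = 0.0
        for i, bit in enumerate(phi):
            n = self.ones[i] if bit else self.t - self.ones[i]
            log_p += math.log((n + extra + 0.5) / (self.t + extra + 1.0))
        return math.exp(log_p)

    def pseudo_count(self, phi):
        # Pseudo-count derived from the density before and after one more
        # hypothetical observation of phi.
        rho = self._prob(phi, 0)
        rho_next = self._prob(phi, 1)
        return rho * (1.0 - rho_next) / (rho_next - rho)

    def bonus(self, phi):
        # Optimistic bonus that shrinks as the pseudo-count grows.
        return self.beta / math.sqrt(self.pseudo_count(phi) + 0.01)

    def update(self, phi):
        # Record one real observation of phi.
        self.t += 1
        for i, bit in enumerate(phi):
            if bit:
                self.ones[i] += 1


model = FeaturePseudoCount(n_features=3)
for _ in range(20):
    model.update([1, 0, 1])

familiar = model.bonus([1, 0, 1])   # features seen many times
novel = model.bonus([0, 1, 0])      # features never seen together
print(familiar < novel)
```

Because the count is computed from features rather than from raw states, a state that was never visited still receives a count that reflects how often its features have appeared, which is the kind of generalization about uncertainty the abstract describes. In this sketch the bonus would be added to the environment reward before the value update.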
Related papers
- Neighboring State-based Exploration For Reinforcement Learning (2022) 0.00
- Exploration And Incentives In Reinforcement Learning (2021) 8.09
- Exploration Conscious Reinforcement Learning Revisited (2018) 0.00
- Go-explore: A New Approach For Hard-exploration Problems (2019) 0.00
- Exploitation Is All You Need... For Exploration (2025) 0.00
- Is Exploration Or Optimization The Problem For Deep Reinforcement Learning? (2025) 0.00
- Exploration Via Elliptical Episodic Bonuses (2022) 3.58
- Provably Efficient Exploration For Reinforcement Learning Using Unsupervised Learning (2020) 0.00