Representation-driven Reinforcement Learning
2023 · Ofir Nabati, Guy Tennenholtz, Shie Mannor
Abstract
We present a representation-driven framework for reinforcement learning. By representing policies as estimates of their expected values, we leverage techniques from contextual bandits to guide exploration and exploitation. In particular, embedding a policy network into a linear feature space allows us to reframe the exploration-exploitation problem as a representation-exploitation problem, where good policy representations enable optimal exploration. We demonstrate the effectiveness of this framework by applying it to evolutionary and policy gradient-based approaches, where it significantly improves performance over traditional methods. Our framework provides a new perspective on reinforcement learning, highlighting the importance of policy representation in determining optimal exploration-exploitation strategies.
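As a rough illustration of the idea in the abstract, the sketch below treats each candidate policy as a point in a linear feature space and picks the next policy to try with an upper-confidence-bound rule from the linear bandit literature. This is not the authors' algorithm: the abstract does not specify the selection rule or how embeddings are learned, so the function name `linucb_select`, the parameters `alpha` and `reg`, and the assumption that policy embeddings are given as fixed vectors are all hypothetical.

```python
import numpy as np


def linucb_select(candidates, history_x, history_y, alpha=1.0, reg=1.0):
    """Return the index of the candidate embedding with the highest UCB score.

    candidates: (n, d) array of policy embeddings (assumed given).
    history_x:  (t, d) array of embeddings of previously evaluated policies.
    history_y:  (t,) array of their observed returns.
    alpha, reg: hypothetical exploration and ridge-regularization weights.
    """
    d = candidates.shape[1]
    # Regularized design matrix built from the policies tried so far.
    V = reg * np.eye(d) + history_x.T @ history_x
    V_inv = np.linalg.inv(V)
    # Ridge-regression estimate of the linear value weights.
    w_hat = V_inv @ history_x.T @ history_y
    # Estimated value of each candidate (exploitation term).
    mean = candidates @ w_hat
    # Uncertainty of each estimate, sqrt(x^T V^{-1} x) (exploration term).
    bonus = alpha * np.sqrt(np.einsum("nd,dk,nk->n", candidates, V_inv, candidates))
    scores = mean + bonus
    return int(np.argmax(scores)), scores


# Toy usage: one direction of the embedding space has been evaluated often,
# the other not at all.
history_x = np.tile(np.array([1.0, 0.0]), (100, 1))
history_y = np.ones(100)
candidates = np.array([[1.0, 0.0], [0.0, 1.0]])
best_index, scores = linucb_select(candidates, history_x, history_y, alpha=0.1)
```

Under this rule, the quality of the embedding determines how informative the uncertainty term is, which is one way to read the abstract's claim that good policy representations enable good exploration.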
Related papers
- Representation Of Reinforcement Learning Policies In Reproducing Kernel Hilbert Spaces (2020)
- Spectral Representation-based Reinforcement Learning (2025)
- Improving Policy Gradient By Exploring Under-appreciated Rewards (2016)
- Spectral Decomposition Representation For Reinforcement Learning (2022)
- Frustratingly Easy Regularization On Representation Can Boost Deep Reinforcement Learning (2022)
- S-REINFORCE: A Neuro-symbolic Policy Gradient Approach For Interpretable Reinforcement Learning (2023)
- ERL-Re\(^2\): Efficient Evolutionary Reinforcement Learning With Shared State Representation And Individual Policy Representation (2022)
- Investigating The Impact Of Action Representations In Policy Gradient Algorithms (2023)