Three Dogmas Of Reinforcement Learning
2024 Β· David Abel, Mark K. Ho, Anna Harutyunyan
Abstract
Modern reinforcement learning has been conditioned by at least three dogmas. The first is the environment spotlight, which refers to our tendency to focus on modeling environments rather than agents. The second is our treatment of learning as finding the solution to a task, rather than adaptation. The third is the reward hypothesis, which states that all goals and purposes can be well thought of as maximization of a reward signal. These three dogmas shape much of what we think of as the science of reinforcement learning. While each of the dogmas have played an important role in developing the field, it is time we bring them to the surface and reflect on whether they belong as basic ingredients of our scientific paradigm. In order to realize the potential of reinforcement learning as a canonical frame for researching intelligent agents, we suggest that it is time we shed dogmas one and two entirely, and embrace a nuanced approach to the third.
Authors
(none)
Tags
Stats
Related papers
- Illuminating The Three Dogmas Of Reinforcement Learning Under Evolutionary Light (2025)0.00
- Rethinking The Foundations For Continual Reinforcement Learning (2025)0.00
- Implications Of Human Irrationality For Reinforcement Learning (2020)0.00
- Deep Multiagent Reinforcement Learning: Challenges And Directions (2021)0.00
- Reinforcement Learning Algorithms: An Overview And Classification (2022)14.73
- Unified Algorithms For RL With Decision-estimation Coefficients: PAC, Reward-free, Preference-based Learning, And Beyond (2022)5.24
- Foundations For Transfer In Reinforcement Learning: A Taxonomy Of Knowledge Modalities (2023)0.00
- Ecological Reinforcement Learning (2020)8.35