Environment Shaping In Reinforcement Learning Using State Abstraction
2020 Β· Parameswaran Kamalaruban, Rati Devidze, Volkan Cevher, et al.
Abstract
One of the central challenges faced by a reinforcement learning (RL) agent is to effectively learn a (near-)optimal policy in environments with large state spaces having sparse and noisy feedback signals. In real-world applications, an expert with additional domain knowledge can help in speeding up the learning process via *shaping the environment*, i.e., making the environment more learner-friendly. A popular paradigm in literature is *potential-based reward shaping*, where the environment's reward function is augmented with additional local rewards using a potential function. However, the applicability of potential-based reward shaping is limited in settings where (i) the state space is very large, and it is challenging to compute an appropriate potential function, (ii) the feedback signals are noisy, and even with shaped rewards the agent could be trapped in local optima, and (iii) changing the rewards alone is not sufficient, and effective shaping requires changing the dynamics. We
Authors
(none)
Tags
Stats
Related papers
- Shaping Advice In Deep Reinforcement Learning (2022)0.00
- Subgoal-based Reward Shaping To Improve Efficiency In Reinforcement Learning (2021)0.00
- BAMDP Shaping: A Unified Framework For Intrinsic Motivation And Reward Shaping (2024)0.00
- Ecological Reinforcement Learning (2020)8.35
- Learning To Design Games: Strategic Environments In Reinforcement Learning (2017)0.00
- Learning To Shape Rewards Using A Game Of Two Partners (2021)0.00
- Highly Efficient Self-adaptive Reward Shaping For Reinforcement Learning (2024)0.00
- Unpacking Reward Shaping: Understanding The Benefits Of Reward Engineering On Sample Complexity (2022)4.52