A New Potential-based Reward Shaping For Reinforcement Learning Agent
2019 · Babak Badnava, Mona Esmaeili, Nasser Mozayani, et al.
Abstract
Potential-based reward shaping (PBRS) is a class of machine learning methods that aims to improve the learning speed of a reinforcement learning agent by extracting and utilizing extra knowledge while the agent performs a task. Transfer learning involves two steps: extracting knowledge from previously learned tasks, and transferring that knowledge for use in a target task. The latter step is well discussed in the literature, with various methods proposed for it, while the former has been explored less. The type of knowledge that is transferred is therefore very important and can lead to considerable improvement. In the literature on both transfer learning and potential-based reward shaping, a subject that has never been addressed is the knowledge gathered during the learning process itself. In this paper, we present a novel potential-based reward shaping method that attempts to extract knowledge from the learning process. The propo
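For orientation, the classical form of potential-based shaping adds a term F(s, s') = gamma * Phi(s') - Phi(s) to the environment reward, where Phi is a potential function over states. The sketch below illustrates only that general form, not the method proposed in this paper. The corridor task, the goal position, and the names `shaped_reward` and `distance_potential` are hypothetical and chosen purely for illustration.

```python
from typing import Callable

GOAL = 5  # hypothetical goal cell in a 1-D corridor (assumption, not from the paper)


def distance_potential(state: int) -> float:
    """Hypothetical hand-written potential: higher (less negative) nearer the goal."""
    return -float(abs(GOAL - state))


def shaped_reward(
    reward: float,
    state: int,
    next_state: int,
    potential: Callable[[int], float],
    gamma: float = 0.99,
) -> float:
    """Return r + gamma * Phi(s') - Phi(s), the classical PBRS form."""
    return reward + gamma * potential(next_state) - potential(state)


# With gamma = 1 the shaping terms telescope along a trajectory:
# their sum depends only on the first and last state.
path = [0, 1, 2, 1, 2, 3]
total_bonus = sum(
    shaped_reward(0.0, s, s_next, distance_potential, gamma=1.0)
    for s, s_next in zip(path, path[1:])
)
```

Because the bonus is a difference of potentials, a step toward the goal earns a positive shaping term and a step away earns a negative one, while the total over a trajectory depends only on where it starts and ends (exactly so for gamma = 1).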
Related papers
- On The Sample Efficiency Of Abstractions And Potential-based Reward Shaping In Reinforcement Learning (2024)
- Subgoal-based Reward Shaping To Improve Efficiency In Reinforcement Learning (2021)
- BAMDP Shaping: A Unified Framework For Intrinsic Motivation And Reward Shaping (2024)
- Highly Efficient Self-adaptive Reward Shaping For Reinforcement Learning (2024)
- Shaping Advice In Deep Reinforcement Learning (2022)
- Centralized Reward Agent For Knowledge Sharing And Transfer In Multi-task Reinforcement Learning (2024)
- Reward Shaping With Dynamic Trajectory Aggregation (2021)
- Unpacking Reward Shaping: Understanding The Benefits Of Reward Engineering On Sample Complexity (2022)