An Agent Design With Goal Reaching Guarantees For Enhancement Of Learning
2024 · Pavel Osinenko, Grigory Yaremenko, Georgiy Malaniya, et al.
Abstract
Reinforcement learning is commonly concerned with maximizing accumulated rewards in Markov decision processes. Often, a certain goal state or a subset of the state space attains maximal reward. In such a case, the environment may be considered solved when the goal is reached. Whereas numerous techniques, learning-based or not, exist for solving environments, doing so optimally is the biggest challenge. For instance, one may choose a reward rate that penalizes the action effort. Reinforcement learning is currently among the most actively developed frameworks for solving environments optimally by maximizing accumulated reward, in other words, returns. Yet tuning agents is a notoriously hard task, as reported in a series of works. Our aim here is to help the agent learn a near-optimal policy efficiently while ensuring a goal-reaching property of some basis policy that merely solves the environment. We suggest an algorithm, which is fairly flexible, and can be us
Related papers
- Goal-oriented Inference Of Environment From Redundant Observations (2023) 3.58
- Dense And Diverse Goal Coverage In Multi Goal Reinforcement Learning (2025) 0.00
- Reward Design For Reinforcement Learning Agents (2025) 0.00
- Scalable Agent Alignment Via Reward Modeling: A Research Direction (2018) 0.00
- Learning To Design Games: Strategic Environments In Reinforcement Learning (2017) 0.00
- Self-supervised Goal-reaching Results In Multi-agent Cooperation And Exploration (2025) 0.00
- Reinforcement Learning With A Focus On Adjusting Policies To Reach Targets (2024) 0.00
- Tiered Reward: Designing Rewards For Specification And Fast Learning Of Desired Behavior (2022) 0.00