Reward Design For Reinforcement Learning Agents
2025 Β· Rati Devidze
Abstract
Reward functions are central in reinforcement learning (RL), guiding agents towards optimal decision-making. The complexity of RL tasks requires meticulously designed reward functions that effectively drive learning while avoiding unintended consequences. Effective reward design aims to provide signals that accelerate the agent's convergence to optimal behavior. Crafting rewards that align with task objectives, foster desired behaviors, and prevent undesirable actions is inherently challenging. This thesis delves into the critical role of reward signals in RL, highlighting their impact on the agent's behavior and learning dynamics and addressing challenges such as delayed, ambiguous, or intricate rewards. In this thesis work, we tackle different aspects of reward shaping. First, we address the problem of designing informative and interpretable reward signals from a teacher's/expert's perspective (teacher-driven). Here, the expert, equipped with the optimal policy and the corresponding
Authors
(none)
Tags
Stats
Related papers
- Designing Rewards For Fast Learning (2022)0.00
- Tiered Reward: Designing Rewards For Specification And Fast Learning Of Desired Behavior (2022)0.00
- Informativeness Of Reward Functions In Reinforcement Learning (2024)2.26
- Reward Models In Deep Reinforcement Learning: A Survey (2025)0.00
- Adaptive Reward Design For Reinforcement Learning (2024)0.00
- Differentiable Evolutionary Reinforcement Learning (2025)0.00
- Provably Feedback-efficient Reinforcement Learning Via Active Reward Learning (2023)0.00
- Deep Reinforcement Learning From Hierarchical Preference Design (2023)2.00