Automatic Intrinsic Reward Shaping For Exploration In Deep Reinforcement Learning
2023 · Mingqi Yuan, Bo Li, Xin Jin, et al.
Abstract
We present AIRS: Automatic Intrinsic Reward Shaping, which intelligently and adaptively provides high-quality intrinsic rewards to enhance exploration in reinforcement learning (RL). More specifically, AIRS selects a shaping function from a predefined set based on the estimated task return in real time, providing reliable exploration incentives and alleviating the biased objective problem. Moreover, we develop an intrinsic reward toolkit to provide efficient and reliable implementations of diverse intrinsic reward approaches. We test AIRS on various tasks from MiniGrid, Procgen, and the DeepMind Control Suite. Extensive simulations demonstrate that AIRS can outperform the benchmarking schemes and achieve superior performance with a simple architecture.
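The selection step described in the abstract (choosing one shaping function from a predefined set according to the estimated task return) can be illustrated with a small bandit-style sketch. This is only a sketch under stated assumptions, not the authors' implementation: the upper-confidence-bound (UCB) rule, the class name `ShapingSelector`, the exploration coefficient `c`, the shaping-function names, and the fixed `fake_returns` table are all hypothetical stand-ins introduced here for illustration.

```python
import math


class ShapingSelector:
    """Hypothetical UCB-style selector over a predefined set of shaping functions."""

    def __init__(self, names, c=1.0):
        self.names = list(names)
        self.c = c                                # exploration coefficient (assumed)
        self.counts = [0] * len(self.names)       # times each function was selected
        self.values = [0.0] * len(self.names)     # running mean of estimated task return
        self.t = 0                                # total number of updates so far

    def select(self):
        # Try every shaping function once before relying on the UCB score.
        for i, n in enumerate(self.counts):
            if n == 0:
                return i
        scores = [
            v + self.c * math.sqrt(math.log(self.t) / n)
            for v, n in zip(self.values, self.counts)
        ]
        return max(range(len(scores)), key=scores.__getitem__)

    def update(self, i, estimated_return):
        # Incremental mean of the task return observed while function i was active.
        self.t += 1
        self.counts[i] += 1
        self.values[i] += (estimated_return - self.values[i]) / self.counts[i]


def run_demo(steps=50):
    selector = ShapingSelector(["none", "count_based", "curiosity"], c=0.5)
    # Fixed, made-up returns standing in for a real task-return estimate.
    fake_returns = {0: 0.1, 1: 0.9, 2: 0.4}
    picks = []
    for _ in range(steps):
        i = selector.select()
        selector.update(i, fake_returns[i])
        picks.append(i)
    return selector, picks


if __name__ == "__main__":
    selector, _ = run_demo()
    for name, count in zip(selector.names, selector.counts):
        print(name, count)
```

In a real training loop, the value passed to `update` would come from the agent's own estimate of task return over the most recent rollout, and the selected index would decide which intrinsic reward is added to the extrinsic reward for the next rollout.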
Related papers
- Highly Efficient Self-adaptive Reward Shaping For Reinforcement Learning (2024)
- BAMDP Shaping: A Unified Framework For Intrinsic Motivation And Reward Shaping (2024)
- RLeXplore: Accelerating Research In Intrinsically-motivated Reinforcement Learning (2024)
- Unpacking Reward Shaping: Understanding The Benefits Of Reward Engineering On Sample Complexity (2022)
- Multimodal Reward Shaping For Efficient Exploration In Reinforcement Learning (2021)
- The Impact Of Intrinsic Rewards On Exploration In Reinforcement Learning (2025)
- Learning Robust Rewards With Adversarial Inverse Reinforcement Learning (2017)
- Intrinsic Reward Policy Optimization For Sparse-reward Environments (2026)