Dense And Diverse Goal Coverage In Multi Goal Reinforcement Learning
2025 Β· Sagalpreet Singh, Rishi Saket, Aravindan Raghuveer
Abstract
Reinforcement Learning algorithms are primarily focused on learning a policy that maximizes expected return. As a result, the learned policy can exploit one or few reward sources. However, in many natural situations, it is desirable to learn a policy that induces a dispersed marginal state distribution over rewarding states, while maximizing the expected return which is typically tied to reaching a goal state. This aspect remains relatively unexplored. Existing techniques based on entropy regularization and intrinsic rewards use stochasticity for encouraging exploration to find an optimal policy which may not necessarily lead to dispersed marginal state distribution over rewarding states. Other RL algorithms which match a target distribution assume the latter to be available apriori. This may be infeasible in large scale systems where enumeration of all states is not possible and a state is determined to be a goal state only upon reaching it. We formalize the problem of maximizing the
Authors
(none)
Tags
Stats
Related papers
- Maximum Entropy-regularized Multi-goal Reinforcement Learning (2019)0.00
- Generalizing Across Multi-objective Reward Functions In Deep Reinforcement Learning (2018)0.00
- An Agent Design With Goal Reaching Guarantees For Enhancement Of Learning (2024)0.00
- Maximum Entropy Gain Exploration For Long Horizon Multi-goal Reinforcement Learning (2020)0.00
- Learning Self-imitating Diverse Policies (2018)0.00
- Imagine Beyond! Distributionally Robust Auto-encoding For State Space Coverage In Online Reinforcement Learning (2025)0.00
- One Solution Is Not All You Need: Few-shot Extrapolation Via Structured Maxent RL (2020)0.00
- Diverse Policies Converge In Reward-free Markov Decision Processe (2023)0.00