Goal-conditioned Data Augmentation For Offline Reinforcement Learning
2024 · Xingshuai Huang, Di Wu, Benoit Boulet
Abstract
Offline reinforcement learning (RL) enables policy learning from pre-collected offline datasets, relaxing the need to interact directly with the environment. However, limited by the quality of offline datasets, it generally fails to learn high-quality policies from suboptimal datasets. To address datasets with insufficient optimal demonstrations, we introduce Goal-cOnditioned Data Augmentation (GODA), a novel goal-conditioned diffusion-based method for augmenting datasets with higher-quality samples. Leveraging recent advancements in generative modelling, GODA incorporates a novel return-oriented goal condition with various selection mechanisms. Specifically, we introduce a controllable scaling technique to provide enhanced return-based guidance during data sampling. GODA learns a comprehensive distribution representation of the original offline datasets while generating new data with selectively higher-return goals, thereby maximizing the utility of limited optimal demonstrations. Furthermore,
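To make the idea of a return-oriented goal condition more concrete, here is a minimal sketch of how such goals might be computed, selected, and scaled before being passed to a conditional generator. The abstract does not specify the selection mechanisms or the scaling rule, so everything here is an assumption for illustration: the function names, the top-fraction selection, and the additive scaling formula are hypothetical and are not taken from GODA itself. The diffusion model is omitted entirely.

```python
from typing import List


def returns_to_go(rewards: List[float]) -> List[float]:
    """Undiscounted return-to-go: entry t is the sum of rewards from step t onward."""
    rtg: List[float] = []
    running = 0.0
    for reward in reversed(rewards):
        running += reward
        rtg.append(running)
    rtg.reverse()
    return rtg


def select_goals(trajectory_returns: List[float], top_fraction: float = 0.2) -> List[float]:
    """Hypothetical selection mechanism: keep the highest-return fraction of trajectories."""
    k = max(1, int(len(trajectory_returns) * top_fraction))
    return sorted(trajectory_returns, reverse=True)[:k]


def scale_goal(goal: float, scale: float = 1.1) -> float:
    """Hypothetical controllable scaling: raise the goal by (scale - 1) * |goal|.

    Using the absolute value keeps the scaled goal above the original
    even when returns are negative.
    """
    return goal + (scale - 1.0) * abs(goal)


if __name__ == "__main__":
    # Toy dataset: one reward sequence per trajectory (made-up numbers).
    dataset = [
        [1.0, 0.0, 2.0],
        [0.5, 0.5, 0.5],
        [3.0, 1.0, 1.0],
        [0.0, 0.0, 1.0],
        [2.0, 2.0, 2.0],
    ]
    episode_returns = [returns_to_go(rewards)[0] for rewards in dataset]
    goals = [scale_goal(g, scale=1.5) for g in select_goals(episode_returns, 0.4)]
    # In a GODA-like pipeline these goals would condition the sampler.
    print(goals)
```

In this sketch the `scale` parameter plays the role of the "controllable" knob: values above 1 request returns beyond those observed in the dataset, while a value of exactly 1 reproduces the selected returns unchanged.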
Related papers
- GTA: Generative Trajectory Augmentation With Guidance For Offline Reinforcement Learning (2024) · 6.62
- DiffPoGAN: Diffusion Policies With Generative Adversarial Networks For Offline Reinforcement Learning (2024) · 0.00
- Equivariant Data Augmentation For Generalization In Offline Reinforcement Learning (2023) · 0.00
- Model-based Offline Reinforcement Learning With Adversarial Data Augmentation (2025) · 0.00
- Enhancing Online Reinforcement Learning With Meta-learned Objective From Offline Data (2025) · 0.00
- Hundreds Guide Millions: Adaptive Offline Reinforcement Learning With Expert Guidance (2023) · 7.50
- Robust Offline Reinforcement Learning With Gradient Penalty And Constraint Relaxation (2022) · 0.00
- AWAC: Accelerating Online Reinforcement Learning With Offline Datasets (2020) · 0.00