Don't Start From Scratch: Behavioral Refinement Via Interpolant-based Policy Diffusion
2024 Β· Kaiqi Chen, Eugene Lim, Kelvin Lin, et al.
Abstract
Imitation learning empowers artificial agents to mimic behavior by learning from demonstrations. Recently, diffusion models, which have the ability to model high-dimensional and multimodal distributions, have shown impressive performance on imitation learning tasks. These models learn to shape a policy by diffusing actions (or states) from standard Gaussian noise. However, the target policy to be learned is often significantly different from Gaussian and this mismatch can result in poor performance when using a small number of diffusion steps (to improve inference speed) and under limited data. The key idea in this work is that initiating from a more informative source than Gaussian enables diffusion methods to mitigate the above limitations. We contribute both theoretical results, a new method, and empirical findings that show the benefits of using an informative source policy. Our method, which we call BRIDGER, leverages the stochastic interpolants framework to bridge arbitrary polic
Authors
(none)
Tags
Stats
Related papers
- Policy-guided Diffusion (2024)0.00
- Equivariant Diffusion Policy (2024)0.00
- Diffusion Policy Through Conditional Proximal Policy Optimization (2026)0.00
- Streaming Diffusion Policy: Fast Policy Synthesis With Variable Noise Diffusion Models (2024)0.00
- Fine-tuning Diffusion Policies With Backpropagation Through Diffusion Timesteps (2025)0.00
- Policy Representation Via Diffusion Probability Model For Reinforcement Learning (2023)0.00
- Genpo: Generative Diffusion Models Meet On-policy Reinforcement Learning (2025)0.00
- Dichotomous Diffusion Policy Optimization (2025)0.00