Robust Behavior Cloning Via Global Lipschitz Regularization
2025 Β· Shili Wu, Yizhao Jin, Puhua Niu, et al.
Abstract
Behavior Cloning (BC) is an effective imitation learning technique and has even been adopted in some safety-critical domains such as autonomous vehicles. BC trains a policy to mimic the behavior of an expert by using a dataset composed of only state-action pairs demonstrated by the expert, without any additional interaction with the environment. However, During deployment, the policy observations may contain measurement errors or adversarial disturbances. Since the observations may deviate from the true states, they can mislead the agent into making sub-optimal actions. In this work, we use a global Lipschitz regularization approach to enhance the robustness of the learned policy network. We then show that the resulting global Lipschitz property provides a robustness certificate to the policy with respect to different bounded norm perturbations. Then, we propose a way to construct a Lipschitz neural network that ensures the policy robustness. We empirically validate our theory across v
Authors
(none)
Tags
Stats
Related papers
- Reliable Conditioning Of Behavioral Cloning For Offline Reinforcement Learning (2022)0.00
- B3C: A Minimalist Approach To Offline Multi-agent Reinforcement Learning (2025)0.00
- Is Behavior Cloning All You Need? Understanding Horizon In Imitation Learning (2024)0.00
- Interactive And Hybrid Imitation Learning: Provably Beating Behavior Cloning (2024)0.00
- Know Your Boundaries: The Necessity Of Explicit Behavioral Cloning In Offline RL (2022)0.00
- Improving TD3-BC: Relaxed Policy Constraint For Offline Learning And Stable Online Fine-tuning (2022)0.00
- Swarm Behavior Cloning (2024)0.00
- Robust Deep Reinforcement Learning Through Bootstrapped Opportunistic Curriculum (2022)0.00