Continuous Value Iteration (CVI) Reinforcement Learning And Imaginary Experience Replay (IER) For Learning Multi-goal, Continuous Action And State Space Controllers
2019 · Andreas Gerken, Michael Spranger
Abstract
This paper presents a novel model-free reinforcement learning algorithm for learning behavior in continuous action, state, and goal spaces. The algorithm approximates optimal value functions using non-parametric estimators. It efficiently learns to reach multiple arbitrary goals in both deterministic and nondeterministic environments. To improve generalization in the goal space, we propose a novel sample augmentation technique. With these methods, robots learn controllers faster, and the learned controllers perform better overall. We benchmark the proposed algorithms in simulation and on a real-world voltage-controlled robot that learns to maneuver in a non-observable Cartesian task space.
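The abstract does not spell out the estimator or the augmentation procedure, so the following is only an illustrative sketch of the two ideas it names, not the authors' implementation. It assumes that goals live in the same space as states, that a transition can be relabeled with an imagined goal by recomputing its reward, and that values are estimated by k-nearest-neighbor averaging. The names `Transition`, `reach_reward`, `augment_with_imagined_goals`, and `knn_value` are hypothetical.

```python
import math
from typing import Callable, List, NamedTuple, Sequence, Tuple

Vector = Tuple[float, ...]


class Transition(NamedTuple):
    state: Vector
    action: Vector
    next_state: Vector
    goal: Vector
    reward: float


def reach_reward(next_state: Vector, goal: Vector, tol: float = 0.1) -> float:
    """Assumed sparse reward: 1.0 when next_state lies within tol of goal."""
    return 1.0 if math.dist(next_state, goal) <= tol else 0.0


def augment_with_imagined_goals(
    transitions: Sequence[Transition],
    sample_goal: Callable[[], Vector],
    n_goals: int,
    reward_fn: Callable[[Vector, Vector], float] = reach_reward,
) -> List[Transition]:
    """Keep the real transitions and append copies relabeled with imagined goals."""
    augmented = list(transitions)
    for t in transitions:
        for _ in range(n_goals):
            g = sample_goal()
            # The dynamics (state, action, next_state) are reused unchanged;
            # only the goal and the reward it implies are replaced.
            augmented.append(t._replace(goal=g, reward=reward_fn(t.next_state, g)))
    return augmented


def knn_value(
    query: Vector, samples: Sequence[Tuple[Vector, float]], k: int = 3
) -> float:
    """Non-parametric value estimate: mean value of the k nearest stored points.

    Each sample pairs a (state + goal) vector with a value estimate.
    """
    if not samples:
        raise ValueError("need at least one stored sample")
    nearest = sorted(samples, key=lambda s: math.dist(s[0], query))[:k]
    return sum(v for _, v in nearest) / len(nearest)
```

In this sketch one observed transition yields `n_goals` extra training samples without further interaction with the environment, which is one plausible way such augmentation could help generalization across goals.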