Action Mapping For Reinforcement Learning In Continuous Environments With Constraints
2024 · Mirco Theile, Lukas Dirnberger, Raphael Trumpp, et al.
Abstract
Deep reinforcement learning (DRL) has had success across various domains, but applying it to environments with constraints remains challenging due to poor sample efficiency and slow convergence. Recent literature explored incorporating model knowledge to mitigate these problems, particularly through the use of models that assess the feasibility of proposed actions. However, integrating feasibility models efficiently into DRL pipelines in environments with continuous action spaces is non-trivial. We propose a novel DRL training strategy utilizing action mapping that leverages feasibility models to streamline the learning process. By decoupling the learning of feasible actions from policy optimization, action mapping allows DRL agents to focus on selecting the optimal action from a reduced feasible action set. We demonstrate through experiments that action mapping significantly improves training performance in constrained environments with continuous action spaces, especially with imperf
Related papers
- Deep Multi-agent Reinforcement Learning With Discrete-continuous Hybrid Action Spaces (2019)
- Excluding The Irrelevant: Focusing Reinforcement Learning Through Continuous Action Masking (2024)
- Handling Cost And Constraints With Off-policy Deep Reinforcement Learning (2023)
- CAMEL: Continuous Action Masking Enabled By Large Language Models For Reinforcement Learning (2025)
- Ensemble Reinforcement Learning In Continuous Spaces: A Hierarchical Multi-step Approach For Policy Training (2022)
- Discrete And Continuous Action Representation For Practical RL In Video Games (2019)
- Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity (2021)
- Deep Intrinsically Motivated Exploration In Continuous Control (2022)