Discrete And Continuous Action Representation For Practical RL In Video Games
2019 Β· Olivier Delalleau, Maxim Peter, Eloi Alonso, et al.
Abstract
While most current research in Reinforcement Learning (RL) focuses on improving the performance of the algorithms in controlled environments, the use of RL under constraints like those met in the video game industry is rarely studied. Operating under such constraints, we propose Hybrid SAC, an extension of the Soft Actor-Critic algorithm able to handle discrete, continuous and parameterized actions in a principled way. We show that Hybrid SAC can successfully solve a highspeed driving task in one of our games, and is competitive with the state-of-the-art on parameterized actions benchmark tasks. We also explore the impact of using normalizing flows to enrich the expressiveness of the policy at minimal computational cost, and identify a potential undesired effect of SAC when used with normalizing flows, that may be addressed by optimizing a different objective.
Authors
(none)
Tags
Stats
Related papers
- Improving Exploration In Soft-actor-critic With Normalizing Flows Policies (2019)0.00
- Revisiting Discrete Soft Actor-critic (2022)0.00
- Striving For Simplicity And Performance In Off-policy DRL: Output Normalization And Non-uniform Sampling (2019)0.00
- Attraction-repulsion Actor-critic For Continuous Control Reinforcement Learning (2019)0.00
- Deep RL With Information Constrained Policies: Generalization In Continuous Control (2020)0.00
- Deep Multi-agent Reinforcement Learning With Hybrid Action Spaces Based On Maximum Entropy (2022)0.00
- Deep Multi-agent Reinforcement Learning With Discrete-continuous Hybrid Action Spaces (2019)12.47
- DR-SAC: Distributionally Robust Soft Actor-critic For Reinforcement Learning Under Uncertainty (2025)0.00