Effects Of Different Optimization Formulations In Evolutionary Reinforcement Learning On Diverse Behavior Generation
2021 Β· Victor Villin, Naoki Masuyama, Yusuke Nojima
Abstract
Generating various strategies for a given task is challenging. However, it has already proven to bring many assets to the main learning process, such as improved behavior exploration. With the growth in the interest of heterogeneity in solution in evolutionary computation and reinforcement learning, many promising approaches have emerged. To better understand how one guides multiple policies toward distinct strategies and benefit from diversity, we need to analyze further the influence of the reward signal modulation and other evolutionary mechanisms on the obtained behaviors. To that effect, this paper considers an existing evolutionary reinforcement learning framework which exploits multi-objective optimization as a way to obtain policies that succeed at behavior-related tasks as well as completing the main goal. Experiments on the Atari games stress that optimization formulations which do not consider objectives equally fail at generating diversity and even output agents that are wo
Authors
(none)
Tags
Stats
Related papers
- DGPO: Discovering Multiple Strategies With Diversity-guided Policy Optimization (2022)2.26
- Qualitative Differences Between Evolutionary Strategies And Reinforcement Learning Methods For Control Of Autonomous Agents (2022)0.00
- A Globally Convergent Evolutionary Strategy For Stochastic Constrained Optimization With Applications To Reinforcement Learning (2022)0.00
- Effective Diversity In Population Based Reinforcement Learning (2020)0.00
- The Impact Of Behavioral Diversity In Multi-agent Reinforcement Learning (2024)0.00
- Diverse Policies Converge In Reward-free Markov Decision Processe (2023)0.00
- Efficacy Of Modern Neuro-evolutionary Strategies For Continuous Control Optimization (2019)0.00
- Phasic Diversity Optimization For Population-based Reinforcement Learning (2024)0.00