Influencing Reinforcement Learning Through Natural Language Guidance
2021 Β· Tasmia Tasrin, Md Sultan Al Nahian, Habarakadage Perera, et al.
Abstract
Interactive reinforcement learning agents use human feedback or instruction to help them learn in complex environments. Often, this feedback comes in the form of a discrete signal that is either positive or negative. While informative, this information can be difficult to generalize on its own. In this work, we explore how natural language advice can be used to provide a richer feedback signal to a reinforcement learning agent by extending policy shaping, a well-known Interactive reinforcement learning technique. Usually policy shaping employs a human feedback policy to help an agent to learn more about how to achieve its goal. In our case, we replace this human feedback policy with policy generated based on natural language advice. We aim to inspect if the generated natural language reasoning provides support to a deep reinforcement learning agent to decide its actions successfully in any given environment. So, we design our model with three networks: first one is the experience drive
Authors
(none)
Tags
Stats
Related papers
- Shaping Advice In Deep Reinforcement Learning (2022)0.00
- Learning Shaping Strategies In Human-in-the-loop Interactive Reinforcement Learning (2018)0.00
- Improving Interactive Reinforcement Learning: What Makes A Good Teacher? (2019)11.19
- Human Engagement Providing Evaluative And Informative Advice For Interactive Reinforcement Learning (2020)9.23
- Opinion-guided Reinforcement Learning (2024)0.00
- Directed Policy Gradient For Safe Reinforcement Learning With Human Advice (2018)0.00
- Improving Multimodal Interactive Agents With Reinforcement Learning From Human Feedback (2022)0.00
- Subgoal-based Reward Shaping To Improve Efficiency In Reinforcement Learning (2021)0.00