Autonomous Self-explanation Of Behavior For Interactive Reinforcement Learning Agents
2018 Β· Yosuke Fukuchi, Masahiko Osawa, Hiroshi Yamakawa, et al.
Abstract
In cooperation, the workers must know how co-workers behave. However, an agent's policy, which is embedded in a statistical machine learning model, is hard to understand, and requires much time and knowledge to comprehend. Therefore, it is difficult for people to predict the behavior of machine learning robots, which makes Human Robot Cooperation challenging. In this paper, we propose Instruction-based Behavior Explanation (IBE), a method to explain an autonomous agent's future behavior. In IBE, an agent can autonomously acquire the expressions to explain its own behavior by reusing the instructions given by a human expert to accelerate the learning of the agent's policy. IBE also enables a developmental agent, whose policy may change during the cooperation, to explain its own behavior with sufficient time granularity.
Authors
(none)
Tags
Stats
Related papers
- Experiential Explanations For Reinforcement Learning (2022)2.26
- Talktoagent: A Human-centric Explanation Of Reinforcement Learning Agents With Large Language Models (2025)0.00
- What Did You Think Would Happen? Explaining Agent Behaviour Through Intended Outcomes (2020)0.00
- An Organizationally-oriented Approach To Enhancing Explainability And Control In Multi-agent Reinforcement Learning (2025)2.26
- MAGIC-MASK: Multi-agent Guided Inter-agent Collaboration With Mask-based Explainability For Reinforcement Learning (2025)0.00
- REVEAL-IT: Reinforcement Learning With Visibility Of Evolving Agent Policy For Interpretability (2024)0.00
- Why The Agent Made That Decision: Contrastive Explanation Learning For Reinforcement Learning (2024)0.00
- Behaviour-conditioned Policies For Cooperative Reinforcement Learning Tasks (2021)2.26