Evaluating Agents Without Rewards
2020 · Brendon Matusch, Jimmy Ba, Danijar Hafner
Abstract
Reinforcement learning has enabled agents to solve challenging tasks in unknown environments. However, manually crafting reward functions can be time-consuming, expensive, and prone to human error. Competing objectives have been proposed for agents to learn without external supervision, but it has been unclear how well they reflect task rewards or human behavior. To accelerate the development of intrinsic objectives, we retrospectively compute potential objectives on pre-collected datasets of agent behavior, rather than optimizing them online, and compare them by analyzing their correlations. We study input entropy, information gain, and empowerment across seven agents, three Atari games, and the 3D game Minecraft. We find that all three intrinsic objectives correlate more strongly with a human behavior similarity metric than with task reward. Moreover, input entropy and information gain correlate more strongly with human similarity than task reward does, suggesting the use of in
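The retrospective evaluation described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the agent names, the toy datasets, the reward values, and the helper functions `input_entropy` and `pearson` are all hypothetical, and the preprocessing the paper applies to real game frames is not described in this section. The sketch only shows the general idea of computing an objective on pre-collected data and then correlating it with task reward across agents.

```python
import numpy as np


def input_entropy(observations: np.ndarray) -> float:
    """Shannon entropy (in nats) of the empirical distribution of observations.

    Each row of `observations` is treated as one discrete symbol.
    """
    _, counts = np.unique(observations, axis=0, return_counts=True)
    probs = counts / counts.sum()
    return float(-(probs * np.log(probs)).sum())


def pearson(x, y) -> float:
    """Pearson correlation coefficient between two equal-length sequences."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return float(np.corrcoef(x, y)[0, 1])


# Hypothetical pre-collected datasets: one array of discretized
# observations per agent. Real data would come from logged episodes.
rng = np.random.default_rng(0)
datasets = {
    "agent_a": rng.integers(0, 2, size=(500, 4)),
    "agent_b": rng.integers(0, 4, size=(500, 4)),
    "agent_c": rng.integers(0, 8, size=(500, 4)),
}

# Hypothetical average task reward per agent (made-up numbers).
task_reward = {"agent_a": 1.0, "agent_b": 2.5, "agent_c": 3.0}

# Compute the objective offline for each agent, then correlate
# it with task reward across agents.
names = sorted(datasets)
entropies = [input_entropy(datasets[name]) for name in names]
rewards = [task_reward[name] for name in names]
correlation = pearson(entropies, rewards)

print(dict(zip(names, entropies)))
print("correlation with task reward:", correlation)
```

Because nothing is optimized online, the same logged datasets can be reused to score any number of candidate objectives, which is what makes this kind of comparison cheap.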