GRAIL: Goal Recognition Alignment Through Imitation Learning
2026 Β· Osher Elhadad, Felipe Meneguzzi, Reuth Mirsky
Abstract
Understanding an agent's goals from its behavior is fundamental to aligning AI systems with human intentions. Existing goal recognition methods typically rely on an optimal goal-oriented policy representation, which may differ from the actor's true behavior and hinder the accurate recognition of their goal. To address this gap, this paper introduces Goal Recognition Alignment through Imitation Learning (GRAIL), which leverages imitation learning and inverse reinforcement learning to learn one goal-directed policy for each candidate goal directly from (potentially suboptimal) demonstration trajectories. By scoring an observed partial trajectory with each learned goal-directed policy in a single forward pass, GRAIL retains the one-shot inference capability of classical goal recognition while leveraging learned policies that can capture suboptimal and systematically biased behavior. Across the evaluated domains, GRAIL increases the F1-score by more than 0.5 under systematically biased opt
Authors
(none)
Tags
Stats
Related papers
- Goal Recognition As Reinforcement Learning (2022)6.34
- Towards Measuring Goal-directedness In AI Systems (2024)0.00
- Learning To Reach Goals Via Iterated Supervised Learning (2019)0.00
- A Pragmatic Look At Deep Imitation Learning (2021)0.00
- Self-supervised Goal-reaching Results In Multi-agent Cooperation And Exploration (2025)0.00
- Learning With Amigo: Adversarially Motivated Intrinsic Goals (2020)0.00
- When Will Generative Adversarial Imitation Learning Algorithms Attain Global Convergence (2020)0.00
- PAGAR: Taming Reward Misalignment In Inverse Reinforcement Learning-based Imitation Learning With Protagonist Antagonist Guided Adversarial Reward (2023)0.00