Aligning Humans And Robots Via Reinforcement Learning From Implicit Human Feedback
2025 Β· Suzie Kim, Hye-Bin Shin, Seong-Whan Lee
Abstract
Conventional reinforcement learning (RL) ap proaches often struggle to learn effective policies under sparse reward conditions, necessitating the manual design of complex, task-specific reward functions. To address this limitation, rein forcement learning from human feedback (RLHF) has emerged as a promising strategy that complements hand-crafted rewards with human-derived evaluation signals. However, most existing RLHF methods depend on explicit feedback mechanisms such as button presses or preference labels, which disrupt the natural interaction process and impose a substantial cognitive load on the user. We propose a novel reinforcement learning from implicit human feedback (RLIHF) framework that utilizes non-invasive electroencephalography (EEG) signals, specifically error-related potentials (ErrPs), to provide continuous, implicit feedback without requiring explicit user intervention. The proposed method adopts a pre-trained decoder to transform raw EEG signals into probabilistic
Authors
(none)
Tags
Stats
Related papers
- Accelerating Reinforcement Learning Agent With Eeg-based Implicit Human Feedback (2020)0.00
- A Survey Of Reinforcement Learning From Human Feedback (2023)0.00
- Towards Reinforcement Learning From Neural Feedback: Mapping Fnirs Signals To Agent Performance (2025)0.00
- Mapping Out The Space Of Human Feedback For Reinforcement Learning: A Conceptual Framework (2024)0.00
- Provably Feedback-efficient Reinforcement Learning Via Active Reward Learning (2023)0.00
- The Alignment Ceiling: Objective Mismatch In Reinforcement Learning From Human Feedback (2023)0.00
- REBEL: Reward Regularization-based Approach For Robotic Reinforcement Learning From Human Feedback (2023)0.00
- Improving Multimodal Interactive Agents With Reinforcement Learning From Human Feedback (2022)0.00