Robustifying A Policy In Multi-agent RL With Diverse Cooperative Behaviors And Adversarial Style Sampling For Assistive Tasks
2024 Β· Takayuki Osa, Tatsuya Harada
Abstract
Autonomous assistance of people with motor impairments is one of the most promising applications of autonomous robotic systems. Recent studies have reported encouraging results using deep reinforcement learning (RL) in the healthcare domain. Previous studies showed that assistive tasks can be formulated as multi-agent RL, wherein there are two agents: a caregiver and a care-receiver. However, policies trained in multi-agent RL are often sensitive to the policies of other agents. In such a case, a trained caregiver's policy may not work for different care-receivers. To alleviate this issue, we propose a framework that learns a robust caregiver's policy by training it for diverse care-receiver responses. In our framework, diverse care-receiver responses are autonomously learned through trials and errors. In addition, to robustify the care-giver's policy, we propose a strategy for sampling a care-receiver's response in an adversarial manner during the training. We evaluated the proposed m
Authors
(none)
Tags
Stats
Related papers
- Adversarial Policies: Attacking Deep Reinforcement Learning (2019)0.00
- Online Robust Policy Learning In The Presence Of Unknown Adversaries (2018)0.00
- Safety Correction From Baseline: Towards The Risk-aware Policy In Robotics Via Dual-agent Reinforcement Learning (2022)3.58
- Robust Model-based Reinforcement Learning With An Adversarial Auxiliary Model (2024)0.00
- Safe Reinforcement Learning With Dual Robustness (2023)8.60
- Policy Diagnosis Via Measuring Role Diversity In Cooperative Multi-agent RL (2022)0.00
- Neutral Agent-based Adversarial Policy Learning Against Deep Reinforcement Learning In Multi-party Open Systems (2025)0.00
- Behaviour-conditioned Policies For Cooperative Reinforcement Learning Tasks (2021)2.26