Learning-to-Defer for Sequential Medical Decision-Making Under Uncertainty
2021 · Shalmali Joshi, Sonali Parbhoo, Finale Doshi-Velez
Abstract
Learning-to-defer is a framework to automatically defer decision-making to a human expert when ML-based decisions are deemed unreliable. Existing learning-to-defer frameworks are not designed for sequential settings. That is, they defer at every instance independently, based on immediate predictions, while ignoring the potential long-term impact of these interventions. As a result, existing frameworks are myopic. Further, they do not defer adaptively, which is crucial when human interventions are costly. In this work, we propose Sequential Learning-to-Defer (SLTD), a framework for learning-to-defer to a domain expert in sequential decision-making settings. Contrary to existing literature, we pose the problem of learning-to-defer as model-based reinforcement learning (RL) to i) account for long-term consequences of ML-based actions using RL and ii) adaptively defer based on the dynamics (model-based). Our proposed framework determines whether to defer (at each time step) by quantifying