Does Reinforcement Learning Improve Outcomes for Critically Ill Patients? A Systematic Review and Level-of-Readiness Assessment

被引:3
作者
Otten, Martijn [1 ,2 ]
Jagesar, Ameet R. [1 ,2 ]
Dam, Tariq A. [1 ,2 ]
Biesheuvel, Laurens A. [1 ,2 ]
den Hengst, Floris [2 ]
Ziesemer, Kirsten A. [3 ]
Thoral, Patrick J. [1 ]
de Grooth, Harm-Jan [1 ]
Girbes, Armand R. J. [1 ]
Francois-Lavet, Vincent [2 ]
Hoogendoorn, Mark [2 ]
Elbers, Paul W. G. [1 ]
机构
[1] Vrije Univ, Ctr Crit Care Computat Intelligence, Dept Intens Care Med, Amsterdam Med Data Sci AMDS,Amsterdam UMC, Amsterdam, Netherlands
[2] Vrije Univ, Fac Sci, Dept Comp Sci, Quantitat Data Analyt Grp, Amsterdam, Netherlands
[3] Vrije Univ, Univ Lib, Amsterdam, Netherlands
关键词
artificial intelligence; intensive care medicine; machine learning; reinforcement learning; systematic review; sequential decision-making; MEDICINE; SEPSIS;
D O I
10.1097/CCM.0000000000006100
中图分类号
R4 [临床医学];
学科分类号
1002 ; 100602 ;
摘要
OBJECTIVE: Reinforcement learning (RL) is a machine learning technique uniquely effective at sequential decision-making, which makes it potentially relevant to ICU treatment challenges. We set out to systematically review, assess level-of-readiness and meta-analyze the effect of RL on outcomes for critically ill patients. DATA SOURCES: A systematic search was performed in PubMed, Embase.com, Clarivate Analytics/Web of Science Core Collection, Elsevier/SCOPUS and the Institute of Electrical and Electronics Engineers Xplore Digital Library from inception to March 25, 2022, with subsequent citation tracking. DATA EXTRACTION: Journal articles that used an RL technique in an ICU population and reported on patient health-related outcomes were included for full analysis. Conference papers were included for level-of-readiness assessment only. Descriptive statistics, characteristics of the models, outcome compared with clinicians policy and level-of-readiness were collected. RL-health risk of bias and applicability assessment was performed. DATA SYNTHESIS: A total of 1,033 articles were screened, of which 18 journal articles and 18 conference papers, were included. Thirty of those were prototyping or modeling articles and six were validation articles. All articles reported RL algorithms to outperform clinical decision-making by ICU professionals, but only in retrospective data. The modeling techniques for the state-space, action-space, reward function, RL model training, and evaluation varied widely. The risk of bias was high in all articles, mainly due to the evaluation procedure. CONCLUSION: In this first systematic review on the application of RL in intensive care medicine we found no studies that demonstrated improved patient outcomes from RL-based technologies. All studies reported that RL-agent policies outperformed clinician policies, but such assessments were all based on retrospective off-policy evaluation. Copyright (C) 2023 by the Society of Critical Care Medicine and Wolters Kluwer Health, Inc. All Rights Reserved.
引用
收藏
页码:e79 / e88
页数:10
相关论文
共 62 条
[1]  
Baucum M., 2022, INFORMS Journal on Data Science, V1, P27
[2]   Improving Deep Reinforcement Learning With Transitional Variational Autoencoders: A Healthcare Application [J].
Baucum, Matthew ;
Khojandi, Anahita ;
Vasudevan, Rama .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (06) :2273-2280
[3]   Critical Bias in Critical Care Devices [J].
Charpignon, Marie-Laure ;
Byersb, Joseph ;
Cabral, Stephanie ;
Celi, Leo Anthony ;
Fernandes, Chrystinne ;
Gallifantg, Jack ;
Lough, Mary E. ;
Mlombwa, Donald ;
Moukheiber, Lama ;
Ong, Bradley Ashley ;
Panitchote, Anupol ;
Williamo, Wasswa ;
Wong, An-Kwok Ian ;
Nazer, Lama .
CRITICAL CARE CLINICS, 2023, 39 (04) :795-813
[4]   A model-based hybrid soft actor-critic deep reinforcement learning algorithm for optimal ventilator settings [J].
Chen, Shaotao ;
Qiu, Xihe ;
Tan, Xiaoyu ;
Fang, Zhijun ;
Jin, Yaochu .
INFORMATION SCIENCES, 2022, 611 :47-64
[5]  
den Hengst F, 2020, Data Science, V3, P107, DOI [10.3233/ds-200028, 10.3233/DS-200028, DOI 10.3233/DS-200028]
[6]   Patient-Specific Sedation Management via Deep Reinforcement Learning [J].
Eghbali, Niloufar ;
Alhanai, Tuka ;
Ghassemi, Mohammad M. .
FRONTIERS IN DIGITAL HEALTH, 2021, 3
[7]   Assuring the safety of AI-based clinical decision support systems: a case study of the AI Clinician for sepsis treatment [J].
Festor, Paul ;
Jia, Yan ;
Gordon, Anthony C. ;
Faisal, A. Aldo ;
Habli, Ibrahim ;
Komorowski, Matthieu .
BMJ HEALTH & CARE INFORMATICS, 2022, 29 (01)
[8]   Machine learning in intensive care medicine: ready for take-off? [J].
Fleuren, Lucas M. ;
Thoral, Patrick ;
Shillan, Duncan ;
Ercole, Ari ;
Elbers, Paul W. G. .
INTENSIVE CARE MEDICINE, 2020, 46 (07) :1486-1488
[9]   An Introduction to Deep Reinforcement Learning [J].
Francois-Lavet, Vincent ;
Henderson, Peter ;
Islam, Riashat ;
Bellemare, Marc G. ;
Pineau, Joelle .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2018, 11 (3-4) :219-354
[10]  
Futoma J, 2020, PR MACH LEARN RES, V108, P3578