Benefits and harms associated with the use of AI-related algorithmic decision-making systems by healthcare professionals: a systematic review

被引:6
作者
Wilhelm, Christoph [1 ,3 ]
Steckelberg, Anke [2 ]
Rebitschek, Felix G. [3 ,4 ]
机构
[1] Martin Luther Univ Halle Wittenberg, Inst Hlth & Nursing Sci, Int Grad Acad InGrA, Med Fac, Magdeburger Str 8, D-06112 Halle, Saale, Germany
[2] Martin Luther Univ Halle Wittenberg, Inst Hlth & Nursing Sci, Med Fac, Magdeburger Str 8, D-06112 Halle, Saale, Germany
[3] Univ Potsdam, Fac Hlth Sci Brandenburg, Harding Ctr Risk Literacy, Virchowstr 2, D-14482 Potsdam, Germany
[4] Max Planck Inst Human Dev, Lentzeallee 94, D-14195 Berlin, Germany
来源
LANCET REGIONAL HEALTH-EUROPE | 2025年 / 48卷
关键词
Algorithmic decision making; ADM; Arti fi cial intelligence; AI; Patient-relevant; Healthcare professionals; ARTIFICIAL-INTELLIGENCE; EUROPE; CANCER; TRIAL; RISK;
D O I
10.1016/j.lanepe.2024.101145
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background Despite notable advancements in artificial intelligence (AI) that enable complex systems to perform certain tasks more accurately than medical experts, the impact on patient-relevant outcomes remains uncertain. To address this gap, this systematic review assesses the benefits and harms associated with AI-related algorithmic decision-making (ADM) systems used by healthcare professionals, compared to standard care. Methods In accordance with the PRISMA guidelines, we included interventional and observational studies published as peer-reviewed full-text articles that met the following criteria: human patients; interventions involving algorithmic decision-making systems, developed with and/or utilizing machine learning (ML); and outcomes describing patient- relevant benefits and harms that directly affect health and quality of life, such as mortality and morbidity. Studies that did not undergo preregistration, lacked a standard-of-care control, or pertained to systems that assist in the execution of actions (e.g., in robotics) were excluded. We searched MEDLINE, EMBASE, IEEE Xplore, and Google Scholar for studies published in the past decade up to 31 March 2024. We assessed risk of bias using Cochrane's RoB 2 and ROBINS-I tools, and reporting transparency with CONSORT-AI and TRIPOD-AI. Two researchers independently managed the processes and resolved conflicts through discussion. This review has been registered with PROSPERO (CRD42023412156) and the study protocol has been published. Findings Out of 2,582 records identified after deduplication, 18 randomized controlled trials (RCTs) and one cohort study met the inclusion criteria, covering specialties such as psychiatry, oncology, and internal medicine. Collectively, the studies included a median of 243 patients (IQR 124-828), with a median of 50.5% female participants (range 12.5-79.0, IQR 43.6-53.6) across intervention and control groups. Four studies were classified as having low risk of bias, seven showed some concerns, and another seven were assessed as having high or serious risk of bias. Reporting transparency varied considerably: six studies showed high compliance, four moderate, and fi ve low compliance with CONSORT-AI or TRIPOD-AI. Twelve studies (63%) reported patient-relevant benefits. Of those with low risk of bias, interventions reduced length of stay in hospital and intensive care unit (10.3 vs. 13.0 days, p = 0.042; 6.3 vs. 8.4 days, p = 0.030), in-hospital mortality (9.0% vs. 21.3%, p = 0.018), and depression symptoms in non-complex cases (45.1% vs. 52.3%, p = 0.03). However, harms were frequently underreported, with only eight studies (42%) documenting adverse events. No study reported an increase in adverse events as a result of the interventions. Interpretation The current evidence on AI-related ADM systems provides limited insights into patient-relevant outcomes. Our fi ndings underscore the essential need for rigorous evaluations of clinical benefits, reinforced compliance with methodological standards, and balanced consideration of both benefits and harms to ensure meaningful integration into healthcare practice.
引用
收藏
页数:11
相关论文
共 65 条
[1]  
Agrawal A, 2019, EndNote 1-2-3 easy!: reference management for the professional, P7
[2]   Prediction of preterm birth using artificial intelligence: a systematic review [J].
Akazawa, Munetoshi ;
Hashimoto, Kazunori .
JOURNAL OF OBSTETRICS AND GYNAECOLOGY, 2022, 42 (06) :1662-1668
[3]   Revolutionizing healthcare: the role of artificial intelligence in clinical practice [J].
Alowais, Shuroug A. ;
Alghamdi, Sahar S. ;
Alsuhebany, Nada ;
Alqahtani, Tariq ;
Alshaya, Abdulrahman I. ;
Almohareb, Sumaya N. ;
Aldairem, Atheer ;
Alrashed, Mohammed ;
Bin Saleh, Khalid ;
Badreldin, Hisham A. ;
Al Yami, Majed S. ;
Al Harbi, Shmeylan ;
Albekairy, Abdulkareem M. .
BMC MEDICAL EDUCATION, 2023, 23 (01)
[4]   High-Risk Breast Lesions: A Machine Learning Model to Predict Pathologic Upgrade and Reduce Unnecessary Surgical Excision [J].
Bahl, Manisha ;
Barzilay, Regina ;
Yedidia, Adam B. ;
Locascio, Nicholas J. ;
Yu, Lili ;
Lehman, Constance D. .
RADIOLOGY, 2018, 286 (03) :810-818
[5]   A trial of a real-time Alert for clinical deterioration in Patients hospitalized on general medical wards [J].
Bailey, Thomas C. ;
Chen, Yixin ;
Mao, Yi ;
Lu, Chenyang ;
Hackmann, Gregory ;
Micek, Scott T. ;
Heard, Kevin M. ;
Faulkner, Kelly M. ;
Kollef, Marin H. .
JOURNAL OF HOSPITAL MEDICINE, 2013, 8 (05) :236-242
[6]   Systematic literature review of machine learning methods used in the analysis of real-world data for patient-provider decision making [J].
Brnabic, Alan ;
Hess, Lisa M. .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
[7]   Computerised interpretation of fetal heart rate during labour (INFANT): a randomised controlled trial [J].
Brocklehurst, Peter ;
Johns, Nina ;
Johnston, Tracey ;
Barnfield, Gemma ;
Davies, Karen ;
Johnson, Mark ;
Patterson, Holly ;
Montague, Imogen ;
Watmore, Sally ;
Stolton, Alison ;
Parisaei, Maryam ;
McGhee, Natasha ;
Segovia, Silvia ;
Martindale, Elizabeth ;
Jackson, Hilary ;
Holleran, Josephine ;
Roberts, Devender ;
Holt, Siobhan ;
Dragovic, Bosko ;
Willmott-Powell, Miriam ;
Hutchinson, Laura ;
Toth, Benedek ;
Chandler, Gemma ;
Ridley, Suzanne ;
Bugg, George ;
Molnar, Anna ;
Lochrie, Denise ;
Connor, Jillian ;
Howe, David ;
Head, Katie ;
Wellstead, Sue ;
Mathers, Alan ;
Walker, Laura ;
Crawford, Isobel ;
Davies, David ;
Garner, Zoe ;
Galloway, Lucy ;
Bugg, George ;
Davies, Yvette ;
Smith, Carys ;
Perkins, Gill ;
Geary, Mike ;
Walsh, Fiona ;
Nagle, Ursula ;
Martindale, Elizabeth ;
Jackson, Hilary ;
O'Malley, Louise ;
Katakam, Narmada ;
White, Heather ;
Tanton, Emma .
LANCET, 2017, 389 (10080) :1719-1729
[8]   A new risk and issue management system to improve productivity, quality, and compliance in clinical trials [J].
Ciervo, Joseph ;
Shen, Shih Chuan ;
Stallcup, Kristin ;
Thomas, Abraham ;
Farnum, Michael A. ;
Lobanov, Victor S. ;
Agrafiotis, Dimitris K. .
JAMIA OPEN, 2019, 2 (02) :216-221
[9]   TRIPOD plus AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods [J].
Collins, Gary S. ;
Moons, Karel G. M. ;
Dhiman, Paula ;
Riley, Richard ;
Beam, Andrew L. ;
Van Calster, Ben ;
Ghassemi, Marzyeh ;
Liu, Xiaoxuan ;
Reitsma, Johannes B. ;
van Smeden, Maarten ;
Boulesteix, Anne-Laure ;
Camaradou, Jennifer Catherine ;
Celi, Leo Anthony ;
Denaxas, Spiros ;
Denniston, Alastair K. ;
Glocker, Ben ;
Golub, Robert M. ;
Harvey, Hugh ;
Heinze, Georg ;
Hoffman, Michael M. ;
Kengne, Andre Pascal ;
Lam, Emily ;
Lee, Naomi ;
Loder, Elizabeth W. ;
Maier-Hein, Lena ;
Mateen, Bilal A. ;
McCradden, Melissa ;
Oakden-Rayner, Lauren ;
Ordish, Johan ;
Parnell, Richard ;
Rose, Sherri ;
Singh, Karandeep ;
Wynants, Laure ;
Logullo, Patricia .
BMJ-BRITISH MEDICAL JOURNAL, 2024, 385
[10]  
Decker C, 2023, medRxiv, DOI [10.1101/2023.05.22.23290326, DOI 10.1101/2023.05.22.23290326]