The Need to Prioritize Model-Updating Processes in Clinical Artificial Intelligence (AI) Models: Protocol for a Scoping Review

Cited by: 7

Authors:

Otokiti, Ahmed Umar [1,9]

Ozoude, Makuochukwu Maryann [2]

Williams, Karmen S. [3]

Sadiq-onilenla, Rasheedat A. [4]

Ojo, Soji Akin [5]

Wasarme, Leyla B. [6]

Walsh, Samantha [7]

Edomwande, Maxwell [8]

Affiliations:

[1] Digital Hlth Solut LLC, White Plains, NY USA

[2] Zaporozhye State Med Univ, Zaporizhzhia, Ukraine

[3] CUNY, New York, NY USA

[4] Elevance Hlth Amerigrp Solut, Dept Qual Management, Iselin, NJ USA

[5] Thermo Fisher Sci, Pharmaceut Prod Dev PPD, Wilmington, NC USA

[6] Geisinger Hlth Syst, Danville, PA USA

[7] Icahn Sch Med Mt Sinai, Levy Lib, New York, NY USA

[8] Nuance Commun Inc, Burlington, MA USA

[9] Digital Hlth Solut LLC, 455 Tarrytown Rd Suite 1181, White Plains, NY 10607 USA

Source:

JMIR RESEARCH PROTOCOLS | 2023 / Volume 12

Keywords:

model updating; model calibration; artificial intelligence; machine learning; direct clinical care; PREDICTION MODELS; PROGNOSTIC MODELS; RISK PREDICTION; HEALTH-CARE; EXPLANATION; VALIDATION; IMPACT; ACCURATE; MEDICINE; RULES;

DOI:

10.2196/37685

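Chinese Library Classification (CLC) number:

R19 [Health care organizations and services (health services administration)];

Discipline classification number: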

Abstract:

Background: With an increase in the number of artificial intelligence (AI) and machine learning (ML) algorithms available for clinical settings, appropriate model updating and implementation of updates are imperative to ensure applicability, reproducibility, and patient safety.

Objective: The objective of this scoping review was to evaluate and assess the model-updating practices of AI and ML clinical models that are used in direct patient-provider clinical decision-making.

Methods: We used the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) checklist and the PRISMA-P protocol guidance in addition to a modified CHARMS (Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies) checklist to conduct this scoping review. A comprehensive medical literature search of databases, including Embase, MEDLINE, PsycINFO, Cochrane, Scopus, and Web of Science, was conducted to identify AI and ML algorithms that would impact clinical decision-making at the level of direct patient care. Our primary end point is the rate at which model updating is recommended by published algorithms; we will also conduct an assessment of study quality and risk of bias in all publications reviewed. In addition, we will evaluate the rate at which published algorithms include ethnic and gender demographic distribution information in their training data as a secondary end point.

Results: Our initial literature search yielded approximately 13,693 articles, with approximately 7810 articles to consider for full reviews among our team of 7 reviewers. We plan to complete the review process and disseminate the results by spring of 2023.

Conclusions: Although AI and ML applications in health care have the potential to improve patient care by reducing errors between measurement and model output, currently there exists more hype than hope because of the lack of proper external validation of these models. We expect to find that the AI and ML model-updating methods are proxies for model applicability and generalizability on implementation. Our findings will add to the field by determining the degree to which published models


Pages: 15
