Reporting guideline for the early-stage clinical evaluation of decision support systems driven by artificial intelligence: DECIDE-AI

被引:45
作者
Vasey, Baptiste [1 ,2 ,3 ]
Nagendran, Myura [4 ]
Campbell, Bruce [5 ,6 ]
Clifton, David A. [2 ]
Collins, Gary S. [7 ]
Denaxas, Spiros [8 ,9 ,10 ,11 ]
Denniston, Alastair K. [12 ,13 ,14 ]
Faes, Livia [14 ]
Geerts, Bart [15 ]
Ibrahim, Mudathir [1 ,16 ]
Liu, Xiaoxuan [3 ,12 ]
Mateen, Bilal A. [8 ,17 ,18 ]
Mathur, Piyush [19 ]
McCradden, Melissa D. [20 ,21 ]
Morgan, Lauren [22 ]
Ordish, Johan [23 ]
Rogers, Campbell [24 ]
Saria, Suchi [25 ,26 ,27 ,28 ,29 ]
Ting, Daniel S. W. [30 ,31 ]
Watkinson, Peter [3 ,32 ]
Weber, Wim [33 ]
Wheatstone, Peter [34 ]
McCulloch, Peter [1 ]
机构
[1] Univ Oxford, Nuffield Dept Surg Sci, Oxford, England
[2] Univ Oxford, Inst Biomed Engn, Dept Engn Sci, Oxford, England
[3] Univ Oxford, Nuffield Dept Clin Neurosci, Crit Care Res Grp, Oxford, England
[4] Imperial Coll London, UKRI Ctr Doctoral Training AI Healthcare, London, England
[5] Univ Exeter, Med Sch, Exeter, Devon, England
[6] Royal Devon & Exeter Hosp, Exeter, Devon, England
[7] Univ Oxford, Ctr Stat Med, Nuffield Dept Orthopaed Rheumatol & Musculoskelet, Oxford, England
[8] UCL, Inst Hlth Informat, London, England
[9] British Heart Fdn Data Sci Ctr, London, England
[10] Hlth Data Res England, London, England
[11] UCL Hosp Biomed Res Ctr, London, England
[12] Univ Hosp Birmingham NHS Fdn Trust, Birmingham, W Midlands, England
[13] Univ Birmingham, Acad Unit Ophthalmol, Coll Med & Dent Sci, Inst Inflammat & Ageing, Birmingham, W Midlands, England
[14] Moorfields Eye Hosp NHS Fdn Trust, London, England
[15] Healthplusai R&D BV, Amsterdam, Netherlands
[16] Maimonides Hosp, Dept Surg, Brooklyn, NY 11219 USA
[17] Wellcome Trust Res Labs, London, England
[18] Alan Turing Inst, London, England
[19] Cleveland Clin, Dept Gen Anesthesiol, Anesthesiol Inst, Cleveland, OH 44106 USA
[20] Hosp Sick Children, Toronto, ON, Canada
[21] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
[22] Morgan Human Syst Ltd, Shrewsbury, Salop, England
[23] Med & Healthcare Prod Regulatory Agcy, London, England
[24] HeartFlow Inc, Redwood City, CA USA
[25] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
[26] Johns Hopkins Univ, Dept Stat, Baltimore, MD USA
[27] Johns Hopkins Univ, Dept Hlth Policy, Baltimore, MD USA
[28] Johns Hopkins Univ, Div Informat, Baltimore, MD USA
[29] Bayesian Hlth, New York, NY USA
[30] Singapore Eye Res Inst, Singapore Natl Eye Ctr, Singapore, Singapore
[31] Natl Univ Singapore, Duke NUS Med Sch, Singapore, Singapore
[32] Oxford Univ Hosp NHS Trust, NIHR Biomed Res Ctr Oxford, Oxford, England
[33] The BMJ, London, England
[34] Univ Leeds, Sch Med, Leeds, W Yorkshire, England
基金
美国国家卫生研究院; 英国惠康基金; 英国医学研究理事会; 英国工程与自然科学研究理事会; 美国国家科学基金会;
关键词
DELPHI; STATEMENT;
D O I
10.1038/s41591-022-01772-9
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The DECIDE-AI checklist, resulting from a multi-stakeholder group of experts in a Delphi process and following the EQUATOR Network's recommendations, includes key items that should be reported in early-stage clinical studies of AI-based decision support systems, to ensure a responsible and transparent deployment of AI systems in healthcare. A growing number of artificial intelligence (AI)-based clinical decision support systems are showing promising performance in preclinical, in silico evaluation, but few have yet demonstrated real benefit to patient care. Early-stage clinical evaluation is important to assess an AI system's actual clinical performance at small scale, ensure its safety, evaluate the human factors surrounding its use and pave the way to further large-scale trials. However, the reporting of these early studies remains inadequate. The present statement provides a multi-stakeholder, consensus-based reporting guideline for the Developmental and Exploratory Clinical Investigations of DEcision support systems driven by Artificial Intelligence (DECIDE-AI). We conducted a two-round, modified Delphi process to collect and analyze expert opinion on the reporting of early clinical evaluation of AI systems. Experts were recruited from 20 pre-defined stakeholder categories. The final composition and wording of the guideline was determined at a virtual consensus meeting. The checklist and the Explanation & Elaboration (E&E) sections were refined based on feedback from a qualitative evaluation process. In total, 123 experts participated in the first round of Delphi, 138 in the second round, 16 in the consensus meeting and 16 in the qualitative evaluation. The DECIDE-AI reporting guideline comprises 17 AI-specific reporting items (made of 28 subitems) and ten generic reporting items, with an E&E paragraph provided for each. Through consultation and consensus with a range of stakeholders, we developed a guideline comprising key items that should be reported in early-stage clinical studies of AI-based decision support systems in healthcare. By providing an actionable checklist of minimal reporting items, the DECIDE-AI guideline will facilitate the appraisal of these studies and replicability of their findings.
引用
收藏
页码:924 / +
页数:12
相关论文
共 61 条
[1]  
[Anonymous], 2019, Evidence Standards Framework for Digital Health Technologies, DOI [10.1007/978-1-349-95810-8867, DOI 10.1007/978-1-349-95810-8867]
[2]  
[Anonymous], 1986, User-centered system design: New perspectives on human-computer interaction
[3]   Research Trends in Artificial Intelligence Applications in Human Factors Health Care: Mapping Review [J].
Asan, Onur ;
Choudhury, Avishek .
JMIR HUMAN FACTORS, 2021, 8 (02)
[4]   The IDEAL Reporting Guidelines A Delphi Consensus Statement Stage Specific Recommendations for Reporting the Evaluation of Surgical Innovation [J].
Bilbro, Nicole A. ;
Hirst, Allison ;
Paez, Arsenio ;
Vasey, Baptiste ;
Pufulete, Maria ;
Sedrakyan, Art ;
McCulloch, Peter .
ANNALS OF SURGERY, 2021, 273 (01) :82-85
[5]   Two different invitation approaches for consecutive rounds of a Delphi survey led to comparable final outcome [J].
Boel, Anne ;
Navarro-Compan, Victoria ;
Landewe, Robert ;
van der Heijde, Desiree .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2021, 129 :31-39
[6]   Reporting of artificial intelligence prediction models [J].
Collins, Gary S. ;
Moons, Karel G. M. .
LANCET, 2019, 393 (10181) :1577-1579
[7]   AN EXPERIMENTAL APPLICATION OF THE DELPHI METHOD TO THE USE OF EXPERTS [J].
DALKEY, N ;
HELMER, O .
MANAGEMENT SCIENCE, 1963, 9 (03) :458-467
[8]   The Importance of Incorporating Human Factors in the Design and Implementation of Artificial Intelligence for Skin Cancer Diagnosis in the Real World [J].
Felmingham, Claire M. ;
Adler, Nikki R. ;
Ge, Zongyuan ;
Morton, Rachael L. ;
Janda, Monika ;
Mar, Victoria J. .
AMERICAN JOURNAL OF CLINICAL DERMATOLOGY, 2021, 22 (02) :233-242
[9]   The Clinician and Dataset Shift in Artificial Intelligence [J].
Finlayson, Samuel G. ;
Subbaswamy, Adarsh ;
Singh, Karandeep ;
Bowers, John ;
Kupke, Annabel ;
Zittrain, Jonathan ;
Kohane, Isaac S. ;
Saria, Suchi .
NEW ENGLAND JOURNAL OF MEDICINE, 2021, 385 (03) :283-286
[10]   Use of artificial intelligence for image analysis in breast cancer screening programmes: systematic review of test accuracy [J].
Freeman, Karoline ;
Geppert, Julia ;
Stinton, Chris ;
Todkill, Daniel ;
Johnson, Samantha ;
Clarke, Aileen ;
Taylor-Phillips, Sian .
BMJ-BRITISH MEDICAL JOURNAL, 2021, 374