The Low Rate of Adherence to Checklist for Artificial Intelligence in Medical Imaging Criteria Among Published Prostate MRI Artificial Intelligence Algorithms

被引:13
|
作者
Belue, Mason J. [1 ]
Harmon, Stephanie A. [1 ]
Lay, Nathan S. [1 ]
Daryanani, Asha [1 ]
Phelps, Tim E. [1 ]
Choyke, Peter L. [1 ]
Turkbey, Baris [1 ,2 ]
机构
[1] NCI, Artificial Intelligence Resource, Mol Imaging Branch, NIH, Bethesda, MD USA
[2] NCI, Artificial Intelligence Resource, Mol Imaging Branch, 10 Ctr Dr,MSC 1182,Bldg 10,Room B3B85, Bethesda, MD 20892 USA
基金
美国国家卫生研究院;
关键词
AI; CLAIM; classi fi cation; detection; prostate cancer; study rigor; CANCER DETECTION; CLINICALLY SIGNIFICANT; GLEASON SCORE; RADIOMICS; CLASSIFICATION; VALIDATION; DIAGNOSIS; FEATURES; IMAGES;
D O I
10.1016/j.jacr.2022.05.022
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Objective: To determine the rigor, generalizability, and reproducibility of published classification and detection artificial intelligence (AI) models for prostate cancer (PCa) on MRI using the Checklist for Artificial Intelligence in Medical Imaging (CLAIM) guidelines, a 42-item checklist that is considered a measure of best practice for presenting and reviewing medical imaging AI research.Materials and methods: This review searched English literature for studies proposing PCa AI detection and classification models on MRI. Each study was evaluated with the CLAIM checklist. The additional outcomes for which data were sought included measures of AI model performance (eg, area under the curve [AUC], sensitivity, specificity, free-response operating characteristic curves), training and validation and testing group sample size, AI approach, detection versus classification AI, public data set utilization, MRI sequences used, and definition of gold standard for ground truth. The percentage of CLAIM checklist fulfillment was used to stratify studies into quartiles. Wilcoxon's rank-sum test was used for pair-wise comparisons.Results: In all, 75 studies were identified, and 53 studies qualified for analysis. The original CLAIM items that most studies did not fulfill includes item 12 (77% no): de-identification methods; item 13 (68% no): handling missing data; item 15 (47% no): rationale for choosing ground truth reference standard; item 18 (55% no): measurements of inter-and intrareader variability; item 31 (60% no): inclusion of validated interpretability maps; item 37 (92% no): inclusion of failure analysis to elucidate AI model weaknesses. An AUC score versus percentage CLAIM fulfillment quartile revealed a significant difference of the mean AUC scores between quartile 1 versus quartile 2 (0.78 versus 0.86, P = .034) and quartile 1 versus quartile 4 (0.78 versus 0.89, P = .003) scores. Based on additional information and outcome metrics gathered in this study, additional measures of best practice are defined. These new items include disclosure of public dataset usage, ground truth definition in comparison to other referenced works in the defined task, and sample size power calculation.Conclusion: A large proportion of AI studies do not fulfill key items in CLAIM guidelines within their methods and results sections. The percentage of CLAIM checklist fulfillment is weakly associated with improved AI model performance. Additions or supplementations to CLAIM are recommended to improve publishing standards and aid reviewers in determining study rigor.
引用
收藏
页码:134 / 145
页数:12
相关论文
共 50 条
  • [31] Current challenges of implementing artificial intelligence in medical imaging
    Saw, Shier Nee
    Ng, Kwan Hoong
    PHYSICA MEDICA-EUROPEAN JOURNAL OF MEDICAL PHYSICS, 2022, 100 : 12 - 17
  • [32] Challenges in the Use of Artificial Intelligence for Prostate Cancer Diagnosis from Multiparametric Imaging Data
    Corradini, Daniele
    Brizi, Leonardo
    Gaudiano, Caterina
    Bianchi, Lorenzo
    Marcelli, Emanuela
    Golfieri, Rita
    Schiavina, Riccardo
    Testa, Claudia
    Remondini, Daniel
    CANCERS, 2021, 13 (16)
  • [33] Artificial intelligence-based algorithms for the diagnosis of prostate cancer: A systematic review
    Marletta, Stefano
    Eccher, Albino
    Martelli, Filippo Maria
    Santonicco, Nicola
    Girolami, Ilaria
    Scarpa, Aldo
    Pagni, Fabio
    L'Imperio, Vincenzo
    Pantanowitz, Liron
    Gobbo, Stefano
    Seminati, Davide
    Dei Tos, Angelo Paolo
    Parwani, Anil
    AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 2024, 161 (06) : 526 - 534
  • [34] Methods for Clinical Evaluation of Artificial Intelligence Algorithms for Medical Diagnosis
    Park, Seong Ho
    Han, Kyunghwa
    Jang, Hye Young
    Park, Ji Eun
    Lee, June-Goo
    Kim, Dong Wook
    Choi, Jaesoon
    RADIOLOGY, 2023, 306 (01) : 20 - 31
  • [35] The Evidence for Using Artificial Intelligence to Enhance Prostate Cancer MR Imaging
    Rodrigo Canellas
    Marc D. Kohli
    Antonio C. Westphalen
    Current Oncology Reports, 2023, 25 : 243 - 250
  • [36] Medical imaging-based artificial intelligence in pneumonia: A narrative review
    Yang, Yanping
    Xing, Wenyu
    Liu, Yiwen
    Li, Yifang
    Ta, Dean
    Song, Yuanlin
    Hou, Dongni
    NEUROCOMPUTING, 2025, 630
  • [37] Artificial intelligence in tumor subregion analysis based on medical imaging: A review
    Lin, Mingquan
    Wynne, Jacob F.
    Zhou, Boran
    Wang, Tonghe
    Lei, Yang
    Curran, Walter J.
    Liu, Tian
    Yang, Xiaofeng
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2021, 22 (07): : 10 - 26
  • [38] Applications of Artificial Intelligence Based on Medical Imaging in Glioma: Current State and Future Challenges
    Xu, Jiaona
    Meng, Yuting
    Qiu, Kefan
    Topatana, Win
    Li, Shijie
    Wei, Chao
    Chen, Tianwen
    Chen, Mingyu
    Ding, Zhongxiang
    Niu, Guozhong
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [39] Application of Artificial Intelligence Algorithms to Estimate the Success Rate in Medically Assisted Procreation
    de Guimaraes, Beatriz Bras
    Martins, Leonardo
    Metello, Jose Luis
    Ferreira, Fernando Luis
    Ferreira, Pedro
    Fonseca, Jose Manuel
    REPRODUCTIVE MEDICINE, 2020, 1 (03): : 181 - 194
  • [40] Artificial intelligence-based graded training of pulmonary nodules for junior radiology residents and medical imaging students
    Lyu, Xiaohong
    Dong, Liang
    Fan, Zhongkai
    Sun, Yu
    Zhang, Xianglin
    Liu, Ning
    Wang, Dongdong
    BMC MEDICAL EDUCATION, 2024, 24 (01)