The Low Rate of Adherence to Checklist for Artificial Intelligence in Medical Imaging Criteria Among Published Prostate MRI Artificial Intelligence Algorithms

Times cited: 13
Authors
Belue, Mason J. [1]
Harmon, Stephanie A. [1]
Lay, Nathan S. [1]
Daryanani, Asha [1]
Phelps, Tim E. [1]
Choyke, Peter L. [1]
Turkbey, Baris [1, 2]
Affiliations
[1] NCI, Artificial Intelligence Resource, Mol Imaging Branch, NIH, Bethesda, MD USA
[2] NCI, Artificial Intelligence Resource, Mol Imaging Branch, 10 Ctr Dr,MSC 1182,Bldg 10,Room B3B85, Bethesda, MD 20892 USA
Funding
National Institutes of Health (USA)
Keywords
AI; CLAIM; classification; detection; prostate cancer; study rigor; CANCER DETECTION; CLINICALLY SIGNIFICANT; GLEASON SCORE; RADIOMICS; CLASSIFICATION; VALIDATION; DIAGNOSIS; FEATURES; IMAGES
DOI
10.1016/j.jacr.2022.05.022
Chinese Library Classification (CLC) number
R8 [Special Medicine]; R445 [Diagnostic Imaging]
Discipline classification code
1002 ; 100207 ; 1009 ;
Abstract
Objective: To determine the rigor, generalizability, and reproducibility of published classification and detection artificial intelligence (AI) models for prostate cancer (PCa) on MRI using the Checklist for Artificial Intelligence in Medical Imaging (CLAIM) guidelines, a 42-item checklist that is considered a measure of best practice for presenting and reviewing medical imaging AI research.
Materials and methods: This review searched the English-language literature for studies proposing PCa AI detection and classification models on MRI. Each study was evaluated with the CLAIM checklist. The additional outcomes for which data were sought included measures of AI model performance (eg, area under the curve [AUC], sensitivity, specificity, free-response operating characteristic curves), training, validation, and testing group sample sizes, AI approach, detection versus classification AI, public dataset utilization, MRI sequences used, and definition of the gold standard for ground truth. The percentage of CLAIM checklist fulfillment was used to stratify studies into quartiles. Wilcoxon's rank-sum test was used for pairwise comparisons.
Results: In all, 75 studies were identified, and 53 studies qualified for analysis. The original CLAIM items that most studies did not fulfill include item 12 (77% no): de-identification methods; item 13 (68% no): handling missing data; item 15 (47% no): rationale for choosing ground truth reference standard; item 18 (55% no): measurements of inter- and intrareader variability; item 31 (60% no): inclusion of validated interpretability maps; and item 37 (92% no): inclusion of failure analysis to elucidate AI model weaknesses. Comparing AUC scores across quartiles of percentage CLAIM fulfillment revealed a significant difference in mean AUC between quartile 1 and quartile 2 (0.78 versus 0.86, P = .034) and between quartile 1 and quartile 4 (0.78 versus 0.89, P = .003). Based on additional information and outcome metrics gathered in this study, additional measures of best practice are defined. These new items include disclosure of public dataset usage, ground truth definition in comparison to other referenced works in the defined task, and sample size power calculation.
Conclusion: A large proportion of AI studies do not fulfill key items in CLAIM guidelines within their methods and results sections. The percentage of CLAIM checklist fulfillment is weakly associated with improved AI model performance. Additions or supplementations to CLAIM are recommended to improve publishing standards and aid reviewers in determining study rigor.
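The analysis described in the abstract (stratifying studies into quartiles by percentage CLAIM fulfillment, then comparing AUC between quartiles with Wilcoxon's rank-sum test) can be illustrated with a minimal sketch. This is not the authors' code. The data values, the function names (assign_quartile, average_ranks, rank_sum_test), the choice of quartile cut points, and the use of a normal approximation without tie correction are all assumptions made for illustration only.

```python
import math
from statistics import mean, quantiles

# Hypothetical (CLAIM fulfillment %, AUC) pairs; NOT data from the paper.
studies = [
    (45, 0.74), (50, 0.79), (52, 0.77), (58, 0.83),
    (60, 0.86), (63, 0.85), (66, 0.84), (70, 0.88),
    (72, 0.87), (78, 0.90), (81, 0.89), (85, 0.91),
]

def assign_quartile(value, cuts):
    """Return quartile 1-4 for value, given three ascending cut points."""
    return 1 + sum(value > c for c in cuts)

def average_ranks(values):
    """Rank values from 1..n; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of 1-based positions i+1..j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def rank_sum_test(x, y):
    """Two-sided Wilcoxon rank-sum test via the normal approximation.

    Assumption: no tie or continuity correction; the approximation is
    rough for very small groups such as the ones in this toy example.
    """
    n1, n2 = len(x), len(y)
    ranks = average_ranks(list(x) + list(y))
    w = sum(ranks[:n1])                      # rank sum of the first group
    mu = n1 * (n1 + n2 + 1) / 2              # expected rank sum under H0
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (w - mu) / sigma
    p = math.erfc(abs(z) / math.sqrt(2))     # two-sided p-value
    return z, p

# Stratify studies into quartiles of CLAIM fulfillment.
cuts = quantiles([pct for pct, _ in studies], n=4)
by_quartile = {q: [] for q in (1, 2, 3, 4)}
for pct, auc in studies:
    by_quartile[assign_quartile(pct, cuts)].append(auc)

# Pairwise comparison of quartile 1 against quartile 4.
z, p = rank_sum_test(by_quartile[1], by_quartile[4])
print(f"mean AUC Q1={mean(by_quartile[1]):.2f}, Q4={mean(by_quartile[4]):.2f}")
print(f"z={z:.3f}, p={p:.4f}")
```

With real data, an exact or tie-corrected test from an established statistics package would usually be preferable to this hand-rolled approximation.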
Pages: 134-145
Number of pages: 12