Self-reporting with checklists in artificial intelligence research on medical imaging: a systematic review based on citations of CLAIM

被引:7
作者
Kocak, Burak [1 ]
Keles, Ali [1 ]
Akinci D'Antonoli, Tugba [2 ]
机构
[1] Univ Hlth Sci, Basaksehir Cam & Sakura City Hosp, Dept Radiol, Istanbul, Turkiye
[2] Cantonal Hosp Baselland, Inst Radiol & Nucl Med, Liestal, Switzerland
关键词
Artificial intelligence; Machine learning; Systematic review; Quality of reporting; Guideline; CONSORT STATEMENT; RADIOMICS; QUALITY; TRIALS;
D O I
10.1007/s00330-023-10243-9
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
ObjectiveTo evaluate the usage of a well-known and widely adopted checklist, Checklist for Artificial Intelligence in Medical imaging (CLAIM), for self-reporting through a systematic analysis of its citations.MethodsGoogle Scholar, Web of Science, and Scopus were used to search for citations (date, 29 April 2023). CLAIM's use for self-reporting with proof (i.e., filled-out checklist) and other potential use cases were systematically assessed in research papers. Eligible papers were evaluated independently by two readers, with the help of automatic annotation. Item-by-item confirmation analysis on papers with checklist proof was subsequently performed.ResultsA total of 391 unique citations were identified from three databases. Of the 118 papers included in this study, 12 (10%) provided a proof of self-reported CLAIM checklist. More than half (70; 59%) only mentioned some sort of adherence to CLAIM without providing any proof in the form of a checklist. Approximately one-third (36; 31%) cited the CLAIM for reasons unrelated to their reporting or methodological adherence. Overall, the claims on 57 to 93% of the items per publication were confirmed in the item-by-item analysis, with a mean and standard deviation of 81% and 10%, respectively.ConclusionOnly a small proportion of the publications used CLAIM as checklist and supplied filled-out documentation; however, the self-reported checklists may contain errors and should be approached cautiously. We hope that this systematic citation analysis would motivate artificial intelligence community about the importance of proper self-reporting, and encourage researchers, journals, editors, and reviewers to take action to ensure the proper usage of checklists.Clinical relevance statementOnly a small percentage of the publications used CLAIM for self-reporting with proof (i.e., filled-out checklist). However, the filled-out checklist proofs may contain errors, e.g., false claims of adherence, and should be approached cautiously. These may indicate inappropriate usage of checklists and necessitate further action by authorities.Key Points & BULL; Of 118 eligible papers, only 12 (10%) followed the CLAIM checklist for self-reporting with proof (i.e., filled-out checklist). More than half (70; 59%) only mentioned some kind of adherence without providing any proof.& BULL; Overall, claims on 57 to 93% of the items were valid in item-by-item confirmation analysis, with a mean and standard deviation of 81% and 10%, respectively.& BULL; Even with the checklist proof, the items declared may contain errors and should be approached cautiously.Key Points & BULL; Of 118 eligible papers, only 12 (10%) followed the CLAIM checklist for self-reporting with proof (i.e., filled-out checklist). More than half (70; 59%) only mentioned some kind of adherence without providing any proof.& BULL; Overall, claims on 57 to 93% of the items were valid in item-by-item confirmation analysis, with a mean and standard deviation of 81% and 10%, respectively.& BULL; Even with the checklist proof, the items declared may contain errors and should be approached cautiously.Key Points & BULL; Of 118 eligible papers, only 12 (10%) followed the CLAIM checklist for self-reporting with proof (i.e., filled-out checklist). More than half (70; 59%) only mentioned some kind of adherence without providing any proof.& BULL; Overall, claims on 57 to 93% of the items were valid in item-by-item confirmation analysis, with a mean and standard deviation of 81% and 10%, respectively. & BULL; Even with the checklist proof, the items declared may contain errors and should be approached cautiously.
引用
收藏
页码:2805 / 2815
页数:11
相关论文
共 38 条
  • [1] Albiol A, 2022, INSIGHTS IMAGING, V13, DOI 10.1186/s13244-022-01250-3
  • [2] Altman DG, 1996, BRIT MED J, V313, P570
  • [3] Raise standards for preclinical cancer research
    Begley, C. Glenn
    Ellis, Lee M.
    [J]. NATURE, 2012, 483 (7391) : 531 - 533
  • [4] The Low Rate of Adherence to Checklist for Artificial Intelligence in Medical Imaging Criteria Among Published Prostate MRI Artificial Intelligence Algorithms
    Belue, Mason J.
    Harmon, Stephanie A.
    Lay, Nathan S.
    Daryanani, Asha
    Phelps, Tim E.
    Choyke, Peter L.
    Turkbey, Baris
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2023, 20 (02) : 134 - 145
  • [5] Assessment of artificial intelligence (AI) reporting methodology in glioma MRI studies using the Checklist for AI in Medical Imaging (CLAIM)
    Bhandari, Abhishta
    Scott, Luke
    Weilbach, Manuela
    Marwah, Ravi
    Lasocki, Arian
    [J]. NEURORADIOLOGY, 2023, 65 (05) : 907 - 913
  • [6] Are CONSORT checklists submitted by authors adequately reflecting what information is actually reported in published papers?
    Blanco, David
    Biggane, Alice M.
    Cobo, Erik
    [J]. TRIALS, 2018, 19
  • [7] Radiomics-Derived Brain Age Predicts Functional Outcome After Acute Ischemic Stroke
    Bretzner, Martin
    Bonkhoff, Anna K.
    Schirmer, Markus D.
    Hong, Sungmin
    Dalca, Adrian
    Donahue, Kathleen
    Giese, Anne-Katrin
    Etherton, Mark R.
    Rist, Pamela M.
    Nardin, Marco
    Regenhardt, Robert W.
    Leclerc, Xavier
    Lopes, Renaud
    Gautherot, Morgan
    Wang, Clinton
    Benavente, Oscar R.
    Cole, John W.
    Donatti, Amanda
    Griessenauer, Christoph
    Heitsch, Laura
    Holmegaard, Lukas
    Jood, Katarina
    Jimenez-Conde, Jordi
    Kittner, Steven J.
    Lemmens, Robin
    Levi, Christopher R.
    McArdle, Patrick F.
    McDonough, Caitrin W.
    Meschia, James F.
    Phuah, Chia-Ling
    Rolfs, Arndt
    Ropele, Stefan
    Rosand, Jonathan
    Roquer, Jaume
    Rundek, Tatjana
    Sacco, Ralph L.
    Schmidt, Reinhold
    Sharma, Pankaj
    Slowik, Agnieszka
    Sousa, Alessandro
    Stanne, Tara M.
    Strbian, Daniel
    Tatlisumak, Turgut
    Thijs, Vincent
    Vagal, Achala
    Wasselius, Johan
    Woo, Daniel
    Wu, Ona
    Zand, Ramin
    Worrall, Bradford B.
    [J]. NEUROLOGY, 2023, 100 (08) : E822 - E833
  • [8] Using a Reporting Guideline (Checklist)
    Cartledge, Peter Thomas
    Hopkinson, Dennis
    Nsanzabaganwa, Christian
    Bassat, Quique
    [J]. JOURNAL OF TROPICAL PEDIATRICS, 2019, 65 (06) : 521 - 525
  • [9] equator-network, LIB EQUATOR NETW
  • [10] Reducing waste from incomplete or unusable reports of biomedical research
    Glasziou, Paul
    Altman, Douglas G.
    Bossuyt, Patrick
    Boutron, Isabelle
    Clarke, Mike
    Julious, Steven
    Michie, Susan
    Moher, David
    Wager, Elizabeth
    [J]. LANCET, 2014, 383 (9913) : 267 - 276