Transparency in Artificial Intelligence Research: a Systematic Review of Availability Items Related to Open Science in Radiology and Nuclear Medicine

被引:12
作者
Kocak, Burak [1 ]
Yardimci, Aytul Hande [1 ]
Yuzkan, Sabahattin [1 ]
Keles, Ali [1 ]
Altun, Omer
Bulut, Elif [1 ]
Bayrak, Osman Nuri [1 ]
Okumus, Ahmet Arda [1 ]
机构
[1] Univ Hlth Sci, Basaksehir Cam & Sakura City Hosp, Dept Radiol, TR-34480 Istanbul, Turkiye
关键词
Artificial intelligence; Machine learning; Reproducibility; Open science; Code availability; REPRODUCIBILITY;
D O I
10.1016/j.acra.2022.11.030
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Rationale and objectives: Reproducibility of artificial intelligence (AI) research has become a growing concern. One of the fundamental reasons is the lack of transparency in data, code, and model. In this work, we aimed to systematically review the radiology and nuclear medicine papers on AI in terms of transparency and open science.Materials and Methods: A systematic literature search was performed in PubMed to identify original research studies on AI. The search was restricted to studies published in Q1 and Q2 journals that are also indexed on the Web of Science. A random sampling of the literature was performed. Besides six baseline study characteristics, a total of five availability items were evaluated. Two groups of independent readers including eight readers participated in the study. Inter-rater agreement was analyzed. Disagreements were resolved with consensus.Results: Following eligibility criteria, we included a final set of 194 papers. The raw data was available in about one-fifth of the papers (34/ 194; 18%). However, the authors made their private data available only in one paper (1/161; 1%). About one-tenth of the papers made their pre-modeling (25/194; 13%), modeling (28/194; 14%), or post-modeling files (15/194; 8%) available. Most of the papers (189/194; 97%) did not attempt to create a ready-to-use system for real-world usage. Data origin, use of deep learning, and external validation had statistically significantly different distributions. The use of private data alone was negatively associated with the availability of at least one item (p<0.001).Conclusion: Overall rates of availability for items were poor, leaving room for substantial improvement.
引用
收藏
页码:2254 / 2266
页数:13
相关论文
共 44 条
[1]   Global Evolution of Research in Artificial Intelligence in Health and Medicine: A Bibliometric Study [J].
Bach Xuan Tran ;
Giang Thu Vu ;
Giang Hai Ha ;
Quan-Hoang Vuong ;
Manh-Tung Ho ;
Thu-Trang Vuong ;
Viet-Phuong La ;
Manh-Toan Ho ;
Nghiem, Kien-Cuong P. ;
Huong Lan Thi Nguyen ;
Latkin, Carl A. ;
Tam, Wilson W. S. ;
Cheung, Ngai-Man ;
Nguyen, Hong-Kong T. ;
Ho, Cyrus S. H. ;
Ho, Roger C. M. .
JOURNAL OF CLINICAL MEDICINE, 2019, 8 (03)
[2]  
Baker M, 2016, NATURE, V533, P452, DOI 10.1038/533452a
[3]   Artificial intelligence and machine learning for medical imaging: A technology review [J].
Barragan-Montero, Ana ;
Javaid, Umair ;
Valdes, Gilmer ;
Nguyen, Dan ;
Desbordes, Paul ;
Macq, Benoit ;
Willems, Siri ;
Vandewinckele, Liesbeth ;
Holmstrom, Mats ;
Lofman, Fredrik ;
Michiels, Steven ;
Souris, Kevin ;
Sterpin, Edmond ;
Lee, John A. .
PHYSICA MEDICA-EUROPEAN JOURNAL OF MEDICAL PHYSICS, 2021, 83 :242-256
[4]   Reproducibility in Science Improving the Standard for Basic and Preclinical Research [J].
Begley, C. Glenn ;
Ioannidis, John P. A. .
CIRCULATION RESEARCH, 2015, 116 (01) :116-126
[5]   Assessing Radiology Research on Artificial Intelligence: A Brief Guide for Authors, Reviewers, and Readers-From the Radiology Editorial Board [J].
Bluemke, David A. ;
Moy, Linda ;
Bredella, Miriam A. ;
Ertl-Wagner, Birgit B. ;
Fowler, Kathryn J. ;
Goh, Vicky J. ;
Halpern, Elkan F. ;
Hess, Christopher P. ;
Schiebler, Mark L. ;
Weiss, Clifford R. .
RADIOLOGY, 2020, 294 (03) :487-489
[6]   Statistical modeling: The two cultures [J].
Breiman, L .
STATISTICAL SCIENCE, 2001, 16 (03) :199-215
[7]   Power failure: why small sample size undermines the reliability of neuroscience [J].
Button, Katherine S. ;
Ioannidis, John P. A. ;
Mokrysz, Claire ;
Nosek, Brian A. ;
Flint, Jonathan ;
Robinson, Emma S. J. ;
Munafo, Marcus R. .
NATURE REVIEWS NEUROSCIENCE, 2013, 14 (05) :365-376
[8]   The TRUE Checklist for Identifying Impactful Artificial Intelligence-Based Findings in Nuclear Medicine: Is It True? Is It Reproducible? Is It Useful? Is It Explainable? [J].
Buvat, Irene ;
Orlhac, Fanny .
JOURNAL OF NUCLEAR MEDICINE, 2021, 62 (06) :752-754
[9]   Pragmatic considerations for fostering reproducible research in artificial intelligence [J].
Carter, Rickey E. ;
Attia, Zachi I. ;
Lopez-Jimenez, Francisco ;
Friedman, Paul A. .
NPJ DIGITAL MEDICINE, 2019, 2 (1)
[10]   A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models [J].
Christodoulou, Evangelia ;
Ma, Jie ;
Collins, Gary S. ;
Steyerberg, Ewout W. ;
Verbakel, Jan Y. ;
Van Calster, Ben .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2019, 110 :12-22