Convolutional neural network performance compared to radiologists in detecting intracranial hemorrhage from brain computed tomography: A systematic review and meta-analysis

Cited by: 21
Authors
Jorgensen, Mia Daugaard [1]
Antulov, Ronald [2,3]
Hess, Soren [2,3]
Lysdahlgaard, Simon [2,3]
Affiliations
[1] Univ Copenhagen, Fac Hlth & Med Sci, Copenhagen, Denmark
[2] Univ Hosp Southern Denmark, Hosp South West Jutland, Dept Radiol & Nucl Med, Esbjerg, Denmark
[3] Univ Southern Denmark, Fac Hlth Sci, Dept Reg Hlth Res, Odense, Denmark
Keywords
Computed tomography; Intracranial hemorrhage; Artificial intelligence; Systematic review; Meta-analysis; Deep-learning algorithm; Accuracy
DOI
10.1016/j.ejrad.2021.110073
Chinese Library Classification
R8 [Special medicine]; R445 [Diagnostic imaging]
Discipline classification codes
1002; 100207; 1009
Abstract
Purpose: To compare the diagnostic accuracy of convolutional neural networks (CNNs) with that of radiologists, used as the reference standard, in diagnosing intracranial hemorrhage (ICH) on non-contrast computed tomography of the cerebrum (NCTC). Methods: PubMed, Embase, Scopus, and Web of Science were searched for the period from 1 January 2012 to 20 July 2020. Eligible studies included patients with and without ICH as the target condition undergoing NCTC, used deep learning algorithms based on CNNs, and had radiologists' reports as the minimum reference standard. Pooled sensitivities, pooled specificities, and a summary receiver operating characteristic (SROC) curve were used for meta-analysis. Results: Database searching identified 5,119 records. Title screening left 47 studies for full-text assessment and 6 studies for meta-analysis. Comparing CNN performance with the reference standards in the retrospective studies gave a pooled sensitivity of 96.00% (95% CI: 93.00% to 97.00%), a pooled specificity of 97.00% (95% CI: 90.00% to 99.00%), and an SROC of 98.00% (95% CI: 97.00% to 99.00%). Pooling the retrospective studies with the studies using external datasets gave a pooled sensitivity of 95.00% (95% CI: 91.00% to 97.00%), a pooled specificity of 96.00% (95% CI: 91.00% to 98.00%), and a pooled SROC of 98.00% (95% CI: 97.00% to 99.00%). Conclusion: This review found the diagnostic performance of CNNs to be equivalent to that of radiologists in retrospective studies. When out-of-sample external validation studies were pooled with the retrospective studies, CNN performance was slightly worse. There is a critical need for studies with a robust reference standard and external dataset validation.
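To make the idea of a "pooled sensitivity" concrete, the following is a minimal sketch of one simple pooling approach: fixed-effect, inverse-variance weighting on the logit scale. This is not necessarily the model used in the review (diagnostic accuracy meta-analyses often use bivariate or hierarchical models instead), and the per-study counts below are hypothetical values chosen only for illustration; they are not data from the included studies. The function name `pool_logit` is likewise an assumption.

```python
import math


def pool_logit(events, totals, z=1.96):
    """Pool proportions with fixed-effect inverse-variance weights on the logit scale.

    events[i] / totals[i] is the per-study proportion, e.g. true positives
    over all diseased cases when pooling sensitivity.
    Returns (pooled estimate, lower CI bound, upper CI bound).
    """
    logits, weights = [], []
    for e, n in zip(events, totals):
        # A 0.5 continuity correction keeps the logit finite if e == 0 or e == n.
        e_c = e + 0.5
        n_c = n + 1.0
        p = e_c / n_c
        logits.append(math.log(p / (1.0 - p)))
        # Approximate variance of the logit of a proportion.
        variance = 1.0 / e_c + 1.0 / (n_c - e_c)
        weights.append(1.0 / variance)

    total_weight = sum(weights)
    pooled_logit = sum(w * l for w, l in zip(weights, logits)) / total_weight
    standard_error = math.sqrt(1.0 / total_weight)

    def inverse_logit(x):
        return 1.0 / (1.0 + math.exp(-x))

    return (
        inverse_logit(pooled_logit),
        inverse_logit(pooled_logit - z * standard_error),
        inverse_logit(pooled_logit + z * standard_error),
    )


# Hypothetical counts for three studies (NOT taken from the review).
true_positives = [90, 45, 180]
diseased_cases = [100, 50, 200]

sens, sens_lo, sens_hi = pool_logit(true_positives, diseased_cases)
print(f"Pooled sensitivity: {sens:.3f} (95% CI: {sens_lo:.3f} to {sens_hi:.3f})")
```

Pooled specificity can be sketched the same way by passing true negatives and the number of non-diseased cases. Working on the logit scale keeps the confidence interval inside 0 to 1, which matters when proportions sit close to 100% as they do here.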
Pages: 8