Improving the performance of CNN to predict the likelihood of COVID-19 using chest X-ray images with preprocessing algorithms

被引:236
作者
Heidari, Morteza [1 ]
Mirniaharikandehei, Seyedehnafiseh [1 ]
Khuzani, Abolfazl Zargari [2 ]
Danala, Gopichandh [1 ]
Qiu, Yuchen [1 ]
Zheng, Bin [1 ]
机构
[1] Univ Oklahoma, Sch Elect & Comp Engn, Norman, OK 73019 USA
[2] Univ Calif Santa Cruz, Dept Elect & Comp Engn, Santa Cruz, CA 95064 USA
关键词
Coronavirus; Convolution neural network (CNN); Disease classification; VGG16; network; COVID-19; diagnosis; Computer-aided diagnosis;
D O I
10.1016/j.ijmedinf.2020.104284
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: This study aims to develop and test a new computer-aided diagnosis (CAD) scheme of chest X-ray images to detect coronavirus (COVID-19) infected pneumonia. Method: CAD scheme first applies two image preprocessing steps to remove the majority of diaphragm regions, process the original image using a histogram equalization algorithm, and a bilateral low-pass filter. Then, the original image and two filtered images are used to form a pseudo color image. This image is fed into three input channels of a transfer learning-based convolutional neural network (CNN) model to classify chest X-ray images into 3 classes of COVID-19 infected pneumonia, other community-acquired no-COVID-19 infected pneumonia, and normal (non-pneumonia) cases. To build and test the CNN model, a publicly available dataset involving 8474 chest X-ray images is used, which includes 415, 5179 and 2,880 cases in three classes, respectively. Dataset is randomly divided into 3 subsets namely, training, validation, and testing with respect to the same frequency of cases in each class to train and test the CNN model. Results: The CNN-based CAD scheme yields an overall accuracy of 94.5 % (2404/2544) with a 95 % confidence interval of [0.93,0.96] in classifying 3 classes. CAD also yields 98.4 % sensitivity (124/126) and 98.0 % specificity (2371/2418) in classifying cases with and without COVID-19 infection. However, without using two preprocessing steps, CAD yields a lower classification accuracy of 88.0 % (2239/2544). Conclusion: This study demonstrates that adding two image preprocessing steps and generating a pseudo color image plays an important role in developing a deep learning CAD scheme of chest X-ray images to improve accuracy in detecting COVID-19 infected pneumonia.
引用
收藏
页数:9
相关论文
共 30 条
[11]   Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning [J].
Kermany, Daniel S. ;
Goldbaum, Michael ;
Cai, Wenjia ;
Valentim, Carolina C. S. ;
Liang, Huiying ;
Baxter, Sally L. ;
McKeown, Alex ;
Yang, Ge ;
Wu, Xiaokang ;
Yan, Fangbing ;
Dong, Justin ;
Prasadha, Made K. ;
Pei, Jacqueline ;
Ting, Magdalena ;
Zhu, Jie ;
Li, Christina ;
Hewett, Sierra ;
Dong, Jason ;
Ziyar, Ian ;
Shi, Alexander ;
Zhang, Runze ;
Zheng, Lianghong ;
Hou, Rui ;
Shi, William ;
Fu, Xin ;
Duan, Yaou ;
Huu, Viet A. N. ;
Wen, Cindy ;
Zhang, Edward D. ;
Zhang, Charlotte L. ;
Li, Oulan ;
Wang, Xiaobo ;
Singer, Michael A. ;
Sun, Xiaodong ;
Xu, Jie ;
Tafreshi, Ali ;
Lewis, M. Anthony ;
Xia, Huimin ;
Zhang, Kang .
CELL, 2018, 172 (05) :1122-+
[12]   CoroNet: A deep neural network for detection and diagnosis of COVID-19 from chest x-ray images [J].
Khan, Asif Iqbal ;
Shah, Junaid Latief ;
Bhat, Mohammad Mudasir .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 196 (196)
[13]  
KING G, 2001, POLIT ANAL, V0009
[14]  
Kingma J., 2014, Adam: A method for stochastic optimization
[15]   Clinical and computed tomographic (CT) images characteristics in the patients with COVID-19 infection: What should radiologists need to know? [J].
Lei P. ;
Huang Z. ;
Liu G. ;
Wang P. ;
Song W. ;
Mao J. ;
Shen G. ;
Zhou S. ;
Qian W. ;
Jiao J. .
Journal of X-Ray Science and Technology, 2020, 28 (03) :369-381
[16]  
Linda W., 2020, ARXIV200309871
[17]   Interrater reliability: the kappa statistic [J].
McHugh, Mary L. .
BIOCHEMIA MEDICA, 2012, 22 (03) :276-282
[18]  
Morrison P., 2020, ARXIV PREPRINT ARXIV
[19]  
Nair Vinod, 2010, ICML, DOI DOI 10.5555/3104322.3104425
[20]  
Narin A, 2020, ARXIV PREPRINT ARXIV