Discovery of a Generalization Gap of Convolutional Neural Networks on COVID-19 X-Rays Classification

被引:0
作者
Ben Ahmed, Kaoutar [1 ]
Goldgof, Gregory M. [2 ]
Paul, Rahul [3 ,4 ]
Goldgof, Dmitry B. [1 ]
Hall, Lawrence O. [1 ]
机构
[1] Univ S Florida, Dept Comp Sci & Engn, Tampa, FL 33620 USA
[2] Univ Calif San Francisco, Dept Lab Med, San Francisco, CA 94143 USA
[3] Massachusetts Gen Hosp, Dept Radiat Oncol, Boston, MA 02115 USA
[4] Harvard Med Sch, Dept Radiat Oncol, Boston, MA 02115 USA
关键词
COVID-19; Biomedical imaging; X-rays; Pulmonary diseases; X-ray imaging; Data models; Medical diagnostic imaging; Coronavirus (COVID-19); pneumonia; chest X-ray images; deep learning; confounder; CHEST; IMAGES; IDENTIFICATION;
D O I
10.1109/ACCESS.2021.3079716
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A number of recent papers have shown experimental evidence that suggests it is possible to build highly accurate deep neural network models to detect COVID-19 from chest X-ray images. In this paper, we show that good generalization to unseen sources has not been achieved. Experiments with richer data sets than have previously been used show models have high accuracy on seen sources, but poor accuracy on unseen sources. The reason for the disparity is that the convolutional neural network model, which learns features, can focus on differences in X-ray machines or in positioning within the machines, for example. Any feature that a person would clearly rule out is called a confounding feature. Some of the models were trained on COVID-19 image data taken from publications, which may be different than raw images. Some data sets were of pediatric cases with pneumonia where COVID-19 chest X-rays are almost exclusively from adults, so lung size becomes a spurious feature that can be exploited. In this work, we have eliminated many confounding features by working with as close to raw data as possible. Still, deep learned models may leverage source specific confounders to differentiate COVID-19 from pneumonia preventing generalizing to new data sources (i.e. external sites). Our models have achieved an AUC of 1.00 on seen data sources but in the worst case only scored an AUC of 0.38 on unseen ones. This indicates that such models need further assessment/development before they can be broadly clinically deployed. An example of fine-tuning to improve performance at a new site is given.
引用
收藏
页码:72970 / 72979
页数:10
相关论文
共 50 条
  • [11] A novel multi-scale based deep convolutional neural network for detecting COVID-19 from X-rays
    Karnati, Mohan
    Seal, Ayan
    Sahu, Geet
    Yazidi, Anis
    Krejcar, Ondrej
    APPLIED SOFT COMPUTING, 2022, 125
  • [12] Deep GRU-CNN Model for COVID-19 Detection From Chest X-Rays Data
    Shah, Pir Masoom
    Ullah, Faizan
    Shah, Dilawar
    Gani, Abdullah
    Maple, Carsten
    Wang, Yulin
    Shahid
    Abrar, Mohammad
    Ul Islam, Saif
    IEEE ACCESS, 2022, 10 : 35094 - 35105
  • [13] A Convolutional Neural Network for COVID-19 Diagnosis: An Analysis of Coronavirus Infections through Chest X-rays
    Mehta, Avani Kirit
    Swarnalatha, R.
    Subramoniam, M.
    Salunkhe, Sachin
    ELECTRONICS, 2022, 11 (23)
  • [14] COVID-19 detection in X-ray images using convolutional neural networks
    Arias-Garzon, Daniel
    Alzate-Grisales, Jesus Alejandro
    Orozco-Arias, Simon
    Arteaga-Arteaga, Harold Brayan
    Bravo-Ortiz, Mario Alejandro
    Mora-Rubio, Alejandro
    Saborit-Torres, Jose Manuel
    Serrano, Joaquim aengel Montell
    Vaya, Maria de la Iglesia
    Cardona-Morales, Oscar
    Tabares-Soto, Reinel
    MACHINE LEARNING WITH APPLICATIONS, 2021, 6
  • [15] COVID-19 Identification from Chest X-Rays
    Mporas, Iosif
    Naronglerdrit, Prasitthichai
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON BIOMEDICAL INNOVATIONS AND APPLICATIONS (BIA 2020), 2020, : 69 - 72
  • [16] CDC_Net: multi-classification convolutional neural network model for detection of COVID-19, pneumothorax, pneumonia, lung Cancer, and tuberculosis using chest X-rays
    Malik, Hassaan
    Anees, Tayyaba
    Din, Muizzud
    Naeem, Ahmad
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (09) : 13855 - 13880
  • [17] CDC_Net: multi-classification convolutional neural network model for detection of COVID-19, pneumothorax, pneumonia, lung Cancer, and tuberculosis using chest X-rays
    Hassaan Malik
    Tayyaba Anees
    Muizzud Din
    Ahmad Naeem
    Multimedia Tools and Applications, 2023, 82 : 13855 - 13880
  • [18] An intelligent deep convolutional network based COVID-19 detection from chest X-rays
    Alshahrni, Mohammad M.
    Ahmad, Mostafa A.
    Abdullah, Monir
    Omer, Nadir
    Aziz, Muzzamil
    ALEXANDRIA ENGINEERING JOURNAL, 2023, 64 : 399 - 417
  • [19] Investigating the Performance of FixMatch for COVID-19 Detection in Chest X-rays
    Sajun, Ali Reza
    Zualkernan, Imran
    Sankalpa, Donthi
    APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [20] Pneumonia and COVID-19 Detection in Chest X-rays Using Faster Region-Based Convolutional Neural Networks (Faster R-CNN)
    Farhat, Hanan J.
    Sakr, George E.
    Kilany, Rima
    2022 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI) JOINTLY ORGANISED WITH THE IEEE-EMBS INTERNATIONAL CONFERENCE ON WEARABLE AND IMPLANTABLE BODY SENSOR NETWORKS (BSN'22), 2022,