Factors determining generalization in deep learning models for scoring COVID-CT images

被引:5
作者
Horry, Michael James [1 ]
Chakraborty, Subrata [1 ]
Pradhan, Biswajeet [1 ,2 ,3 ]
Fallahpoor, Maryam [1 ]
Chegeni, Hossein [4 ]
Paul, Manoranjan [5 ]
机构
[1] Univ Technol Sydney, Fac Engn & Informat Technol, Ctr Adv Modelling & Geospatial Informat Syst CAMG, Sydney, NSW, Australia
[2] King Abdulaziz Univ, Ctr Excellence Climate Change Res, Jeddah 21589, Saudi Arabia
[3] Univ Kebangsaan Malaysia, Earth Observat Ctr, Inst Climate Change, Bangi 43600, Selangor, Malaysia
[4] IranMehr Gen Hosp, Fellowship Intervent Radiol Imaging Ctr, Tehran, Iran
[5] Charles Sturt Univ, Sch Comp Math & Engn, Machine Vis & Digital Hlth MaViDH, Bathurst, NSW, Australia
关键词
COVID-19; scoring; computed tomography; deep learning; external validation; model; DETECTING COVID-19; X-RAYS; CLASSIFICATION; LOCALIZATION; PNEUMONIA;
D O I
10.3934/mbe.2021456
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The COVID-19 pandemic has inspired unprecedented data collection and computer vision modelling efforts worldwide, focused on the diagnosis of COVID-19 from medical images. However, these models have found limited, if any, clinical application due in part to unproven generalization to data sets beyond their source training corpus. This study investigates the generalizability of deep learning models using publicly available COVID-19 Computed Tomography data through cross dataset validation. The predictive ability of these models for COVID-19 severity is assessed using an independent dataset that is stratified for COVID-19 lung involvement. Each inter-dataset study is performed using histogram equalization, and contrast limited adaptive histogram equalization with and without a learning Gabor filter. We show that under certain conditions, deep learning models can generalize well to an external dataset with F1 scores up to 86%. The best performing model shows predictive accuracy of between 75% and 96% for lung involvement scoring against an external expertly stratified dataset. From these results we identify key factors promoting deep learning generalization, being primarily the uniform acquisition of training images, and secondly diversity in CT slice position.
引用
收藏
页码:9264 / 9293
页数:30
相关论文
共 82 条
  • [1] Correlation of Chest CT and RT-PCR Testing for Coronavirus Disease 2019 (COVID-19) in China: A Report of 1014 Cases
    Ai, Tao
    Yang, Zhenlu
    Hou, Hongyan
    Zhan, Chenao
    Chen, Chong
    Lv, Wenzhi
    Tao, Qian
    Sun, Ziyong
    Xia, Liming
    [J]. RADIOLOGY, 2020, 296 (02) : E32 - E40
  • [2] A novel framework for rapid diagnosis of COVID-19 on computed tomography scans
    Akram, Tallha
    Attique, Muhammad
    Gul, Salma
    Shahzad, Aamir
    Altaf, Muhammad
    Naqvi, S. Syed Rameez
    Damasevicius, Robertas
    Maskeliunas, Rytis
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (03) : 951 - 964
  • [3] An innovative technique for contrast enhancement of computed tomography images using normalized gamma-corrected contrast-limited adaptive histogram equalization
    Al-Ameen, Zohair
    Sulong, Ghazali
    Rehman, Amjad
    Al-Dhelaan, Abdullah
    Saba, Tanzila
    Al-Rodhaan, Mznah
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015, : 1 - 12
  • [4] Alekseev A., ARXIV190413204
  • [5] Deep Learning Approaches for Detecting COVID-19 From Chest X-Ray Images: A Survey
    Alghamdi, Hanan S.
    Amoudi, Ghada
    Elhag, Salma
    Saeedi, Kawther
    Nasser, Jomanah
    [J]. IEEE ACCESS, 2021, 9 (09): : 20235 - 20254
  • [6] A Novel Approach for Multi-Label Chest X-Ray Classification of Common Thorax Diseases
    Allaouzi, Imane
    Ben Ahmed, Mohamed
    [J]. IEEE ACCESS, 2019, 7 : 64279 - 64288
  • [7] Amin H., 2021, DIGITAL TRANSFORMATI, P111
  • [8] [Anonymous], MEDRXIV
  • [9] [Anonymous], bioRxiv
  • [10] Bassi P. R., ARXIV210406176