Multi-centre benchmarking of deep learning models for COVID-19 detection in chest x-rays

被引:0
作者
Harkness, Rachael [1 ,2 ]
Frangi, Alejandro F. [3 ,4 ]
Zucker, Kieran [5 ]
Ravikumar, Nishant [1 ,2 ]
机构
[1] Univ Leeds, Sch Comp, Leeds, England
[2] Ctr Computat Imaging & Simulat Technol Biomed, Leeds, England
[3] Univ Manchester, Sch Hlth Sci, Div Informat Imaging & Data Sci, Manchester, England
[4] Univ Manchester, Sch Engn, Dept Comp Sci, Manchester, England
[5] Univ Leeds, Leeds Inst Med Res, Sch Med, Leeds, England
来源
FRONTIERS IN RADIOLOGY | 2024年 / 4卷
基金
英国工程与自然科学研究理事会;
关键词
deep learning; COVID-19; chest x-rays; artificial intelligence; benchmarking; NETWORK;
D O I
10.3389/fradi.2024.1386906
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Introduction This study is a retrospective evaluation of the performance of deep learning models that were developed for the detection of COVID-19 from chest x-rays, undertaken with the goal of assessing the suitability of such systems as clinical decision support tools.Methods Models were trained on the National COVID-19 Chest Imaging Database (NCCID), a UK-wide multi-centre dataset from 26 different NHS hospitals and evaluated on independent multi-national clinical datasets. The evaluation considers clinical and technical contributors to model error and potential model bias. Model predictions are examined for spurious feature correlations using techniques for explainable prediction.Results Models performed adequately on NHS populations, with performance comparable to radiologists, but generalised poorly to international populations. Models performed better in males than females, and performance varied across age groups. Alarmingly, models routinely failed when applied to complex clinical cases with confounding pathologies and when applied to radiologist defined "mild" cases.Discussion This comprehensive benchmarking study examines the pitfalls in current practices that have led to impractical model development. Key findings highlight the need for clinician involvement at all stages of model development, from data curation and label definition, to model evaluation, to ensure that all clinical factors and disease features are appropriately considered during model design. This is imperative to ensure automated approaches developed for disease detection are fit-for-purpose in a clinical setting.
引用
收藏
页数:20
相关论文
共 26 条
  • [1] UncertaintyFuseNet: Robust uncertainty-aware hierarchical feature fusion model with Ensemble Monte Carlo Dropout for COVID-19 detection
    Abdar, Moloud
    Salari, Soorena
    Qahremani, Sina
    Lam, Hak-Keung
    Karray, Fakhri
    Hussain, Sadiq
    Khosravi, Abbas
    Acharya, U. Rajendra
    Makarenkov, Vladimir
    Nahavandi, Saeid
    [J]. INFORMATION FUSION, 2023, 90 : 364 - 381
  • [2] COVID-CAPS: A capsule network-based framework for identification of COVID-19 cases from X-ray images
    Afshar, Parnian
    Heidarian, Shahin
    Naderkhani, Farnoosh
    Oikonomou, Anastasia
    Plataniotis, Konstantinos N.
    Mohammadi, Arash
    [J]. PATTERN RECOGNITION LETTERS, 2020, 138 : 638 - 643
  • [3] Albiol A, 2022, INSIGHTS IMAGING, V13, DOI 10.1186/s13244-022-01250-3
  • [4] ECOVNet: a highly effective ensemble based deep learning model for detecting COVID-19
    Chowdhury, Nihad Karim
    Kabir, Muhammad Ashad
    Rahman, Md Muhtadir
    Rezoana, Noortaz
    [J]. PEERJ COMPUTER SCIENCE, 2021,
  • [5] Chest x-ray in the COVID-19 pandemic: Radiologists' real-world reader performance
    Cozzi, Andrea
    Schiaffino, Simone
    Arpaia, Francesco
    Della Pepa, Gianmarco
    Tritella, Stefania
    Bertolotti, Pietro
    Menicagli, Laura
    Monaco, Cristian Giuseppe
    Carbonaro, Luca Alessandro
    Spairani, Riccardo
    Paskeh, Bijan Babaei
    Sardanelli, Francesco
    [J]. EUROPEAN JOURNAL OF RADIOLOGY, 2020, 132
  • [6] AI for radiographic COVID-19 detection selects shortcuts over signal
    DeGrave, Alex J.
    Janizek, Joseph D.
    Lee, Su-In
    [J]. NATURE MACHINE INTELLIGENCE, 2021, 3 (07) : 610 - 619
  • [7] Characteristics of patients with SARS-COV-2 PCR re-positivity after recovering from COVID-19
    Hu, Cheng-Yi
    Lei, Yi
    Tang, Yu-Wen
    Cui, Wen-Shuai
    Wu, Pei-Lian
    Li, Yan-Fang
    Zhou, Yan
    Li, Xin-Yan
    Cui, Hao
    Xiao, Lu-Shan
    Zhao, Zhu-Xiang
    [J]. EPIDEMIOLOGY & INFECTION, 2023, 151
  • [8] Irvin J, 2019, AAAI CONF ARTIF INTE, P590
  • [9] Racial and Ethnic Disparities in Disease Severity on Admission Chest Radiographs among Patients Admitted with Confirmed Coronavirus Disease 2019: A Retrospective Cohort Study
    Joseph, Nicholos P.
    Reid, Nicholas J.
    Som, Avik
    Li, Matthew D.
    Hyle, Emily P.
    Dugdale, Caitlin M.
    Lang, Min
    Betancourt, Joseph R.
    Deng, Francis
    Mendoza, Dexter P.
    Little, Brent P.
    Narayan, Anand K.
    Flores, Efren J.
    [J]. RADIOLOGY, 2020, 297 (03) : E303 - E312
  • [10] CoroNet: A deep neural network for detection and diagnosis of COVID-19 from chest x-ray images
    Khan, Asif Iqbal
    Shah, Junaid Latief
    Bhat, Mohammad Mudasir
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 196 (196)