Multi-centre benchmarking of deep learning models for COVID-19 detection in chest x-rays

被引：0

作者：

Harkness, Rachael ^{[1
,2
]}

Frangi, Alejandro F. ^{[3
,4
]}

Zucker, Kieran ^{[5
]}

Ravikumar, Nishant ^{[1
,2
]}

机构：

[1] Univ Leeds, Sch Comp, Leeds, England

[2] Ctr Computat Imaging & Simulat Technol Biomed, Leeds, England

[3] Univ Manchester, Sch Hlth Sci, Div Informat Imaging & Data Sci, Manchester, England

[4] Univ Manchester, Sch Engn, Dept Comp Sci, Manchester, England

[5] Univ Leeds, Leeds Inst Med Res, Sch Med, Leeds, England

来源：

FRONTIERS IN RADIOLOGY | 2024年 / 4卷

基金：

英国工程与自然科学研究理事会;

关键词：

deep learning; COVID-19; chest x-rays; artificial intelligence; benchmarking; NETWORK;

D O I：

10.3389/fradi.2024.1386906

中图分类号：

R8 [特种医学]; R445 [影像诊断学];

学科分类号：

1002 ; 100207 ; 1009 ;

摘要：

Introduction This study is a retrospective evaluation of the performance of deep learning models that were developed for the detection of COVID-19 from chest x-rays, undertaken with the goal of assessing the suitability of such systems as clinical decision support tools.Methods Models were trained on the National COVID-19 Chest Imaging Database (NCCID), a UK-wide multi-centre dataset from 26 different NHS hospitals and evaluated on independent multi-national clinical datasets. The evaluation considers clinical and technical contributors to model error and potential model bias. Model predictions are examined for spurious feature correlations using techniques for explainable prediction.Results Models performed adequately on NHS populations, with performance comparable to radiologists, but generalised poorly to international populations. Models performed better in males than females, and performance varied across age groups. Alarmingly, models routinely failed when applied to complex clinical cases with confounding pathologies and when applied to radiologist defined "mild" cases.Discussion This comprehensive benchmarking study examines the pitfalls in current practices that have led to impractical model development. Key findings highlight the need for clinician involvement at all stages of model development, from data curation and label definition, to model evaluation, to ensure that all clinical factors and disease features are appropriately considered during model design. This is imperative to ensure automated approaches developed for disease detection are fit-for-purpose in a clinical setting.

引用

页数：20

共 26 条

[1] UncertaintyFuseNet: Robust uncertainty-aware hierarchical feature fusion model with Ensemble Monte Carlo Dropout for COVID-19 detection
Abdar, Moloud
Salari, Soorena
Qahremani, Sina
Lam, Hak-Keung
Karray, Fakhri
Hussain, Sadiq
Khosravi, Abbas
Acharya, U. Rajendra
Makarenkov, Vladimir
Nahavandi, Saeid
[J]. INFORMATION FUSION, 2023, 90 : 364 - 381
[2] COVID-CAPS: A capsule network-based framework for identification of COVID-19 cases from X-ray images
Afshar, Parnian
Heidarian, Shahin
Naderkhani, Farnoosh
Oikonomou, Anastasia
Plataniotis, Konstantinos N.
Mohammadi, Arash
[J]. PATTERN RECOGNITION LETTERS, 2020, 138 : 638 - 643
[3] Albiol A, 2022, INSIGHTS IMAGING, V13, DOI 10.1186/s13244-022-01250-3
[4] ECOVNet: a highly effective ensemble based deep learning model for detecting COVID-19
Chowdhury, Nihad Karim
Kabir, Muhammad Ashad
Rahman, Md Muhtadir
Rezoana, Noortaz
[J]. PEERJ COMPUTER SCIENCE, 2021,
[5] Chest x-ray in the COVID-19 pandemic: Radiologists' real-world reader performance
Cozzi, Andrea
Schiaffino, Simone
Arpaia, Francesco
Della Pepa, Gianmarco
Tritella, Stefania
Bertolotti, Pietro
Menicagli, Laura
Monaco, Cristian Giuseppe
Carbonaro, Luca Alessandro
Spairani, Riccardo
Paskeh, Bijan Babaei
Sardanelli, Francesco
[J]. EUROPEAN JOURNAL OF RADIOLOGY, 2020, 132
[6] AI for radiographic COVID-19 detection selects shortcuts over signal
DeGrave, Alex J.
Janizek, Joseph D.
Lee, Su-In
[J]. NATURE MACHINE INTELLIGENCE, 2021, 3 (07) : 610 - 619
[7] Characteristics of patients with SARS-COV-2 PCR re-positivity after recovering from COVID-19
Hu, Cheng-Yi
Lei, Yi
Tang, Yu-Wen
Cui, Wen-Shuai
Wu, Pei-Lian
Li, Yan-Fang
Zhou, Yan
Li, Xin-Yan
Cui, Hao
Xiao, Lu-Shan
Zhao, Zhu-Xiang
[J]. EPIDEMIOLOGY & INFECTION, 2023, 151
[8] Irvin J, 2019, AAAI CONF ARTIF INTE, P590
[9] Racial and Ethnic Disparities in Disease Severity on Admission Chest Radiographs among Patients Admitted with Confirmed Coronavirus Disease 2019: A Retrospective Cohort Study
Joseph, Nicholos P.
Reid, Nicholas J.
Som, Avik
Li, Matthew D.
Hyle, Emily P.
Dugdale, Caitlin M.
Lang, Min
Betancourt, Joseph R.
Deng, Francis
Mendoza, Dexter P.
Little, Brent P.
Narayan, Anand K.
Flores, Efren J.
[J]. RADIOLOGY, 2020, 297 (03) : E303 - E312
[10] CoroNet: A deep neural network for detection and diagnosis of COVID-19 from chest x-ray images
Khan, Asif Iqbal
Shah, Junaid Latief
Bhat, Mohammad Mudasir
[J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 196 (196)

← 1 2 3 →