Analysis of the Clever Hans effect in COVID-19 detection using Chest X-Ray images and Bayesian Deep Learning

被引：5

作者：

Arias-Londono, Julian D. ^{[1
]}

Godino-Llorente, Juan I. ^{[1
]}

机构：

[1] Univ Politecn Madrid, Dept Signals Syst & Radiocommun, ETSI Telecomunicac, Ave Complutense 30, Madrid 28040, Spain

来源：

BIOMEDICAL SIGNAL PROCESSING AND CONTROL | 2024年 / 90卷

关键词：

Deep learning; Bayesian learning; Explainability; Uncertainty; Calibration; COVID-19; Pneumonia; Radiological imaging; Chest X-Ray; AUTOMATIC DETECTION;

D O I：

10.1016/j.bspc.2023.105831

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

In recent months, the detection of COVID-19 from radiological images has become a topic of significant interest. Several works have proposed different AI models to demonstrate the feasibility of the application. However, the literature has also reported unwanted behaviours, spurious correlations, and biases of the developed systems that significantly limit their translation to the clinic. This paper deals with a set of interpretability techniques to analyse spurious correlations during the inference, the consistency of the decisions, and the uncertainty of the models, and evaluate the model's performance in a broader and thoughtful way, especially regarding biasing effects, aiming to provide new methodological cues that can increase the systems' robustness. Two different off-the-shelf convolutional neural networks (DenseNet-121 and EfficientNet-B6) were tested along with their Bayesian counterparts. Different saliency maps are used to evaluate the effects of artifacts and confounding factors, and, taking advantage of uncertainty estimations, a new version of the importance of context measure was proposed, to provide more evidence of the spurious correlation affecting models' performance. In view of the results, DenseNet is preferred in both its standard and Bayesian versions, reaching BAcc over 97% training with a large data set (more than 70,000 images). However, results demonstrate that models are significantly affected by the biasing effects, which is minimised by pre-processing with a semantic segmentation of the lungs to guide the learning process towards areas with causal relationships with the problem under study. The conclusions could be extrapolated to the general context of pneumonia detection from chest RX.

引用

页数：18

共 71 条

[1]

Achtibat R, 2024, Arxiv, DOI [arXiv:2206.03208, 10.48550/arXiv.2206.03208]

[2]

Ancona M, 2018, Arxiv, DOI arXiv:1711.06104

[3] Automatic Identification of Lung Opacities Due to COVID-19 from Chest X-ray Images-Focussing Attention on the Lungs [J].

Arias-Londono, Julian D. ;

Moure-Prado, Alvaro ;

Godino-Llorente, Juan I. .

DIAGNOSTICS, 2023, 13 (08)

[4] Artificial Intelligence Applied to Chest X-Ray Images for the Automatic Detection of COVID-19. A Thoughtful Evaluation Approach [J].

Arias-Londono, Julian D. ;

Gomez-Garcia, Jorge A. ;

Moro-Velazquez, Laureano ;

Godino-Llorente, Juan, I .

IEEE ACCESS, 2020, 8 :226811-226827

[5] On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation [J].

Bach, Sebastian ;

Binder, Alexander ;

Montavon, Gregoire ;

Klauschen, Frederick ;

Mueller, Klaus-Robert ;

Samek, Wojciech .

PLOS ONE, 2015, 10 (07)

[6] Deep convolutional networks do not classify based on global object shape [J].

Baker, Nicholas ;

Lu, Hongjing ;

Erlikhman, Gennady ;

Kellman, Philip J. .

PLOS COMPUTATIONAL BIOLOGY, 2018, 14 (12)

[7] A deep convolutional neural network for COVID-19 detection using chest X-rays [J].

Bassi P.R.A.S. ;

Attux R. .

Research on Biomedical Engineering, 2022, 38 (01) :139-148

[8]

Bickel S, 2009, J MACH LEARN RES, V10, P2137

[9]

Blundell C, 2015, PR MACH LEARN RES, V37, P1613

[10] Convolutional Dynamic Alignment Networks for Interpretable Classifications [J].

Boehle, Moritz ;

Fritz, Mario ;

Schiele, Bernt .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :10024-10033

← 1 2 3 4 5 6 7 8 →