Does imbalance in chest X-ray datasets produce biased deep learning approaches for COVID-19 screening?

被引:5
作者
Alvarez-Rodriguez, Lorena [1 ,2 ]
de Moura, Joaquim [1 ,2 ]
Novo, Jorge [1 ,2 ]
Ortega, Marcos [1 ,2 ]
机构
[1] Univ A Coruna, Ctr Invest CITIC, Campus Elvina, La Coruna 15071, Spain
[2] Univ A Coruna, Inst Invest Biomed A Coruna INIBIC, Grp VARPA, La Coruna 15006, Spain
关键词
CAD system; Chest X-ray; COVID-19; screening; Data analysis; Deep learning; IMAGES;
D O I
10.1186/s12874-022-01578-w
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background The health crisis resulting from the global COVID-19 pandemic highlighted more than ever the need for rapid, reliable and safe methods of diagnosis and monitoring of respiratory diseases. To study pulmonary involvement in detail, one of the most common resources is the use of different lung imaging modalities (like chest radiography) to explore the possible affected areas. Methods The study of patient characteristics like sex and age in pathologies of this type is crucial for gaining knowledge of the disease and for avoiding biases due to the clear scarcity of data when developing representative systems. In this work, we performed an analysis of these factors in chest X-ray images to identify biases. Specifically, 11 imbalance scenarios were defined with female and male COVID-19 patients present in different proportions for the sex analysis, and 6 scenarios where only one specific age range was used for training for the age factor. In each study, 3 different approaches for automatic COVID-19 screening were used: Normal vs COVID-19, Pneumonia vs COVID-19 and Non-COVID-19 vs COVID-19. The study was validated using two public chest X-ray datasets, allowing a reliable analysis to support the clinical decision-making process. Results The results for the sex-related analysis indicate this factor slightly affects the system in the Normal VS COVID-19 and Pneumonia VS COVID-19 approaches, although the identified differences are not relevant enough to worsen considerably the system. Regarding the age-related analysis, this factor was observed to be influencing the system in a more consistent way than the sex factor, as it was present in all considered scenarios. However, this worsening does not represent a major factor, as it is not of great magnitude. Conclusions Multiple studies have been conducted in other fields in order to determine if certain patient characteristics such as sex or age influenced these deep learning systems. However, to the best of our knowledge, this study has not been done for COVID-19 despite the urgency and lack of COVID-19 chest x-ray images. The presented results evidenced that the proposed methodology and tested approaches allow a robust and reliable analysis to support the clinical decision-making process in this pandemic scenario.
引用
收藏
页数:17
相关论文
共 35 条
[1]   An Ensemble of Global and Local-Attention Based Convolutional Neural Networks for COVID-19 Diagnosis on Chest X-ray Images [J].
Afifi, Ahmed ;
Hafsa, Noor E. ;
Ali, Mona A. S. ;
Alhumam, Abdulaziz ;
Alsalman, Safa .
SYMMETRY-BASEL, 2021, 13 (01) :1-25
[2]  
[Anonymous], 2021, COVID DATA SAVE LIVE
[3]   Artificial Intelligence Applied to Chest X-Ray Images for the Automatic Detection of COVID-19. A Thoughtful Evaluation Approach [J].
Arias-Londono, Julian D. ;
Gomez-Garcia, Jorge A. ;
Moro-Velazquez, Laureano ;
Godino-Llorente, Juan, I .
IEEE ACCESS, 2020, 8 :226811-226827
[4]  
Chicco D., 2021, SIAMESE NEURAL NETWO
[5]   Sex and gender differences and biases in artificial intelligence for biomedicine and healthcare [J].
Cirillo, Davide ;
Catuara-Solarz, Silvina ;
Morey, Czuee ;
Guney, Emre ;
Subirats, Laia ;
Mellino, Simona ;
Gigante, Annalisa ;
Valencia, Alfonso ;
Rementeria, Maria Jose ;
Chadha, Antonella Santuccione ;
Mavridis, Nikolaos .
NPJ DIGITAL MEDICINE, 2020, 3 (01)
[6]  
de Moura J, 2020, MEDRXIV, DOI [10.1101/2020.05.01.20087254, DOI 10.1101/2020.05.01.20087254]
[7]   Deep Convolutional Approaches for the Analysis of COVID-19 Using Chest X-Ray Images From Portable Devices [J].
De Moura, Joaquim ;
Garcia, Lucia Ramos ;
Vidal, Placido Francisco Lizancos ;
Cruz, Milena ;
Lopez, Laura Abelairas ;
Lopez, Eva Castro ;
Novo, Jorge ;
Ortega, Marcos .
IEEE ACCESS, 2020, 8 :195594-195607
[8]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9]   IKONOS: an intelligent tool to support diagnosis of COVID-19 by texture analysis of X-ray images [J].
Gomes J.C. ;
Barbosa V.A.F. ;
Santana M.A. ;
Bandeira J. ;
Valença M.J.S. ;
de Souza R.E. ;
Ismael A.M. ;
dos Santos W.P. .
Research on Biomedical Engineering, 2022, 38 (01) :15-28
[10]   Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences [J].
Gomes, Juliana Carneiro ;
Masood, Aras Ismael ;
Silva, Leandro Honorato de S. ;
da Cruz Ferreira, Janderson Romario B. ;
Freire Junior, Agostinho Antonio ;
dos Santos Rocha, Allana Lais ;
Portela de Oliveira, Leticia Castro ;
Cauas da Silva, Nathalia Regina ;
Torres Fernandes, Bruno Jose ;
dos Santos, Wellington Pinheiro .
SCIENTIFIC REPORTS, 2021, 11 (01)