A Hybrid VDV Model for Automatic Diagnosis of Pneumothorax Using Class-Imbalanced Chest X-Rays Dataset

被引:5
作者
Iqbal, Tahira [1 ]
Shaukat, Arslan [1 ]
Akram, Muhammad Usman [1 ]
Muzaffar, Abdul Wahab [2 ]
Mustansar, Zartasha [3 ]
Byun, Yung-Cheol [4 ]
机构
[1] Natl Univ Sci & Technol NUST, Coll Elect & Mech Engn, Dept Comp & Software Engn, Islamabad 44000, Pakistan
[2] Saudi Elect Univ, Coll Comp & Informat, Riyadh 11673, Saudi Arabia
[3] Natl Univ Sci & Technol NUST, Res Ctr Modelling & Simulat RCMS, Islamabad 44000, Pakistan
[4] Jeju Natl Univ, Dept Comp Engn, Jeju 690756, South Korea
关键词
Feature extraction; X-ray imaging; Biomedical imaging; Convolutional neural networks; Training; Support vector machines; Diseases; Class-imbalance; chest X-rays; classification; deep learning; ensemble; machine learning; pneumothorax; under-bagging; ENSEMBLE METHOD; CLASSIFICATION; ALGORITHM;
D O I
10.1109/ACCESS.2022.3157316
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pneumothorax, a life-threatening disease, needs to be diagnosed immediately and efficiently. The prognosis, in this case, is not only time-consuming but also prone to human errors. So, an automatic way of accurate diagnosis using chest X-rays is the utmost requirement. To date, most of the available medical image datasets have a class-imbalance (CI) issue. The main theme of this study is to solve this problem along with proposing an automated way of detecting pneumothorax. To find the optimal approach for CI problem, we first compare the existing approaches and find that under-bagging method (referred as data-level-ensemble formed by creating subsets of majority class and then combining each subset with all samples of minority class) outperforms other existing approaches. After selection of best approach for CI problem, we propose a novel framework, named as VDV model, for pneumothorax detection from highly imbalance dataset. The proposed VDV model is a complex model-level ensemble of data-level-ensembles and uses three convolutional neural networks (CNN) including VGG16, VGG-19, and DenseNet-121 as fixed feature extractors. In each data-level-ensemble, features extracted from one of the pre-defined CNN architectures are fed to support vector machine (SVM) classifier, and output is calculated using the voting method. Once outputs from the three data-level-ensembles (corresponding to three different CNN architectures as feature extractor) are obtained, then, again, the voting method is used to calculate the final prediction. Our proposed framework is tested on the SIIM ACR Pneumothorax dataset and Random Sample of NIH Chest X-ray dataset (RS-NIH). For the first dataset, 85.17% Recall with 86.0% Area under the Receiver Operating Characteristic curve (AUC) is attained. For the second dataset, 90.9% Recall with 95.0% AUC is achieved with a random split of data while 85.45% recall with 77.06% AUC is obtained with a patient-wise split of data. The comparison of our results for both the datasets with related work proves the effectiveness of proposed VDV model for pneumothorax detection.
引用
收藏
页码:27670 / 27683
页数:14
相关论文
共 57 条
[1]   Pneumothorax Detection in Chest Radiographs Using Convolutional Neural Networks [J].
Aviel, Blumenfeld ;
Eli, Konen ;
Hayit, Greenspan .
MEDICAL IMAGING 2018: COMPUTER-AIDED DIAGNOSIS, 2018, 10575
[2]   New applications of ensembles of classifiers [J].
Barandela, R ;
Sánchez, JS ;
Valdovinos, RM .
PATTERN ANALYSIS AND APPLICATIONS, 2003, 6 (03) :245-256
[3]  
Bharati Subrato, 2020, Inform Med Unlocked, V20, P100391, DOI 10.1016/j.imu.2020.100391
[4]  
Bhavani Raskutti, 2004, SIGKDD Explor. Newsl, V6, P60, DOI 10.1145/1007730.1007739
[5]   A systematic study of the class imbalance problem in convolutional neural networks [J].
Buda, Mateusz ;
Maki, Atsuto ;
Mazurowski, Maciej A. .
NEURAL NETWORKS, 2018, 106 :249-259
[6]  
Bunrit Supaporn, 2019, International Journal of Machine Learning and Computing, V9, P201, DOI 10.18178/ijmlc.2019.9.2.787
[7]  
Chan YH, 2018, J HEALTHC ENG, V2018, DOI [10.1155/2018/2908517, 10.1155/2018/4595062]
[8]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[9]  
Chollet F., 2020, BUILDING POWERFUL IM
[10]   Chest disease radiography in twofold: using convolutional neural networks and transfer learning [J].
Choudhary, Prakash ;
Hazra, Abhishek .
EVOLVING SYSTEMS, 2021, 12 (02) :567-579