Detection of tuberculosis from chest X-ray images: Boosting the performance with vision transformer and transfer learning

Cited by: 96
Authors
Duong, Linh T. [1]
Le, Nhi H. [1]
Tran, Toan B. [1]
Ngo, Vuong M. [2]
Nguyen, Phuong T. [3]
Affiliations
[1] Duy Tan Univ, Inst Res & Dev, Da Nang, Vietnam
[2] Ho Chi Minh City Open Univ, Ho Chi Minh City, Vietnam
[3] Univ Aquila, Dept Informat Engn Comp Sci & Math, L'Aquila, Italy
Keywords
Deep learning; EfficientNet; Tuberculosis detection; Transfer learning; Transformer
DOI
10.1016/j.eswa.2021.115519
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Tuberculosis (TB), caused by Mycobacterium tuberculosis, is a contagious disease that ranks among the deadliest diseases in the world. Research in medical imaging has sought to provide doctors with Artificial Intelligence-based techniques and tools for early detection, monitoring, and diagnosis of the disease. Recently, many attempts have been made to automatically recognize TB from chest X-ray (CXR) images. While the reported performance is encouraging, our investigation indicates that many existing approaches have been evaluated on small datasets with limited diversity. We hypothesize that such performance might not hold for heterogeneous data sources that originate from real-world scenarios. The present work aims to fill this gap and improve prediction performance on larger datasets. In particular, we present a practical solution for detecting tuberculosis from CXR images, making use of cutting-edge machine learning and computer vision algorithms. We design a framework that adopts three recent deep neural networks as the main classification engines, namely a modified EfficientNet, a modified original Vision Transformer, and a modified hybrid of EfficientNet with Vision Transformer. Moreover, we strengthen the learning process with various augmentation techniques. We evaluated the proposed approach on a large dataset curated by merging various public datasets. The resulting dataset was split into training, validation, and testing sets, which account for 80%, 10%, and 10% of the dataset, respectively. To further study the proposed approach, we compared it with two state-of-the-art systems. The obtained results are encouraging: a maximum accuracy of 97.72% with an AUC of 100% is achieved with ViT_Base_EfficientNet_B1_224. The experimental results demonstrate that the proposed tool outperforms the considered baselines with respect to different quality metrics.
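The 80%/10%/10% split mentioned in the abstract could be sketched roughly as follows. This is only an illustration, not the authors' code: the abstract does not say how the split was drawn, so the plain seeded shuffle, the function name `split_dataset`, and the file names are all assumptions made for the example.

```python
import random


def split_dataset(items, train_pct=80, val_pct=10, seed=42):
    """Shuffle items and split them into train/validation/test lists.

    Whatever remains after the train and validation shares goes to test.
    A plain random shuffle is an assumption; the abstract does not state
    whether the original split was stratified by class.
    """
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # seeded for reproducibility
    n_train = len(shuffled) * train_pct // 100
    n_val = len(shuffled) * val_pct // 100
    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]
    return train, val, test


# Hypothetical image file names standing in for the merged CXR dataset.
paths = [f"cxr_{i:04d}.png" for i in range(1000)]
train_set, val_set, test_set = split_dataset(paths)
print(len(train_set), len(val_set), len(test_set))
```

For a class-imbalanced dataset, a stratified split (keeping the TB/non-TB ratio equal across the three subsets) would usually be preferable to the plain shuffle shown here.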
Pages: 15