Enhanced tuberculosis detection using Vision Transformers and explainable AI with a Grad-CAM approach on chest X-rays

被引:0
|
作者
Vanitha, K. [1 ]
Mahesh, T. R. [2 ]
Kumar, V. Vinoth [3 ]
Guluwadi, Suresh [4 ]
机构
[1] Deemed Univ, Karpagam Acad Higher Educ, Fac Engn, Dept Comp Sci & Engn, Coimbatore, India
[2] JAIN Deemed Univ, Dept Comp Sci & Engn, Bengaluru 562112, India
[3] Vellore Inst Technol Univ, Sch Comp Sci, Vellore 632014, India
[4] Adama Sci & Technol Univ, Adama 302120, Ethiopia
来源
BMC MEDICAL IMAGING | 2025年 / 25卷 / 01期
关键词
Tuberculosis detection; Vision Transformer; Chest X-rays; Explainable AI; Grad-CAM; Self-attention; Medical imaging; Deep learning; Diagnostic accuracy; Convolutional neural networks;
D O I
10.1186/s12880-025-01630-3
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Tuberculosis (TB), caused by Mycobacterium tuberculosis, remains a leading global health challenge, especially in low-resource settings. Accurate diagnosis from chest X-rays is critical yet challenging due to subtle manifestations of TB, particularly in its early stages. Traditional computational methods, primarily using basic convolutional neural networks (CNNs), often require extensive pre-processing and struggle with generalizability across diverse clinical environments. This study introduces a novel Vision Transformer (ViT) model augmented with Gradient-weighted Class Activation Mapping (Grad-CAM) to enhance both diagnostic accuracy and interpretability. The ViT model utilizes self-attention mechanisms to extract long-range dependencies and complex patterns directly from the raw pixel information, whereas Grad-CAM offers visual explanations of model decisions about highlighting significant regions in the X-rays. The model contains a Conv2D stem for initial feature extraction, followed by many transformer encoder blocks, thereby significantly boosting its ability to learn discriminative features without any pre-processing. Performance testing on a validation set had an accuracy of 0.97, recall of 0.99, and F1-score of 0.98 for TB patients. On the test set, the model has accuracy of 0.98, recall of 0.97, and F1-score of 0.98, which is better than existing methods. The addition of Grad-CAM visuals not only improves the transparency of the model but also assists radiologists in assessing and verifying AI-driven diagnoses. These results demonstrate the model's higher diagnostic precision and potential for clinical application in real-world settings, providing a massive improvement in the automated detection of TB.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Explainable COVID-19 Detection on Chest X-rays Using an End-to-End Deep Convolutional Neural Network Architecture
    Chetoui, Mohamed
    Akhloufi, Moulay A.
    Yousefi, Bardia
    Bouattane, El Mostafa
    BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (04)
  • [32] Assessing radiographic findings on finger X-rays using an enhanced deep learning approach
    Kumar R.
    K S.D.
    Mohapatra D.P.
    International Journal of Information Technology, 2024, 16 (7) : 4279 - 4288
  • [33] Leveraging Sequential CNNs for Tuberculosis Detection in Chest X-rays: Employing Convolutional Neural Networks to Spot Tuberculosis in Radiographs
    Agarwal, Muskan
    Gill, Kanwarpartap Singh
    Malhotra, Sonal
    Devliyal, Swati
    2024 2ND WORLD CONFERENCE ON COMMUNICATION & COMPUTING, WCONF 2024, 2024,
  • [34] A Survey of COVID-19 Detection From Chest X-Rays Using Deep Learning Methods
    Dornadula, Bhargavinath
    Geetha, S.
    Anbarasi, L. Jani
    Kadry, Seifedine
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2022, 18 (01)
  • [35] Transfer Learning for COVID-19 and Pneumonia Detection using Chest X-Rays
    Jha, Anshul
    John, Eugene
    Banerjee, Taposh
    2022 IEEE 65TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS 2022), 2022,
  • [36] Anatomical Landmark Detection in Chest X-Rays using Transformer-Based Networks
    Kasturi, Akhil
    Vosoughi, Ali
    Hadjiyski, Nathan
    Stockmastere, Larry
    Sehnert, William J.
    Wismueller, Axel
    COMPUTER-AIDED DIAGNOSIS, MEDICAL IMAGING 2024, 2024, 12927
  • [37] Tomato Health Monitoring System: Tomato Classification, Detection, and Counting System Based on YOLOv8 Model With Explainable MobileNet Models Using Grad-CAM plus
    Quach, Luyl-Da
    Quoc, Khang Nguyen
    Quynh, Anh Nguyen
    Ngoc, Hoang Tran
    Thai-Nghe, Nguyen
    IEEE ACCESS, 2024, 12 : 9719 - 9737
  • [38] A new hybrid approach for pneumonia detection using chest X-rays based on ACNN-LSTM and attention mechanism
    Lafraxo, Samira
    El Ansari, Mohamed
    Koutti, Lahcen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (29) : 73055 - 73077
  • [39] Federated learning with deep convolutional neural networks for the detection of multiple chest diseases using chest x-rays
    Malik, Hassaan
    Anees, Tayyaba
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (23) : 63017 - 63045
  • [40] COVID-19 classification using chest X-ray images based on fusion-assisted deep Bayesian optimization and Grad-CAM visualization
    Hamza, Ameer
    Khan, Muhammad Attique
    Wang, Shui-Hua
    Alhaisoni, Majed
    Alharbi, Meshal
    Hussein, Hany S.
    Alshazly, Hammam
    Kim, Ye Jin
    Cha, Jaehyuk
    FRONTIERS IN PUBLIC HEALTH, 2022, 10