Enhancing histopathological image analysis: An explainable vision transformer approach with comprehensive interpretation methods and evaluation of explanation quality

被引:3
作者
Mir, Aqib Nazir [1 ]
Rizvi, Danish Raza [1 ]
Ahmad, Md Rizwan [2 ]
机构
[1] Jamia Millia Islamia, Dept Comp Engn, New Delhi 110025, India
[2] Forbes Advisor P&G Plaza, Mumbai 400076, India
关键词
Vision transformer; Explainable artificial intelligence; Histopathology; Model explainability; Interpretability metrics;
D O I
10.1016/j.engappai.2025.110519
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning models are increasingly reshaping medical imaging, with growing attention on ensuring transparency and trust in their decision-making processes. This study presents the Explainable Vision Transformer (XViT), a model specifically designed for histopathological image analysis. By incorporating advanced interpretability techniques, the XViT model addresses three core aspects: feature learning and classification, generating explainable outputs, and qualitatively evaluating these explanations. Three novel interpretability methods are introduced: attention-based, model-agnostic, and gradient-based, offering diverse perspectives on model behavior. The model's performance and generalizability were rigorously evaluated on two histopathological datasets: lung colon 25000 (LCS25000) with 96.2% accuracy across three classes and Kangbuk Samsung Hospital (KBSMC) with 88.6% accuracy across four classes. XViT provides actionable insights by highlighting diagnostically relevant regions in input images, significantly enhancing clinical trust and decision-making. The evaluation of its explainability methods through metrics like sensitivity, faithfulness, and complexity demonstrated that layer-wise relevance propagation for transformers outperforms standard techniques like local interpretable model-agnostic explanations (LIME) and attention visualization. This robust performance underscores the XViT model's potential to bridge the gap between AI accuracy and interpretability in medical imaging. Our findings emphasize the need for well-defined evaluation criteria when comparing interpretability methods and highlight the model's potential for integration into clinical workflows. This work represents a step forward in creating reliable and interpretable AI solutions, ensuring that the benefits of advanced deep learning models extend seamlessly into practical healthcare settings.
引用
收藏
页数:15
相关论文
共 60 条
[51]   Scale-Aware Transformers for Diagnosing Melanocytic Lesions [J].
Wu, Wenjun ;
Mehta, Sachin ;
Nofallah, Shima ;
Knezevich, Stevan ;
May, Caitlin J. ;
Chang, Oliver H. ;
Elmore, Joann G. ;
Shapiro, Linda G. .
IEEE ACCESS, 2021, 9 :163526-163541
[52]   Transformer with convolution and graph-node co-embedding: An accurate and interpretable vision backbone for predicting gene expressions from local histopathological image [J].
Xiao, Xiao ;
Kong, Yan ;
Li, Ronghan ;
Wang, Zuoheng ;
Lu, Hui .
MEDICAL IMAGE ANALYSIS, 2024, 91
[53]   An Efficient Technique for Nuclei Segmentation Based on Ellipse Descriptor Analysis and Improved Seed Detection Algorithm [J].
Xu, Hongming ;
Lu, Cheng ;
Mandal, Mrinal .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2014, 18 (05) :1729-1741
[54]  
Yeh CK, 2019, ADV NEUR IN, V32
[55]  
Ying Zou, 2021, 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), P1235, DOI 10.1109/BIBM52615.2021.9669903
[56]   Explainable hybrid vision transformers and convolutional network for multimodal glioma segmentation in brain MRI [J].
Zeineldin, Ramy A. ;
Karar, Mohamed E. ;
Elshaer, Ziad ;
Coburger, Jan ;
Wirtz, Christian R. ;
Burgert, Oliver ;
Mathis-Ullrich, Franziska .
SCIENTIFIC REPORTS, 2024, 14 (01)
[57]   Shifting machine learning for healthcare from development to deployment and from models to data [J].
Zhang, Angela ;
Xing, Lei ;
Zou, James ;
Wu, Joseph C. .
NATURE BIOMEDICAL ENGINEERING, 2022, 6 (12) :1330-1345
[58]   MC-ViT: Multi-path cross-scale vision transformer for thymoma histopathology whole slide image typing [J].
Zhang, Huaqi ;
Chen, Huang ;
Qin, Jin ;
Wang, Bei ;
Ma, Guolin ;
Wang, Pengyu ;
Zhong, Dingrong ;
Liu, Jie .
FRONTIERS IN ONCOLOGY, 2022, 12
[59]   A Multi-branch Hybrid Transformer Network for Corneal Endothelial Cell Segmentation [J].
Zhang, Yinglin ;
Higashita, Risa ;
Fu, Huazhu ;
Xu, Yanwu ;
Zhang, Yang ;
Liu, Haofeng ;
Zhang, Jian ;
Liu, Jiang .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 :99-108
[60]  
Zheng Yi, 2022, IEEE Trans Med Imaging, V41, P3003, DOI [10.1109/tmi.2022.3176598, 10.1109/TMI.2022.3176598]