RadFormer: Transformers with global-local attention for interpretable and accurate Gallbladder Cancer detection

被引:31
作者
Basu, Soumen [1 ]
Gupta, Mayank [1 ]
Rana, Pratyaksha [2 ]
Gupta, Pankaj [2 ]
Arora, Chetan [1 ]
机构
[1] Indian Inst Technol Delhi, Dept Comp Sci, New Delhi, India
[2] Postgrad Inst Med Educ & Res, Dept Radiodiag & Imaging, Chandigarh, India
关键词
Explainable AI; Visual transformer; Gallbladder Cancer; Ultrasound Sonography; NEURAL-NETWORK; DEEP; DIAGNOSIS; WALL; SEGMENTATION; ULTRASOUND;
D O I
10.1016/j.media.2022.102676
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel deep neural network architecture to learn interpretable representation for medical image analysis. Our architecture generates a global attention for region of interest, and then learns bag of words style deep feature embeddings with local attention. The global, and local feature maps are combined using a contemporary transformer architecture for highly accurate Gallbladder Cancer (GBC) detection from Ultrasound (USG) images. Our experiments indicate that the detection accuracy of our model beats even human radiologists, and advocates its use as the second reader for GBC diagnosis. Bag of words embeddings allow our model to be probed for generating interpretable explanations for GBC detection consistent with the ones reported in medical literature. We show that the proposed model not only helps understand decisions of neural network models but also aids in discovery of new visual features relevant to the diagnosis of GBC. Source-code is available at https://github.com/sbasu276/RadFormer.
引用
收藏
页数:13
相关论文
共 86 条
[71]  
Wu H., 2022, CVPR, P11666
[72]  
Wu H., 2021, ICCV, P3489
[73]  
Wu H., 2022, IEEE Trans. Cybern.
[74]   Semi-supervised segmentation of echocardiography videos via noise-resilient spatiotemporal semantic calibration and fusion [J].
Wu, Huisi ;
Liu, Jiasheng ;
Xiao, Fangyan ;
Wen, Zhenkun ;
Cheng, Lan ;
Qin, Jing .
MEDICAL IMAGE ANALYSIS, 2022, 78
[75]   FAT-Net: Feature adaptive transformers for automated skin lesion segmentation [J].
Wu, Huisi ;
Chen, Shihuai ;
Chen, Guilian ;
Wang, Wei ;
Lei, Baiying ;
Wen, Zhenkun .
MEDICAL IMAGE ANALYSIS, 2022, 76
[76]  
Wu HS, 2021, AAAI CONF ARTIF INTE, V35, P2907
[77]  
Wu HS, 2021, AAAI CONF ARTIF INTE, V35, P2916
[78]   SCS-Net: A Scale and Context Sensitive Network for Retinal Vessel Segmentation [J].
Wu, Huisi ;
Wang, Wei ;
Zhong, Jiafu ;
Lei, Baiying ;
Wen, Zhenkun ;
Qin, Jing .
MEDICAL IMAGE ANALYSIS, 2021, 70
[79]   Automated left ventricular segmentation from cardiac magnetic resonance images via adversarial learning with multi-stage pose estimation network and co-discriminator [J].
Wu, Huisi ;
Lu, Xuheng ;
Lei, Baiying ;
Wen, Zhenkun .
MEDICAL IMAGE ANALYSIS, 2021, 68
[80]   Automated Skin Lesion Segmentation Via an Adaptive Dual Attention Module [J].
Wu, Huisi ;
Pan, Junquan ;
Li, Zhuoying ;
Wen, Zhenkun ;
Qin, Jing .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (01) :357-370