Self-Supervised Generalized Zero Shot Learning for Medical Image Classification Using Novel Interpretable Saliency Maps

被引:31
作者
Mahapatra, Dwarikanath [1 ,2 ]
Ge, Zongyuan [2 ,3 ]
Reyes, Mauricio [4 ]
机构
[1] Inception Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
[2] Monash Univ, Fac Engn, Melbourne, Vic 3800, Australia
[3] Airdoc, Melbourne, Vic 3800, Australia
[4] Univ Bern, ARTORG Ctr Biomed Engn Res, CH-3012 Bern, Switzerland
关键词
Task analysis; Medical diagnostic imaging; Diseases; Semantics; Visualization; X-ray imaging; Image segmentation; Generalized zero shot learning; self supervised learning; saliency; classification; X-ray; pathology;
D O I
10.1109/TMI.2022.3163232
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In many real world medical image classification settings, access to samples of all disease classes is not feasible, affecting the robustness of a system expected to have high performance in analyzing novel test data. This is a case of generalized zero shot learning (GZSL) aiming to recognize seen and unseen classes. We propose a GZSL method that uses self supervised learning (SSL) for: 1) selecting representative vectors of disease classes; and 2) synthesizing features of unseen classes. We also propose a novel approach to generate GradCAM saliency maps that highlight diseased regions with greater accuracy. We exploit information from the novel saliency maps to improve the clustering process by: 1) Enforcing the saliency maps of different classes to be different; and 2) Ensuring that clusters in the space of image and saliency features should yield class centroids having similar semantic information. This ensures the anchor vectors are representative of each class. Different from previous approaches, our proposed approach does not require class attribute vectors which are essential part of GZSL methods for natural images but are not available for medical images. Using a simple architecture the proposed method outperforms state of the art SSL based GZSL performance for natural images as well as multiple types of medical images. We also conduct many ablation studies to investigate the influence of different loss terms in our method.
引用
收藏
页码:2443 / 2456
页数:14
相关论文
共 68 条
[21]   Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs [J].
Gulshan, Varun ;
Peng, Lily ;
Coram, Marc ;
Stumpe, Martin C. ;
Wu, Derek ;
Narayanaswamy, Arunachalam ;
Venugopalan, Subhashini ;
Widner, Kasumi ;
Madams, Tom ;
Cuadros, Jorge ;
Kim, Ramasamy ;
Raman, Rajiv ;
Nelson, Philip C. ;
Mega, Jessica L. ;
Webster, R. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2016, 316 (22) :2402-2410
[22]  
Pham HH, 2020, Arxiv, DOI arXiv:1911.06475
[23]  
Hayat N, 2021, PR MACH LEARN RES, V149, P461
[24]   Generative Dual Adversarial Network for Generalized Zero-shot Learning [J].
Huang, He ;
Wang, Changhu ;
Yu, Philip S. ;
Wang, Chang-Dong .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :801-810
[25]  
Irvin J, 2019, AAAI CONF ARTIF INTE, P590
[26]   Self-supervised Learning for Spinal MRIs [J].
Jamaludin, Amir ;
Kadir, Timor ;
Zisserman, Andrew .
DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, 2017, 10553 :294-302
[27]  
Kaggle and EyePacs, 2015, KAGGLE DIABETIC RETI
[28]   Momentum Contrast for Unsupervised Visual Representation Learning [J].
He, Kaiming ;
Fan, Haoqi ;
Wu, Yuxin ;
Xie, Saining ;
Girshick, Ross .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9726-9735
[29]   Deep Learning-Based Gleason Grading of Prostate Cancer From Histopathology Images-Role of Multiscale Decision Aggregation and Data Augmentation [J].
Karimi, Davood ;
Nir, Guy ;
Fazli, Ladan ;
Black, Peter C. ;
Goldenberg, Larry ;
Salcudean, Septimiu E. .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (05) :1413-1426
[30]   Generalized Zero-Shot Learning Via Over-Complete Distribution [J].
Keshari, Rohit ;
Singh, Richa ;
Vatsa, Mayank .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :13297-13305