A cross-modal deep metric learning model for disease diagnosis based on chest x-ray images

被引:7
作者
Jin, Yufei [1 ,2 ]
Lu, Huijuan [1 ,2 ]
Li, Zhao [3 ]
Wang, Yanbin [4 ]
机构
[1] China JiLiang Univ, Hangzhou 310018, Zhejiang, Peoples R China
[2] Key Lab Electromagnet Wave Informat Technol & Metr, Hangzhou 310018, Zhejiang, Peoples R China
[3] Zhejiang Univ, Hangzhou 310018, Zhejiang, Peoples R China
[4] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310018, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Chest X-ray; Generalized zero-shot learning; Deep metric learning; Cross-modal; Multi-label classification; CLASSIFICATION;
D O I
10.1007/s11042-023-14790-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emergence of unknown diseases is often with few or no samples available. Zero-shot learning and few-shot learning have promising applications in medical image analysis. In this paper, we propose a Cross-Modal Deep Metric Learning Generalized Zero-Shot Learning (CM-DML-GZSL) model. The proposed network consists of a visual feature extractor, a fixed semantic feature extractor, and a deep regression module. The network belongs to a two-stream network for multiple modalities. In a multi-label setting, each sample contains a small number of positive labels and a large number of negative labels on average. This positive-negative imbalance dominates the optimization procedure and may prevent the establishment of an effective correspondence between visual features and semantic vectors during training, resulting in a low degree of accuracy. A novel weighted focused Euclidean distance metric loss is introduced in this regard. This loss not only can dynamically increase the weight of hard samples and decrease the weight of simple samples, but it can also promote the connection between samples and semantic vectors corresponding to their positive labels, which helps mitigate bias in predicting unseen classes in the generalized zero-shot learning setting. The weighted focused Euclidean distance metric loss function can dynamically adjust sample weights, enabling zero-shot multi-label learning for chest X-ray diagnosis, as experimental results on large publicly available datasets demonstrate.
引用
收藏
页码:33421 / 33442
页数:22
相关论文
共 68 条
  • [1] Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911
  • [2] Label-Embedding for Attribute-Based Classification
    Akata, Zeynep
    Perronnin, Florent
    Harchaoui, Zaid
    Schmid, Cordelia
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 819 - 826
  • [3] Learning multi-label scene classification
    Boutell, MR
    Luo, JB
    Shen, XP
    Brown, CM
    [J]. PATTERN RECOGNITION, 2004, 37 (09) : 1757 - 1771
  • [4] Hyperspectral imagery classification with deep metric learning
    Cao, Xianghai
    Ge, Yiming
    Li, Renjie
    Zhao, Jing
    Jiao, Licheng
    [J]. NEUROCOMPUTING, 2019, 356 : 217 - 227
  • [5] Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition
    Chen, Tianshui
    Xu, Muxin
    Hui, Xiaolu
    Wu, Hefeng
    Lin, Liang
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 522 - 531
  • [6] Multi-Label Image Recognition with Graph Convolutional Networks
    Chen, Zhao-Min
    Wei, Xiu-Shen
    Wang, Peng
    Guo, Yanwen
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5172 - 5181
  • [7] Task-wise attention guided part complementary learning for few-shot image classification
    Cheng, Gong
    Li, Ruimin
    Lang, Chunbo
    Han, Junwei
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (02)
  • [8] When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs
    Cheng, Gong
    Yang, Ceyuan
    Yao, Xiwen
    Guo, Lei
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05): : 2811 - 2821
  • [9] A hierarchical multi-label classification method based on neural networks for gene function prediction
    Feng, Shou
    Fu, Ping
    Zheng, Wenbin
    [J]. BIOTECHNOLOGY & BIOTECHNOLOGICAL EQUIPMENT, 2018, 32 (06) : 1613 - 1621
  • [10] Fu Y, 2015, ARXIV