Improving generalized zero-shot learning via cluster-based semantic disentangling representation

Cited by: 3
Authors
Gao, Yi [1]
Feng, Wentao [1]
Xiao, Rong [1]
He, Lihuo [2]
He, Zhenan [1]
Lv, Jiancheng [1]
Tang, Chenwei [1]
Affiliations
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Generalized zero-shot learning; Domain shift; Semantic gap; Cluster; Semantic disentangling representation;
DOI
10.1016/j.patcog.2024.110320
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Generalized Zero-Shot Learning (GZSL) aims to recognize both seen and unseen classes while training only on the seen classes, a setting in which instances of unseen classes tend to be biased towards the seen classes. In this paper, we propose a Cluster-based Semantic Disentangling Representation (CSDR) method to improve GZSL by alleviating the problems of domain shift and semantic gap. First, we group the seen data into multiple clusters, where the samples in each cluster belong to several original seen categories, so as to facilitate fine-grained semantic disentangling of visual feature vectors. Then, we introduce representation random swapping and contrastive learning based on the clustering results to disentangle the representations into semantic-unspecific, class-shared, and class-unique components. The fine-grained disentangled representations show high intra-class similarity and inter-class discriminability, which improves GZSL performance by alleviating domain shift. Finally, we construct the visual-semantic embedding space with a variational auto-encoder and an alignment module, which bridges the semantic gap by generating strongly discriminative unseen-class samples. Extensive experiments on four public data sets show that our method significantly outperforms state-of-the-art methods in both the generalized and conventional settings.
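The first step of the abstract (grouping seen-class samples into clusters that each span several seen categories) can be illustrated with a minimal sketch. The abstract does not name the clustering algorithm, so plain k-means with a deterministic farthest-point initialization is an assumption here, and the toy 2-D features, the number of clusters, and the helper names (`farthest_point_init`, `kmeans`, `classes_per_cluster`) are all hypothetical.

```python
import numpy as np

def farthest_point_init(features, k):
    """Deterministic init (assumption): start at sample 0, then add the farthest sample."""
    chosen = [0]
    for _ in range(k - 1):
        # squared distance from every sample to its nearest already-chosen sample
        d = ((features[:, None, :] - features[chosen][None, :, :]) ** 2).sum(axis=2).min(axis=1)
        chosen.append(int(d.argmax()))
    return features[chosen].copy()

def kmeans(features, k, iters=50):
    """Plain Lloyd's k-means; returns (cluster index per sample, centroids)."""
    centroids = farthest_point_init(features, k)
    for _ in range(iters):
        dists = ((features[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        assign = dists.argmin(axis=1)
        new_centroids = np.array([
            features[assign == j].mean(axis=0) if np.any(assign == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return assign, centroids

def classes_per_cluster(assign, labels, k):
    """List which original seen classes fall into each cluster."""
    return {j: sorted(set(labels[assign == j].tolist())) for j in range(k)}

# Toy stand-in for visual features: 4 seen classes, 20 samples each,
# with class means arranged so that pairs of classes lie close together.
rng = np.random.default_rng(0)
class_means = np.array([[0.0, 0.0], [0.0, 1.0], [10.0, 0.0], [10.0, 1.0]])
labels = np.repeat(np.arange(4), 20)
features = class_means[labels] + rng.normal(scale=0.1, size=(80, 2))

assign, _ = kmeans(features, k=2)
clusters = classes_per_cluster(assign, labels, k=2)
print(clusters)
```

In this toy setup each of the two clusters ends up holding two of the four seen classes, which is the property the abstract relies on: later steps can then separate what is shared within a cluster from what is unique to a class. The swapping, contrastive, and generative stages are not sketched here.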
Pages: 14