Dual Projective Zero-Shot Learning Using Text Descriptions

被引:7
|
作者
Rao, Yunbo [1 ]
Yang, Ziqiang [1 ]
Zeng, Shaoning [2 ]
Wang, Qifeng [3 ]
Pu, Jiansu [4 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, 4,Sect 2,North Jianshe Rd, Chengdu 610054, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Chengdu 313000, Sichuan, Peoples R China
[3] Google Berkeley, Berkeley, CA 94720 USA
[4] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, 4,Sect 2,North Jianshe Rd, Chengdu 610054, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot learning; generalized zero-shot learning; autoencoder; inductive zero-shot learning;
D O I
10.1145/3514247
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Zero-shot learning (ZSL) aims to recognize image instances of unseen classes solely based on the semantic descriptions of the unseen classes. In this field, Generalized Zero-Shot Learning (GZSL) is a challenging problem in which the images of both seen and unseen classes are mixed in the testing phase of learning. Existing methods formulate GZSL as a semantic-visual correspondence problem and apply generative models such as Generative Adversarial Networks and Variational Autoencoders to solve the problem. However, these methods suffer from the bias problem since the images of unseen classes are often misclassified into seen classes. In this work, a novel model named the Dual Projective model for Zero-Shot Learning (DPZSL) is proposed using text descriptions. In order to alleviate the bias problem, we leverage two autoencoders to project the visual and semantic features into a latent space and evaluate the embeddings by a visual-semantic correspondence loss function. An additional novel classifier is also introduced to ensure the discriminability of the embedded features. Our method focuses on a more challenging inductive ZSL setting in which only the labeled data from seen classes are used in the training phase. The experimental results, obtained from two popular datasets-Caltech-UCSD Birds-200-2011 (CUB) and North America Birds (NAB)-show that the proposed DPZSL model significantly outperforms both the inductive ZSL and GZSL settings. Particularly in the GZSL setting, our model yields an improvement up to 15.2% in comparison with state-of-the-art CANZSL on datasets CUB and NAB with two splittings.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Explanatory Object Part Aggregation for Zero-Shot Learning
    Chen, Xin
    Deng, Xiaoling
    Lan, Yubin
    Long, Yongbing
    Weng, Jian
    Liu, Zhiquan
    Tian, Qi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 851 - 868
  • [42] Discriminative Latent Attribute Autoencoder for Zero-Shot Learning
    Chen, Runqing
    Wu, Songsong
    Sun, Guangcheng
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 873 - 877
  • [43] Extreme Reverse Projection Learning for Zero-Shot Recognition
    Guan, Jiechao
    Zhao, An
    Lu, Zhiwu
    COMPUTER VISION - ACCV 2018, PT I, 2019, 11361 : 125 - 141
  • [44] A Semantic Similarity Supervised Autoencoder for Zero-Shot Learning
    Shen, Fengli
    Lu, Zhe-Ming
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (06): : 1419 - 1422
  • [45] GENERALIZED ZERO-SHOT LEARNING USING CONDITIONAL WASSERSTEIN AUTOENCODER
    Kim, Junhan
    Shim, Byonghyo
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3413 - 3417
  • [46] Unified benchmark for zero-shot Turkish text classification
    celik, Emrecan
    Dalyan, Tugba
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [47] Learning Using Privileged Information for Zero-Shot Action Recognition
    Gao, Zhiyi
    Hou, Yonghong
    Li, Wanqing
    Guo, Zihui
    Yu, Bin
    COMPUTER VISION - ACCV 2022, PT IV, 2023, 13844 : 347 - 362
  • [48] Zero-Shot Learning with Missing Attributes using Semantic Correlations
    Braytee, Ali
    Naji, Mohamad
    Anaissi, Ali
    Chaturvedi, Kunal
    Prasad, Mukesh
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [49] Zero-shot learning for action recognition using synthesized features
    Mishra, Ashish
    Pandey, Anubha
    Murthy, Hema A.
    NEUROCOMPUTING, 2020, 390 : 117 - 130
  • [50] ZERO-SHOT LEARNING USING STACKED AUTOENCODER WITH MANIFOLD REGULARIZATIONS
    Song, Jianqiang
    Shi, Guangming
    Xie, Xuemei
    Gao, Dahua
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3651 - 3655