Domain-Aware Prototype Network for Generalized Zero-Shot Learning

Authors
Hu, Yongli [1]
Feng, Lincong [1]
Jiang, Huajie [1]
Liu, Mengting [1]
Yin, Baocai [1]
Affiliations
[1] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, Fac Informat Technol, Beijing 100124, Peoples R China
Keywords
Visualization; Prototypes; Semantics; Transformers; Image recognition; Feature extraction; Task analysis; Generalized zero-shot learning; Transformer-based dual attention; Domain detection
DOI
10.1109/TCSVT.2023.3313727
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
Generalized zero-shot learning (GZSL) aims to recognize images from both seen and unseen classes using side information, such as manually annotated attribute vectors. Traditional methods focus on mapping images and semantics into a common latent space to achieve visual-semantic alignment. Because unseen classes are unavailable during training, these methods suffer from a serious recognition bias: they tend to classify unseen-class images as seen classes. To solve this problem, we propose a Domain-aware Prototype Network (DPN), which splits the GZSL problem into seen-class recognition and unseen-class recognition. For the seen classes, we design a domain-aware prototype learning branch with a dual attention feature encoder to capture the essential visual information; this branch recognizes the seen classes and detects images from novel categories. To further recognize the fine-grained unseen classes, we design a visual-semantic embedding branch that aligns visual and semantic information for unseen-class recognition. Through multi-task learning of the prototype learning branch and the visual-semantic embedding branch, our model achieves excellent performance on three popular GZSL datasets.
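The two-branch idea in the abstract can be illustrated with a minimal sketch. The abstract does not state how the prototype branch detects novel images or how the embedding branch scores classes, so everything below is an assumption made for illustration: cosine similarity as the score, a fixed `threshold` as the domain detector, and the names `predict`, `seen_prototypes`, `unseen_attributes`, and `project` are all hypothetical, not the authors' implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def predict(feature, seen_prototypes, unseen_attributes, project, threshold=0.8):
    """Route one visual feature to a seen-class or unseen-class decision.

    Assumed rule (not from the paper): if the best similarity to a seen-class
    prototype reaches `threshold`, accept that seen class; otherwise treat the
    image as novel and match its projected feature to unseen-class attributes.
    """
    # Prototype branch: find the closest seen-class prototype.
    seen_label, seen_score = max(
        ((label, cosine(feature, proto)) for label, proto in seen_prototypes.items()),
        key=lambda pair: pair[1],
    )
    if seen_score >= threshold:
        return seen_label, "seen"

    # Embedding branch: map the feature into attribute space and pick the
    # unseen class whose attribute vector is most similar.
    embedded = project(feature)
    unseen_label = max(
        unseen_attributes,
        key=lambda label: cosine(embedded, unseen_attributes[label]),
    )
    return unseen_label, "unseen"


# Toy data: 3-d visual features, 2-d attribute vectors (all values made up).
seen_prototypes = {"horse": [1.0, 0.0, 0.0], "tiger": [0.0, 1.0, 0.0]}
unseen_attributes = {"zebra": [1.0, 1.0], "whale": [0.0, 1.0]}


def project(feature):
    """Hypothetical stand-in for a learned visual-to-semantic mapping."""
    return [feature[0] + feature[1], feature[2]]


near_seen = predict([0.9, 0.1, 0.0], seen_prototypes, unseen_attributes, project)
novel = predict([0.5, 0.5, 0.7], seen_prototypes, unseen_attributes, project)
print(near_seen, novel)
```

In this toy setup the first feature lies close to the `horse` prototype and stays in the seen branch, while the second is far from every prototype and is handed to the embedding branch. Separating the decision this way is one plausible reading of how splitting GZSL into two sub-problems could reduce the bias toward seen classes.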
Pages: 3180-3191
Page count: 12