Domain-Aware Prototype Network for Generalized Zero-Shot Learning

被引：3

作者：

Hu, Yongli ^{[1
]}

Feng, Lincong ^{[1
]}

Jiang, Huajie ^{[1
]}

Liu, Mengting ^{[1
]}

Yin, Baocai ^{[1
]}

机构：

[1] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, Fac Informat Technol, Beijing 100124, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 05期

关键词：

Visualization; Prototypes; Semantics; Transformers; Image recognition; Feature extraction; Task analysis; Generalized zero-shot learning; transformer-based dual attention; domain detection;

D O I：

10.1109/TCSVT.2023.3313727

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Generalized zero-shot learning(GZSL) aims to recognize images from seen and unseen classes with side information, such as manually annotated attribute vectors. Traditional methods focus on mapping images and semantics into a common latent space, thus achieving the visual-semantics alignment. Since the unseen classes are unavailable during training, there is a serious problem of recognition bias, which will tend to recognize unseen classes as seen classes. To solve this problem, we propose a Domain-aware Prototype Network(DPN), which splits the GZSL problem into the seen class recognition and unseen class recognition problem. For the seen classes, we design a domain-aware prototype learning branch with a dual attention feature encoder to capture the essential visual information, which aims to recognize the seen classes and discriminate the novel categories. To further recognize the fine-grained unseen classes, a visual-semantic embedding branch is designed, which aims to align the visual and semantic information for unseen-class recognition. Through the multi-task learning of the prototype learning branch and visual-semantic embedding branch, our model can achieve excellent performance on three popular GZSL datasets.

引用

页码：3180 / 3191

页数：12

共 66 条

[1]

Arik SO, 2020, J MACH LEARN RES, V21

[2] Adaptive Confidence Smoothing for Generalized Zero-Shot Learning [J].

Atzmon, Yuval ;

Chechik, Gal .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11663-11672

[3] Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning [J].

Changpinyo, Soravit ;

Chao, Wei-Lun ;

Sha, Fei .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3496-3505

[4]

Chen CF, 2019, ADV NEUR IN, V32

[5] Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Networks [J].

Chen, Long ;

Zhang, Hanwang ;

Xiao, Jun ;

Liu, Wei ;

Chang, Shih-Fu .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1043-1052

[6]

Chen M., P ICML 2020, P1691

[7]

Chen SM, 2022, AAAI CONF ARTIF INTE, P330

[8] MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning [J].

Chen, Shiming ;

Hong, Ziming ;

Xie, Guo-Sen ;

Yang, Wenhan ;

Peng, Qinmu ;

Wang, Kai ;

Zhao, Jian ;

You, Xinge .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :7602-7611

[9] FREE: Feature Refinement for Generalized Zero-Shot Learning [J].

Chen, Shiming ;

Wang, Wenjie ;

Xia, Beihao ;

Peng, Qinmu ;

You, Xinge ;

Zheng, Feng ;

Shao, Ling .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :122-131

[10]

Chen SZ, 2021, ADV NEUR IN, V34

← 1 2 3 4 5 6 7 →