CROSS-MODAL REPRESENTATION RECONSTRUCTION FOR ZERO-SHOT CLASSIFICATION

被引:2
作者
Wang, Yu [1 ]
Zhao, Shenjie [2 ]
机构
[1] Minist Educ, Key Lab Embedded Syst & Serv Comp, Ningbo, Peoples R China
[2] Mojie Technol, Ningbo, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年
基金
中国国家自然科学基金;
关键词
Zero-Shot Learning; Representation Reconstruction; Cross-Modal Learning; Knowledge Transfer;
D O I
10.1109/ICASSP39728.2021.9413572
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Zero-shot learning (ZSL) aims to recognize novel classes without training samples through transferring knowledge from seen classes, based on the assumption that both the seen and unseen classes share a latent semantic space. Previous works either focus on directly learning various mapping functions between visual space and semantic space, or searching a latent common subspace to alleviate semantic gap between different modalities. However, few methods directly learn modality-invariant representations for ZSL. In this paper, we propose a Cross-Modal Representation Reconstruction (CMRR) framework to bridge the semantic gap between visual features and semantic attributes, as well as introducing a novel regularizer for automatically feature selection. Moreover, an iterative optimization process based on the ALM (Augmented Lagrangian Method) algorithm with the alternating direction strategy is developed to solve the proposed formulation. Extensive experiments on four benchmark datasets show the effectiveness of the proposed approach, and even the performance surpasses some deep learning based methods.
引用
收藏
页码:2820 / 2824
页数:5
相关论文
共 30 条
[1]  
Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911
[2]   Preserving Semantic Relations for Zero-Shot Learning [J].
Annadani, Yashas ;
Biswas, Soma .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7603-7612
[3]  
[Anonymous], 2014, ICLR
[4]   Synthesized Classifiers for Zero-Shot Learning [J].
Changpinyo, Soravit ;
Chao, Wei-Lun ;
Gong, Boqing ;
Sha, Fei .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5327-5336
[5]  
Chen H, 2019, INT CONF ACOUST SPEE, P3697, DOI 10.1109/ICASSP.2019.8683427
[6]   Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Networks [J].
Chen, Long ;
Zhang, Hanwang ;
Xiao, Jun ;
Liu, Wei ;
Chang, Shih-Fu .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1043-1052
[7]   Fine-Grained Generalized Zero-Shot Learning via Dense Attribute-Based Attention [J].
Dat Huynh ;
Elhamifar, Ehsan .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4482-4492
[8]  
Farhadi A, 2009, PROC CVPR IEEE, P1778, DOI 10.1109/CVPRW.2009.5206772
[9]  
Frome A., 2013, Proc. Adv. Neural Inf. Process. Syst., P2121
[10]  
He R, 2012, PROC CVPR IEEE, P2504, DOI 10.1109/CVPR.2012.6247966