Generalized Zero-Shot Learning Based on Manifold Alignment

被引:2
作者
Xu, Rui [1 ]
Shao, Shuai [2 ]
Liu, Baodi [1 ]
Liu, Weifeng [1 ]
机构
[1] China Univ Petr East China, Coll Control Sci & Engn, Qingdao, Peoples R China
[2] Zhejiang Lab, Hangzhou, Peoples R China
来源
2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1 | 2022年
基金
中国国家自然科学基金;
关键词
manifold alignment; variational autoencoder; generalized zero-shot learning;
D O I
10.1109/ICSP56322.2022.9965280
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Generalized zero-shot learning is a method that can classify seen and unseen samples by learning training samples' visual and semantic modal information. Visual modal information is generally extracted by feature extraction networks pre-trained with a large-scale data set, and semantic modal information is typically represented by class attributes. Different categories have shared semantic information, therefore, through learning the mapping between two modal information, the transferable knowledge can be used to classify testing samples. However, most methods align the two modal information of the per-sample rather than considering the alignment of the distribution of multiple instances in the two modalities. We utilize variational autoencoders mapping two modalities' information to a shared latent space, then align the samples' manifold structure of them to promote the accuracy of model classification. We evaluate the proposed method on several benchmark datasets (CUB, SUN, and AWA2), and the significant improvements have proved the method's effectiveness.
引用
收藏
页码:202 / 207
页数:6
相关论文
共 28 条
[1]   Label-Embedding for Image Classification [J].
Akata, Zeynep ;
Perronnin, Florent ;
Harchaoui, Zaid ;
Schmid, Cordelia .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (07) :1425-1438
[2]   Synthesized Classifiers for Zero-Shot Learning [J].
Changpinyo, Soravit ;
Chao, Wei-Lun ;
Gong, Boqing ;
Sha, Fei .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5327-5336
[3]   Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Networks [J].
Chen, Long ;
Zhang, Hanwang ;
Xiao, Jun ;
Liu, Wei ;
Chang, Shih-Fu .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1043-1052
[4]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[5]   Multi-modal Cycle-Consistent Generalized Zero-Shot Learning [J].
Felix, Rafael ;
Kumar, B. G. Vijay ;
Reid, Ian ;
Carneiro, Gustavo .
COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 :21-37
[6]  
Frome A., 2013, Advances in neural information processing systems
[7]  
GIVENS CR, 1984, MICH MATH J, V31, P231
[8]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]  
Higgins I., 2017, BETA VAE LEARNING BA, V3, P1
[10]   Meta-Learning in Neural Networks: A Survey [J].
Hospedales, Timothy ;
Antoniou, Antreas ;
Micaelli, Paul ;
Storkey, Amos .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) :5149-5169