Stack-VAE Network for Zero-Shot Learning

被引:1
作者
Xie, Jinghao [1 ]
Wu, Jigang [1 ]
Liang, Tianyou [1 ]
Meng, Min [1 ]
机构
[1] Guangdong Univ Technol, Guangzhou 510006, Peoples R China
来源
NEURAL INFORMATION PROCESSING, ICONIP 2021, PT IV | 2021年 / 13111卷
基金
中国国家自然科学基金;
关键词
Zero-shot learning; Generalized zero-shot learning; Generative model; Object recognition;
D O I
10.1007/978-3-030-92273-3_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning aims to transfer knowledge from the seen classes to unseen ones through some high-level semantics (e.g., per-class attributes), enabling the learning model to recognize novel classes without retraining. Among them, the generative methods adopt the scheme that synthesizes samples for the unseen classes, thereby converting the task into a standard classification problem. However, most existing work inevitably suffers from the domain shift problem when only the seen classes are used for supervision. Furthermore, they can not fully leverage the semantic information in data synthesis due to the limited expressiveness of the generator. In this paper, we develop a novel network, named stack-VAE, to alleviate the above problems. The proposal mainly consists of a generative module and a feature core agent. Specifically, we design the generator based on hierarchical VAE, which exploits multi-layer Gaussian distribution to improve the expressiveness, thereby better mimicking the real data distribution of the unseen classes. Besides, we propose a feature core agent based objective, which is beneficial to mitigate seen class bias by enforcing the inter-class separability and reducing the intra-class scatter. Experimental results conducted on three widely used datasets, i.e., AWA2, SUN, CUB, show that the proposed network outperforms the baselines and achieves a new state-of-the-art.
引用
收藏
页码:250 / 261
页数:12
相关论文
共 33 条
[1]   An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild [J].
Chao, Wei-Lun ;
Changpinyo, Soravit ;
Gong, Boqing ;
Sha, Fei .
COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :52-68
[2]   Multi-modal Cycle-Consistent Generalized Zero-Shot Learning [J].
Felix, Rafael ;
Kumar, B. G. Vijay ;
Reid, Ian ;
Carneiro, Gustavo .
COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 :21-37
[3]  
Frome A., 2013, Proc. Adv. Neural Inf. Process. Syst., P2121
[4]   Transductive Multi-View Zero-Shot Learning [J].
Fu, Yanwei ;
Hospedales, Timothy M. ;
Xiang, Tao ;
Gong, Shaogang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (11) :2332-2345
[5]  
Gao R., 2018, P EUR C COMP VIS ECC, P631
[6]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]  
Huang Yijie, 2020, IEEE INT C MULTIMEDI
[9]   Generalized Zero-Shot Learning Via Over-Complete Distribution [J].
Keshari, Rohit ;
Singh, Richa ;
Vatsa, Mayank .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :13297-13305
[10]  
Kingma D. P., 2013, ARXIV