Multi-Modality Adversarial Auto-Encoder for Zero-Shot Learning

被引:3
作者
Ji, Zhong [1 ]
Dai, Guangwen [1 ]
Yu, Yunlong [2 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot learning; adversarial network; auto-encoder; image recognition;
D O I
10.1109/ACCESS.2019.2962298
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The existing generative Zero-Shot Learning (ZSL) methods only consider the unidirectional alignment from the class semantics to the visual features while ignoring the alignment from the visual features to the class semantics, which fails to construct the visual-semantic interactions well. In this paper, we propose to generate visual features based on an auto-encoder framework paired with multi-modality adversarial networks respectively for visual and semantic modalities to reinforce the visual-semantic interactions with a bidirectional alignment, which ensures the generated visual features to fit the real visual distribution and to be highly related to the semantics. The encoder aims at generating real-like visual features while the decoder forces both the real and the generated visual features to be more related to the class semantics. To further capture the discriminative information of the generated visual features, both the real and generated visual features are forced to be classified into the correct classes via a classification network. Experimental results on four benchmark datasets show that the proposed approach is particularly competitive on both the traditional ZSL and the generalized ZSL tasks.
引用
收藏
页码:9287 / 9295
页数:9
相关论文
共 53 条
[41]   A Simple Exponential Family Framework for Zero-Shot Learning [J].
Verma, Vinay Kumar ;
Rai, Piyush .
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT II, 2017, 10535 :792-808
[42]  
Wang WL, 2018, AAAI CONF ARTIF INTE, P4211
[43]   Zero-Shot Learning - The Good, the Bad and the Ugly [J].
Xian, Yongqin ;
Schiele, Bernt ;
Akata, Zeynep .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3077-3086
[44]   Latent Embeddings for Zero-shot Classification [J].
Xian, Yongqin ;
Akata, Zeynep ;
Sharma, Gaurav ;
Nguyen, Quynh ;
Hein, Matthias ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :69-77
[45]  
Xiao-Yu Z., 2019, P IEEE INT C SIGN IN, P1
[46]  
Yu YL, 2018, ADV NEUR IN, V31
[47]   Transductive Zero-Shot Learning With a Self-Training Dictionary Approach [J].
Yu, Yunlong ;
Ji, Zhong ;
Li, Xi ;
Guo, Jichang ;
Zhang, Zhongfei ;
Ling, Haibin ;
Wu, Fei .
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (10) :2908-2919
[48]   Zero-Shot Kernel Learning [J].
Zhang, Hongguang ;
Koniusz, Piotr .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7670-7679
[49]   Learning a Deep Embedding Model for Zero-Shot Learning [J].
Zhang, Li ;
Xiang, Tao ;
Gong, Shaogang .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3010-3019
[50]   AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization [J].
Zhang, Xiao-Yu ;
Li, Changsheng ;
Shi, Haichao ;
Zhu, Xiaobin ;
Li, Peng ;
Dong, Jing .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (04) :1852-1863