Multi-Modality Adversarial Auto-Encoder for Zero-Shot Learning

被引：3

作者：

Ji, Zhong ^{[1
]}

Dai, Guangwen ^{[1
]}

Yu, Yunlong ^{[2
]}

机构：

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China

来源：

IEEE ACCESS | 2020年 / 8卷

基金：

中国国家自然科学基金;

关键词：

Zero-shot learning; adversarial network; auto-encoder; image recognition;

D O I：

10.1109/ACCESS.2019.2962298

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The existing generative Zero-Shot Learning (ZSL) methods only consider the unidirectional alignment from the class semantics to the visual features while ignoring the alignment from the visual features to the class semantics, which fails to construct the visual-semantic interactions well. In this paper, we propose to generate visual features based on an auto-encoder framework paired with multi-modality adversarial networks respectively for visual and semantic modalities to reinforce the visual-semantic interactions with a bidirectional alignment, which ensures the generated visual features to fit the real visual distribution and to be highly related to the semantics. The encoder aims at generating real-like visual features while the decoder forces both the real and the generated visual features to be more related to the class semantics. To further capture the discriminative information of the generated visual features, both the real and generated visual features are forced to be classified into the correct classes via a classification network. Experimental results on four benchmark datasets show that the proposed approach is particularly competitive on both the traditional ZSL and the generalized ZSL tasks.

引用

页码：9287 / 9295

页数：9

共 53 条

[1] Label-Embedding for Image Classification [J].

Akata, Zeynep ;

Perronnin, Florent ;

Harchaoui, Zaid ;

Schmid, Cordelia .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (07) :1425-1438

[2]

Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911

[3] Preserving Semantic Relations for Zero-Shot Learning [J].

Annadani, Yashas ;

Biswas, Soma .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7603-7612

[4]

[Anonymous], P COMP VIS PATT REC

[5]

[Anonymous], ARXIV14126568

[6]

[Anonymous], IEEE ACCESS

[7]

[Anonymous], 2013, NIPS

[8]

[Anonymous], 2013, NIPS

[9]

[Anonymous], P COMP VIS PATT REC

[10]

[Anonymous], ARXIV13125650

← 1 2 3 4 5 6 →