Cross-modal distribution alignment embedding network for generalized zero-shot learning

被引：8

作者：

Li, Qin ^{[1
]}

Hou, Mingzhen ^{[2
]}

Lai, Hong ^{[1
]}

Yang, Ming ^{[3
,4
]}

机构：

[1] Shenzhen Inst Informat Technol, Sch Software Engn, Shenzhen 518172, Peoples R China

[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xi'an 710071, Shaanxi, Peoples R China

[3] Westfield State Univ, Dept Math, Westfield, MA 01086 USA

[4] Westfield State Univ, Dept Comp & Informat Sci, Westfield, MA 01086 USA

来源：

NEURAL NETWORKS | 2022年 / 148卷

关键词：

Generalized zero-shot learning; Weakly-supervised learning; Image classification;

D O I：

10.1016/j.neunet.2022.01.007

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many approaches in generalized zero-shot learning (GZSL) rely on cross-modal mapping between the image feature space and the class embedding space, which achieves knowledge transfer from seen to unseen classes. However, these two spaces are completely different space and their manifolds are inconsistent, the existing methods suffer from highly overlapped semantic description of different classes, as in GZSL tasks unseen classes can be easily misclassified into seen classes. To handle these problems, we adopt a novel semantic embedding network which helps to encode more discriminative information from initial semantic attributes to semantic embeddings in visual space. Meanwhile, a distribution alignment constraint is adopted to help keep the distribution of the learned semantic embeddings consistent with the distribution of real image features. Moreover, an auxiliary classifier is adopted to strengthen the quality of the learned semantic embeddings. Finally, a relation network is used to classify the unseen images by computing the relation scores between the semantic embeddings and image features, which is much more flexible than the fixed distance metric functions. Experimental results demonstrate that our proposed method is superior to other state-of-the-arts. (C)& nbsp;2022 Published by Elsevier Ltd.

引用

页码：176 / 182

页数：7

共 50 条

[1] Cross-modal propagation network for generalized zero-shot learning
Guo, Ting
Liang, Jianqing
Liang, Jiye
Xie, Guo-Sen
PATTERN RECOGNITION LETTERS, 2022, 159 : 125 - 131
[2] Manifold regularized cross-modal embedding for zero-shot learning
Ji, Zhong
Yu, Yunlong
Pang, Yanwei
Guo, Jichang
Zhang, Zhongfei
INFORMATION SCIENCES, 2017, 378 : 48 - 58
[3] Generalized Zero-Shot Cross-Modal Retrieval
Dutta, Titir
Biswas, Soma
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (12) : 5953 - 5962
[4] A Cross-Modal Alignment for Zero-Shot Image Classification
Wu, Lu
Wu, Chenyu
Guo, Han
Zhao, Zhihao
IEEE ACCESS, 2023, 11 : 9067 - 9073
[5] Learning Aligned Cross-Modal Representation for Generalized Zero-Shot Classification
Fang, Zhiyu
Zhu, Xiaobin
Yang, Chun
Han, Zheng
Qin, Jingyan
Yin, Xu-Cheng
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6605 - 6613
[6] Cross-modal Zero-shot Hashing
Liu, Xuanwu
Li, Zhao
Wang, Jun
Yu, Guoxian
Domeniconi, Carlotta
Zhang, Xiangliang
2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 449 - 458
[7] Correlated Features Synthesis and Alignment for Zero-shot Cross-modal Retrieval
Xu, Xing
Lin, Kaiyi
Lu, Huimin
Gao, Lianli
Shen, Heng Tao
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1419 - 1428
[8] Cross-modal Representation Learning for Zero-shot Action Recognition
Lin, Chung-Ching
Lin, Kevin
Wang, Lijuan
Liu, Zicheng
Li, Linjie
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19946 - 19956
[9] Attribute-Guided Network for Cross-Modal Zero-Shot Hashing
Ji, Zhong
Sun, Yuxin
Yu, Yunlong
Pang, Yanwei
Han, Jungong
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (01) : 321 - 330
[10] Learning Deep Cross-Modal Embedding Networks for Zero-Shot Remote Sensing Image Scene Classification
Li, Yansheng
Zhu, Zhihui
Yu, Jin-Gang
Zhang, Yongjun
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (12): : 10590 - 10603

← 1 2 3 4 5 →