Manifold regularized cross-modal embedding for zero-shot learning

被引：32

作者：

Ji, Zhong ^{[1
]}

Yu, Yunlong ^{[1
]}

Pang, Yanwei ^{[1
]}

Guo, Jichang ^{[1
]}

Zhang, Zhongfei ^{[2
]}

机构：

[1] Tianjin Univ, Sch Elect Informat Engn, Tianjin 300072, Peoples R China

[2] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA

来源：

INFORMATION SCIENCES | 2017年 / 378卷

基金：

中国国家自然科学基金;

关键词：

Zero-shot learning; Image classification; Cross-modal embedding; Manifold; Domain adaptation; RECOGNITION;

D O I：

10.1016/j.ins.2016.10.025

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Zero-Shot Learning (ZSL) aims at classifying previously unseen class samples and has gained its popularity in applications where samples of some categories are scarce for training. The basic idea to address this issue is transferring knowledge from the seen classes to the unseen classes through mapping the visual feature to an embedding space spanned by class semantic information. The class semantic information can be obtained from human labeled attributes or text corpus in an unsupervised fashion. Therefore, the embedding function from visual space to the embedding space is extremely important. However, the existing embedding approaches to ZSL mainly focus on aligning pairwise semantic consistency from heterogeneous spaces but ignore the intrinsic structure of the locally homogeneous isomorph. In order to preserve the locally visual structure in the embedding process, this paper proposes a Manifold regularized Cross-Modal Embedding (MCME) approach for ZSL by formulating the manifold constraint for intrinsic structure of the visual features as well as aligning pairwise consistency. The linear, closed-form solution makes MCME efficient to compute. Furthermore, rather than applying the embedding function learned from the seen classes directly, we also propose a new domain adaptation strategy to overcome the domain-shift problem during the knowledge transfer process. The MCME with the domain adaptation method is called MCME-DA. Extensive experiments on the benchmark datasets of AwA and CUB validate the superiority and promise of MCME and MCME-DA. (C) 2016 Elsevier Inc. All rights reserved.

引用

页码：48 / 58

页数：11

共 50 条

[21] Semantic-Adversarial Graph Convolutional Network for Zero-Shot Cross-Modal Retrieval
Li, Chuang
Fei, Lunke
Kang, Peipei
Liang, Jiahao
Fang, Xiaozhao
Teng, Shaohua
PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2022, 13630 : 459 - 472
[22] CROSS-MODAL ALIGNMENT OF LOCAL AND GLOBAL FEATURES FOR ZERO-SHOT CHINESE CHARACTER RECOGNITION
Cai, Hongyi
Zhu, Anna
2024 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2024, : 2041 - 2047
[23] INTER-MODALITY FUSION BASED ATTENTION FOR ZERO-SHOT CROSS-MODAL RETRIEVAL
Chakraborty, Bela
Wang, Peng
Wang, Lei
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2648 - 2652
[24] Cross-Modal Zero-Shot-Learning for Tactile Object Recognition
Liu, Huaping
Sun, Fuchun
Fang, Bin
Guo, Di
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (07): : 2466 - 2474
[25] ClusterE-ZSL: A Novel Cluster-Based Embedding for Enhanced Zero-Shot Learning in Contrastive Pre-Training Cross-Modal Retrieval
Tariq, Umair
Hu, Zonghai
Tasneem, Khawaja Tauseef
Bin Heyat, Md Belal
Iqbal, Muhammad Shahid
Aziz, Kamran
IEEE ACCESS, 2024, 12 : 162622 - 162637
[26] Zero-shot Learning using Graph Regularized Latent Discriminative Cross-domain Triplets
Gune, Omkar
Vora, Meet
Banerjee, Biplab
Chaudhuri, Subhasis
ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018), 2018,
[27] Domain-Oriented Semantic Embedding for Zero-Shot Learning
Min, Shaobo
Yao, Hantao
Xie, Hongtao
Zha, Zheng-Jun
Zhang, Yongdong
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3919 - 3930
[28] Discriminative Embedding Autoencoder With a Regressor Feedback for Zero-Shot Learning
Shi, Ying
Wei, Wei
IEEE ACCESS, 2020, 8 : 11019 - 11030
[29] Progressive Cross-Modal Semantic Network for Zero-Shot Sketch-Based Image Retrieval
Deng, Cheng
Xu, Xinxun
Wang, Hao
Yang, Muli
Tao, Dacheng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8892 - 8902
[30] Cross-modal Self-distillation for Zero-shot Sketch-based Image Retrieval
Tian J.-L.
Xu X.
Shen F.-M.
Shen H.-T.
Ruan Jian Xue Bao/Journal of Software, 2022, 33 (09):

← 1 2 3 4 5 →