Manifold regularized cross-modal embedding for zero-shot learning

Cited by: 32
Authors
Ji, Zhong [1]
Yu, Yunlong [1]
Pang, Yanwei [1]
Guo, Jichang [1]
Zhang, Zhongfei [2]
Affiliations
[1] Tianjin Univ, Sch Elect Informat Engn, Tianjin 300072, Peoples R China
[2] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
Funding
National Natural Science Foundation of China
Keywords
Zero-shot learning; Image classification; Cross-modal embedding; Manifold; Domain adaptation; Recognition
DOI
10.1016/j.ins.2016.10.025
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Zero-Shot Learning (ZSL) aims to classify samples from previously unseen classes and has gained popularity in applications where training samples of some categories are scarce. The basic idea is to transfer knowledge from the seen classes to the unseen classes by mapping visual features to an embedding space spanned by class semantic information, which can be obtained from human-labeled attributes or, in an unsupervised fashion, from text corpora. The embedding function from the visual space to the embedding space is therefore extremely important. However, existing embedding approaches to ZSL mainly focus on aligning pairwise semantic consistency across heterogeneous spaces and ignore the intrinsic structure of the locally homogeneous isomorph. To preserve the local visual structure during embedding, this paper proposes a Manifold regularized Cross-Modal Embedding (MCME) approach for ZSL that formulates a manifold constraint on the intrinsic structure of the visual features while also aligning pairwise consistency. Its linear, closed-form solution makes MCME efficient to compute. Furthermore, rather than directly applying the embedding function learned from the seen classes, we also propose a new domain adaptation strategy to overcome the domain-shift problem during knowledge transfer. MCME with this domain adaptation method is called MCME-DA. Extensive experiments on the benchmark datasets AwA and CUB validate the superiority and promise of MCME and MCME-DA. (C) 2016 Elsevier Inc. All rights reserved.
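The abstract does not state the MCME objective, so the following is only an illustrative sketch of what a linear, closed-form, manifold-regularized embedding could look like. It assumes a Laplacian-regularized ridge regression, min_W ||XW - S||^2 + lam ||W||^2 + gamma tr(W^T X^T L X W), with L the graph Laplacian of a k-nearest-neighbour graph over the visual features. The function names, the parameters lam, gamma and k, and the cosine-similarity prediction rule are all assumptions made for illustration, not the authors' formulation, and the domain adaptation step of MCME-DA is not covered.

```python
import numpy as np

def knn_laplacian(X, k=5):
    """Unnormalized Laplacian L = D - A of a symmetrized k-NN graph (assumed graph choice)."""
    n = X.shape[0]
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T   # squared Euclidean distances
    np.fill_diagonal(d2, np.inf)                     # a point is not its own neighbour
    idx = np.argsort(d2, axis=1)[:, :k]              # k nearest neighbours per row
    A = np.zeros((n, n))
    A[np.repeat(np.arange(n), k), idx.ravel()] = 1.0
    A = np.maximum(A, A.T)                           # symmetrize the adjacency matrix
    return np.diag(A.sum(axis=1)) - A

def fit_embedding(X, S, lam=1.0, gamma=0.1, k=5):
    """Closed-form W for the assumed objective.

    X: (n, d) visual features of seen-class samples.
    S: (n, m) semantic vector of each sample's class.
    """
    L = knn_laplacian(X, k)
    d = X.shape[1]
    G = X.T @ X + lam * np.eye(d) + gamma * X.T @ L @ X
    return np.linalg.solve(G, X.T @ S)               # solves G W = X^T S

def predict(X_test, W, prototypes):
    """Assign each test sample to the unseen-class prototype with highest cosine similarity."""
    Z = X_test @ W
    Z = Z / (np.linalg.norm(Z, axis=1, keepdims=True) + 1e-12)
    P = prototypes / (np.linalg.norm(prototypes, axis=1, keepdims=True) + 1e-12)
    return np.argmax(Z @ P.T, axis=1)
```

Because the assumed objective is quadratic in W, setting its gradient to zero gives a single linear system, which is one way a "linear, closed-form solution" can arise; the Laplacian term penalizes mapping visually neighbouring samples to distant points in the semantic space.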
Pages: 48-58
Number of pages: 11