Towards learning a semantic-consistent subspace for cross-modal retrieval

Cited by: 5
Authors
Xu, Meixiang [1, 2]
Zhu, Zhenfeng [1, 2]
Zhao, Yao [1, 2]
Affiliations
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[2] Beijing Key Lab Adv Informat Sci & Network Techno, Beijing 100044, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Cross-modal; Semantic-correlation; Subspace learning; Multi-label;
DOI
10.1007/s11042-018-6578-0
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
A great many approaches have been developed for cross-modal retrieval, among which subspace learning based ones dominate the landscape. Depending on whether they use semantic label information, subspace learning based approaches fall into two paradigms: unsupervised and supervised. However, for multi-label cross-modal retrieval, supervised approaches simply exploit multi-label information to learn a discriminative subspace, without considering the correlations between the multiple labels shared across modalities, which often leads to unsatisfactory retrieval performance. To address this issue, we propose a general framework that jointly incorporates semantic correlations into subspace learning for multi-label cross-modal retrieval. By introducing an HSIC-based regularization term, not only is the correlation information among multiple labels leveraged, but the consistency between the similarities from each modality is also well preserved. In addition, based on the semantic-consistency projection, the semantic gap between the low-level feature space of each modality and the shared high-level semantic space is bridged by a mid-level consistent space, in which multi-label cross-modal retrieval can be performed effectively and efficiently. To solve the optimization problem, an effective iterative algorithm is designed, and its convergence is analyzed both theoretically and experimentally. Experimental results on real-world datasets show the superiority of the proposed method over several existing cross-modal subspace learning methods.
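The abstract names an HSIC-based regularization term but does not give its form. As background only, the following is a minimal sketch of the standard biased empirical estimator of the Hilbert-Schmidt Independence Criterion, HSIC(K, L) = tr(KHLH) / (n - 1)^2, where H is the centering matrix. It is not the authors' regularizer: the linear kernel, the function names `linear_kernel` and `hsic`, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np


def linear_kernel(X):
    """Gram matrix of a linear kernel (an assumed kernel choice)."""
    return X @ X.T


def hsic(K, L):
    """Biased empirical HSIC between two n x n Gram matrices."""
    n = K.shape[0]
    # Centering matrix H = I - (1/n) * 1 1^T
    H = np.eye(n) - np.ones((n, n)) / n
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2


# Synthetic illustration: 50 samples with 5 binary labels each (hypothetical data).
rng = np.random.default_rng(0)
labels = rng.integers(0, 2, size=(50, 5)).astype(float)

# Features that are a linear function of the labels, and features that are not.
dependent = labels @ rng.normal(size=(5, 3))
independent = rng.normal(size=(50, 3))

hsic_dep = hsic(linear_kernel(dependent), linear_kernel(labels))
hsic_ind = hsic(linear_kernel(independent), linear_kernel(labels))
```

In this sketch a larger HSIC value indicates stronger statistical dependence between the projected features and the label matrix, which is the general reason such a term can serve as a regularizer when learning a label-aware subspace.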
Pages: 389 - 412
Number of pages: 24
Related papers
50 in total
  • [1] Towards learning a semantic-consistent subspace for cross-modal retrieval
    Meixiang Xu
    Zhenfeng Zhu
    Yao Zhao
    Multimedia Tools and Applications, 2019, 78 : 389 - 412
  • [2] Semantic-Consistent and Multilayer Similarity Based Cross-Modal Hashing Retrieval
    Liu, Yuanyuan
    Wang, Xiaoyan
    Zhang, Yuxin
    Zhu, Lu
    Data Analysis and Knowledge Discovery, 2024, 8 (07) : 89 - 102
  • [3] Semantic-consistent cross-modal hashing for large-scale image retrieval
    Gu, Xuesong
    Dong, Guohua
    Zhang, Xiang
    Lan, Long
    Luo, Zhigang
    NEUROCOMPUTING, 2021, 433 : 181 - 198
  • [4] Semantic consistent adversarial cross-modal retrieval exploiting semantic similarity
    Weihua Ou
    Ruisheng Xuan
    Jianping Gou
    Quan Zhou
    Yongfeng Cao
    Multimedia Tools and Applications, 2020, 79 : 14733 - 14750
  • [5] Semantic consistent adversarial cross-modal retrieval exploiting semantic similarity
    Ou, Weihua
    Xuan, Ruisheng
    Gou, Jianping
    Zhou, Quan
    Cao, Yongfeng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 14733 - 14750
  • [6] Domain Invariant Subspace Learning for Cross-Modal Retrieval
    Liu, Chenlu
    Xu, Xing
    Yang, Yang
    Lu, Huimin
    Shen, Fumin
    Ji, Yanli
    MULTIMEDIA MODELING, MMM 2018, PT II, 2018, 10705 : 94 - 105
  • [7] Combination subspace graph learning for cross-modal retrieval
    Xu, Gongwen
    Li, Xiaomei
    Shi, Lin
    Zhang, Zhijun
    Zhai, Aidong
    ALEXANDRIA ENGINEERING JOURNAL, 2020, 59 (03) : 1333 - 1343
  • [8] Joint Dictionary Learning and Semantic Constrained Latent Subspace Projection for Cross-Modal Retrieval
    Wu, Jianlong
    Lin, Zhouchen
    Zha, Hongbin
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1663 - 1666
  • [9] Semantic supervised learning based Cross-Modal Retrieval
    Li, Zhuoyi
    Fu, Hao
    Gu, Guanghua
    PROCEEDINGS OF THE ACM TURING AWARD CELEBRATION CONFERENCE-CHINA 2024, ACM-TURC 2024, 2024, : 207 - 209
  • [10] Joint Latent Subspace Learning and Regression for Cross-Modal Retrieval
    Wu, Jianlong
    Lin, Zhouchen
    Zha, Hongbin
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 917 - 920