Towards learning a semantic-consistent subspace for cross-modal retrieval

被引：5

作者：

Xu, Meixiang ^{[1
,2
]}

Zhu, Zhenfeng ^{[1
,2
]}

Zhao, Yao ^{[1
,2
]}

机构：

[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China

[2] Beijing Key Lab Adv Informat Sci & Network Techno, Beijing 100044, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2019年 / 78卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Cross-modal; Semantic-correlation; Subspace learning; Multi-label;

D O I：

10.1007/s11042-018-6578-0

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A great many of approaches have been developed for cross-modal retrieval, among which subspace learning based ones dominate the landscape. Concerning whether using the semantic label information or not, subspace learning based approaches can be categorized into two paradigms, unsupervised and supervised. However, for multi-label cross-modal retrieval, supervised approaches just simply exploit multi-label information towards a discriminative subspace, without considering the correlations between multiple labels shared by multi-modalities, which often leads to an unsatisfactory retrieval performance. To address this issue, in this paper we propose a general framework, which jointly incorporates semantic correlations into subspace learning for multi-label cross-modal retrieval. By introducing the HSIC-based regularization term, the correlation information among multiple labels can be not only leveraged but also the consistency between the modality similarity from each modality is well preserved. Besides, based on the semantic-consistency projection, the semantic gap between the low-level feature space of each modality and the shared high-level semantic space can be balanced by a mid-level consistent one, where multi-label cross-modal retrieval can be performed effectively and efficiently. To solve the optimization problem, an effective iterative algorithm is designed, along with its convergence analysis theoretically and experimentally. Experimental results on real-world datasets have shown the superiority of the proposed method over several existing cross-modal subspace learning methods.

引用

页码：389 / 412

页数：24

共 50 条

[1] Towards learning a semantic-consistent subspace for cross-modal retrieval
Meixiang Xu
Zhenfeng Zhu
Yao Zhao
Multimedia Tools and Applications, 2019, 78 : 389 - 412
[2] Semantic-Consistent and Multilayer Similarity Based Cross-Modal Hashing Retrieval
Liu, Yuanyuan
Wang, Xiaoyan
Zhang, Yuxin
Zhu, Lu
Data Analysis and Knowledge Discovery, 2024, 8 (07) : 89 - 102
[3] Semantic-consistent cross-modal hashing for large-scale image retrieval
Gu, Xuesong
Dong, Guohua
Zhang, Xiang
Lan, Long
Luo, Zhigang
NEUROCOMPUTING, 2021, 433 : 181 - 198
[4] Semantic consistent adversarial cross-modal retrieval exploiting semantic similarity
Weihua Ou
Ruisheng Xuan
Jianping Gou
Quan Zhou
Yongfeng Cao
Multimedia Tools and Applications, 2020, 79 : 14733 - 14750
[5] Semantic consistent adversarial cross-modal retrieval exploiting semantic similarity
Ou, Weihua
Xuan, Ruisheng
Gou, Jianping
Zhou, Quan
Cao, Yongfeng
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 14733 - 14750
[6] Domain Invariant Subspace Learning for Cross-Modal Retrieval
Liu, Chenlu
Xu, Xing
Yang, Yang
Lu, Huimin
Shen, Fumin
Ji, Yanli
MULTIMEDIA MODELING, MMM 2018, PT II, 2018, 10705 : 94 - 105
[7] Combination subspace graph learning for cross-modal retrieval
Xu, Gongwen
Li, Xiaomei
Shi, Lin
Zhang, Zhijun
Zhai, Aidong
ALEXANDRIA ENGINEERING JOURNAL, 2020, 59 (03) : 1333 - 1343
[8] Joint Dictionary Learning and Semantic Constrained Latent Subspace Projection for Cross-Modal Retrieval
Wu, Jianlong
Lin, Zhouchen
Zha, Hongbin
CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1663 - 1666
[9] Semantic supervised learning based Cross-Modal Retrieval
Li, Zhuoyi
Fu, Hao
Gu, Guanghua
PROCEEDINGS OF THE ACM TURING AWARD CELEBRATION CONFERENCE-CHINA 2024, ACM-TURC 2024, 2024, : 207 - 209
[10] Joint Latent Subspace Learning and Regression for Cross-Modal Retrieval
Wu, Jianlong
Lin, Zhouchen
Zha, Hongbin
SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 917 - 920

← 1 2 3 4 5 →