Deep Semantic Space with Intra-class Low-rank Constraint for Cross-modal Retrieval

被引：9

作者：

Kang, Peipei ^{[1
]}

Lin, Zehang ^{[1
]}

Yang, Zhenguo ^{[1
,2
]}

Fang, Xiaozhao ^{[3
]}

Li, Qing ^{[4
]}

Liu, Wenyin ^{[1
]}

机构：

[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Guangdong, Peoples R China

[2] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

[3] Guangdong Univ Technol, Dept Automat, Guangzhou, Guangdong, Peoples R China

[4] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China

来源：

ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL | 2019年

基金：

中国国家自然科学基金;

关键词：

cross-modal retrieval; deep neural networks; intra-class low-rank; semantic space;

D O I：

10.1145/3323873.3325029

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this paper, a novel Deep Semantic Space learning model with Intra-class Low-rank constraint (DSSIL) is proposed for cross-modal retrieval, which is composed of two subnetworks for modality-specific representation learning, followed by projection layers for common space mapping. In particular, DSSIL takes into account semantic consistency to fuse the cross-modal data in a high-level common space, and constrains the common representation matrix within the same class to be low-rank, in order to induce the intra-class representations more relevant. More formally, two regularization terms are devised for the two aspects, which have been incorporated into the objective of DSSIL. To optimize the modality-specific subnetworks and the projection layers simultaneously by exploiting the gradient decent directly, we approximate the nonconvex low-rank constraint by minimizing a few smallest singular values of the intra-class matrix with theoretical analysis. Extensive experiments conducted on three public datasets demonstrate the competitive superiority of DSSIL for cross-modal retrieval compared with the state-of-the-art methods.

引用

页码：226 / 234

页数：9

共 50 条

[11] Learning to rank with relational graph and pointwise constraint for cross-modal retrieval
Qingzhen Xu
Miao Li
Mengjing Yu
Soft Computing, 2019, 23 : 9413 - 9427
[12] ONLINE LOW-RANK SIMILARITY FUNCTION LEARNING WITH ADAPTIVE RELATIVE MARGIN FOR CROSS-MODAL RETRIEVAL
Wu, Yiling
Wang, Shuhui
Zhang, Weigang
Huang, Qingming
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 823 - 828
[13] Deep semantic hashing with dual attention for cross-modal retrieval
Jiagao Wu
Weiwei Weng
Junxia Fu
Linfeng Liu
Bin Hu
Neural Computing and Applications, 2022, 34 : 5397 - 5416
[14] Deep semantic similarity adversarial hashing for cross-modal retrieval
Qiang, Haopeng
Wan, Yuan
Xiang, Lun
Meng, Xiaojing
NEUROCOMPUTING, 2020, 400 : 24 - 33
[15] Semantic decomposition and enhancement hashing for deep cross-modal retrieval
Fei, Lunke
He, Zhihao
Wong, Wai Keung
Zhu, Qi
Zhao, Shuping
Wen, Jie
PATTERN RECOGNITION, 2025, 160
[16] Deep supervised multimodal semantic autoencoder for cross-modal retrieval
Tian, Yu
Yang, Wenjing
Liu, Qingsong
Yang, Qiong
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2020, 31 (4-5)
[17] Deep semantic hashing with dual attention for cross-modal retrieval
Wu, Jiagao
Weng, Weiwei
Fu, Junxia
Liu, Linfeng
Hu, Bin
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (07): : 5397 - 5416
[18] Deep Semantic Correlation with Adversarial Learning for Cross-Modal Retrieval
Hua, Yan
Du, Jianhe
PROCEEDINGS OF 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2019), 2019, : 252 - 255
[19] Deep Visual-Semantic Hashing for Cross-Modal Retrieval
Cao, Yue
Long, Mingsheng
Wang, Jianmin
Yang, Qiang
Yu, Philip S.
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1445 - 1454
[20] Deep Multi-Level Semantic Hashing for Cross-Modal Retrieval
Ji, Zhenyan
Yao, Weina
Wei, Wei
Song, Houbing
Pi, Huaiyu
IEEE ACCESS, 2019, 7 : 23667 - 23674

← 1 2 3 4 5 →