Deep Semantic Space with Intra-class Low-rank Constraint for Cross-modal Retrieval

Cited by: 9

Authors:

Kang, Peipei ^{[1]}

Lin, Zehang ^{[1]}

Yang, Zhenguo ^{[1,2]}

Fang, Xiaozhao ^{[3]}

Li, Qing ^{[4]}

Liu, Wenyin ^{[1]}

Affiliations:

[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Guangdong, Peoples R China

[2] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

[3] Guangdong Univ Technol, Dept Automat, Guangzhou, Guangdong, Peoples R China

[4] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China

Source:

ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL | 2019

Funding:

National Natural Science Foundation of China;

Keywords:

cross-modal retrieval; deep neural networks; intra-class low-rank; semantic space;

DOI:

10.1145/3323873.3325029

Chinese Library Classification number:

TP31 [Computer Software];

Discipline classification codes:

081202 ; 0835 ;

Abstract:

In this paper, a novel Deep Semantic Space learning model with Intra-class Low-rank constraint (DSSIL) is proposed for cross-modal retrieval. It is composed of two subnetworks for modality-specific representation learning, followed by projection layers that map both modalities into a common space. In particular, DSSIL takes semantic consistency into account to fuse the cross-modal data in a high-level common space, and constrains the common representation matrix of each class to be low-rank, so that representations within the same class become more closely related. More formally, two regularization terms are devised for these two aspects and incorporated into the objective of DSSIL. To optimize the modality-specific subnetworks and the projection layers simultaneously by applying gradient descent directly, we approximate the nonconvex low-rank constraint by minimizing a few of the smallest singular values of the intra-class matrix, an approximation supported by theoretical analysis. Extensive experiments conducted on three public datasets demonstrate that DSSIL is competitive with, and superior to, state-of-the-art methods for cross-modal retrieval.
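The low-rank approximation described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `intra_class_low_rank_penalty`, the parameter `k` (how many of the smallest singular values to sum), and the choice to sum the penalty over classes are all assumptions made for illustration. The abstract only states that a few of the smallest singular values of the intra-class matrix are minimized.

```python
import numpy as np


def intra_class_low_rank_penalty(features, labels, k=2):
    """Hypothetical surrogate for an intra-class low-rank constraint.

    For each class, stack that class's common-space representations into a
    matrix and sum its k smallest singular values. Driving this sum toward
    zero pushes each class matrix toward lower rank.

    features: array of shape (n_samples, dim)
    labels:   integer array of shape (n_samples,)
    k:        number of smallest singular values to penalize (assumed, k >= 1)
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    total = 0.0
    for c in np.unique(labels):
        class_matrix = features[labels == c]  # shape (n_c, dim)
        # Singular values are returned in descending order.
        singular_values = np.linalg.svd(class_matrix, compute_uv=False)
        total += singular_values[-k:].sum()
    return float(total)


# A rank-1 class matrix incurs (numerically) no penalty,
# while a random full-rank one does.
rank_one = np.outer(np.ones(4), np.array([1.0, 2.0, 3.0]))
random_full = np.random.default_rng(0).normal(size=(4, 3))
same_class = np.zeros(4, dtype=int)
print(intra_class_low_rank_penalty(rank_one, same_class, k=2))
print(intra_class_low_rank_penalty(random_full, same_class, k=2))
```

In a trainable model the same quantity would be computed with an automatic-differentiation framework, so that gradients flow back through the singular values to the subnetworks and projection layers. The NumPy version here only evaluates the penalty.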


Pages: 226-234

Page count: 9
