Deep Semantic Space with Intra-class Low-rank Constraint for Cross-modal Retrieval

Cited by: 9
Authors
Kang, Peipei [1]
Lin, Zehang [1]
Yang, Zhenguo [1, 2]
Fang, Xiaozhao [3]
Li, Qing [4]
Liu, Wenyin [1]
Affiliations
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Guangdong, Peoples R China
[2] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Guangdong Univ Technol, Dept Automat, Guangzhou, Guangdong, Peoples R China
[4] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
Source
ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL | 2019
Funding
National Natural Science Foundation of China;
Keywords
cross-modal retrieval; deep neural networks; intra-class low-rank; semantic space;
DOI
10.1145/3323873.3325029
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
In this paper, a novel Deep Semantic Space learning model with Intra-class Low-rank constraint (DSSIL) is proposed for cross-modal retrieval. The model is composed of two subnetworks for modality-specific representation learning, followed by projection layers for common space mapping. In particular, DSSIL takes semantic consistency into account to fuse the cross-modal data in a high-level common space, and constrains the common representation matrix within the same class to be low-rank, so that intra-class representations become more closely related. More formally, two regularization terms are devised for these two aspects and incorporated into the objective of DSSIL. To optimize the modality-specific subnetworks and the projection layers simultaneously by applying gradient descent directly, we approximate the nonconvex low-rank constraint by minimizing a few of the smallest singular values of the intra-class matrix, supported by theoretical analysis. Extensive experiments conducted on three public datasets demonstrate the superiority of DSSIL for cross-modal retrieval compared with state-of-the-art methods.
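The low-rank approximation described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give the exact objective, so the function names, the parameter `k` (how many of the smallest singular values are penalized), and the row-per-sample layout of the intra-class matrix are all assumptions made for illustration. The sketch only evaluates the penalty; in a deep model the same quantity would be computed with an autodiff framework so that gradient descent can minimize it.

```python
import numpy as np


def smallest_singular_value_penalty(Z, k):
    """Sum of the k smallest singular values of Z.

    Z is a hypothetical intra-class matrix: one row per sample of a
    single class, one column per dimension of the common space.
    Driving this sum toward zero pushes Z toward lower rank.
    """
    if k <= 0:
        return 0.0
    # Singular values are returned in descending order.
    s = np.linalg.svd(Z, compute_uv=False)
    k = min(k, len(s))
    return float(s[len(s) - k:].sum())


def intra_class_low_rank_loss(features, labels, k):
    """Add up the penalty over the representation matrix of each class.

    `features` stacks the common-space representations of all samples
    (assumed layout); `labels` holds one class label per row.
    """
    total = 0.0
    for c in np.unique(labels):
        Z_c = features[labels == c]
        total += smallest_singular_value_penalty(Z_c, k)
    return total
```

For example, a rank-1 class matrix such as `np.outer([1., 2., 3., 4.], [1., 0., -1.])` incurs a penalty that is zero up to floating-point error for `k = 2`, whereas the 3 x 3 identity matrix, whose singular values are all 1, incurs a penalty of 2.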
Pages: 226 - 234
Page count: 9
Related papers
50 in total
  • [1] Intra-class low-rank regularization for supervised and semi-supervised cross-modal retrieval
    Peipei Kang
    Zehang Lin
    Zhenguo Yang
    Xiaozhao Fang
    Alexander M. Bronstein
    Qing Li
    Wenyin Liu
    Applied Intelligence, 2022, 52 : 33 - 54
  • [2] Intra-class low-rank regularization for supervised and semi-supervised cross-modal retrieval
    Kang, Peipei
    Lin, Zehang
    Yang, Zhenguo
    Fang, Xiaozhao
    Bronstein, Alexander M.
    Li, Qing
    Liu, Wenyin
    APPLIED INTELLIGENCE, 2022, 52 (01) : 33 - 54
  • [3] Supervised Group Sparse Representation via Intra-class Low-Rank Constraint
    Kang, Peipei
    Fang, Xiaozhao
    Zhang, Wei
    Teng, Shaohua
    Fei, Lunke
    Xu, Yong
    Zheng, Yubao
    BIOMETRIC RECOGNITION, CCBR 2018, 2018, 10996 : 206 - 213
  • [4] Reconstruction regularized low-rank subspace learning for cross-modal retrieval
    Wu, Jianlong
    Xie, Xingxu
    Nie, Liqiang
    Lin, Zhouchen
    Zha, Hongbin
    PATTERN RECOGNITION, 2021, 113
  • [5] Deep Semantic Mapping for Cross-Modal Retrieval
    Wang, Cheng
    Yang, Haojin
    Meinel, Christoph
    2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 234 - 241
  • [6] Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval
    Wu, Yiling
    Wang, Shuhui
    Huang, Qingming
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (05) : 1310 - 1322
  • [7] Semantic consistency cross-modal dictionary learning with rank constraint
    Shang, Fei
    Zhang, Huaxiang
    Sun, Jiande
    Liu, Li
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 : 259 - 266
  • [8] DEEP RANK CROSS-MODAL HASHING WITH SEMANTIC CONSISTENT FOR IMAGE-TEXT RETRIEVAL
    Liu, Xiaoqing
    Zeng, Huanqiang
    Shi, Yifan
    Zhu, Jianqing
    Ma, Kai-Kuang
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2022, 2022-May : 4828 - 4832
  • [9] DEEP RANK CROSS-MODAL HASHING WITH SEMANTIC CONSISTENT FOR IMAGE-TEXT RETRIEVAL
    Liu, Xiaoqing
    Zeng, Huanqiang
    Shi, Yifan
    Zhu, Jianqing
    Ma, Kai-Kuang
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4828 - 4832
  • [10] Learning to rank with relational graph and pointwise constraint for cross-modal retrieval
    Xu, Qingzhen
    Li, Miao
    Yu, Mengjing
    SOFT COMPUTING, 2019, 23 (19) : 9413 - 9427