Intra-class low-rank regularization for supervised and semi-supervised cross-modal retrieval

被引：0

作者：

Peipei Kang

Zehang Lin

Zhenguo Yang

Xiaozhao Fang

Alexander M. Bronstein

Qing Li

Wenyin Liu

机构：

[1] Guangdong University of Technology,School of Computer Science and Technology

[2] Technion,Computer Science Department

[3] Hong Kong Polytechnic University,Department of Computing

[4] Guangdong University of Technology,Department of Automation

来源：

Applied Intelligence | 2022年 / 52卷

关键词：

Cross-modal retrieval; Deep neural networks; Intra-class low-rank; Supervised learning; Semi-supervised learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Cross-modal retrieval aims to retrieve related items across different modalities, for example, using an image query to retrieve related text. The existing deep methods ignore both the intra-modal and inter-modal intra-class low-rank structures when fusing various modalities, which decreases the retrieval performance. In this paper, two deep models (denoted as ILCMR and Semi-ILCMR) based on intra-class low-rank regularization are proposed for supervised and semi-supervised cross-modal retrieval, respectively. Specifically, ILCMR integrates the image network and text network into a unified framework to learn a common feature space by imposing three regularization terms to fuse the cross-modal data. First, to align them in the label space, we utilize semantic consistency regularization to convert the data representations to probability distributions over the classes. Second, we introduce an intra-modal low-rank regularization, which encourages the intra-class samples that originate from the same space to be more relevant in the common feature space. Third, an inter-modal low-rank regularization is applied to reduce the cross-modal discrepancy. To enable the low-rank regularization to be optimized using automatic gradients during network back-propagation, we propose the rank-r approximation and specify the explicit gradients for theoretical completeness. In addition to the three regularization terms that rely on label information incorporated by ILCMR, we propose Semi-ILCMR in the semi-supervised regime, which introduces a low-rank constraint before projecting the general representations into the common feature space. Extensive experiments on four public cross-modal datasets demonstrate the superiority of ILCMR and Semi-ILCMR over other state-of-the-art methods.

引用

页码：33 / 54

页数：21

共 50 条

[1] Intra-class low-rank regularization for supervised and semi-supervised cross-modal retrieval
Kang, Peipei
Lin, Zehang
Yang, Zhenguo
Fang, Xiaozhao
Bronstein, Alexander M.
Li, Qing
Liu, Wenyin
APPLIED INTELLIGENCE, 2022, 52 (01) : 33 - 54
[2] Semantic Consistency Cross-Modal Retrieval With Semi-Supervised Graph Regularization
Xu, Gongwen
Li, Xiaomei
Zhang, Zhijun
IEEE ACCESS, 2020, 8 : 14278 - 14288
[3] Deep Semantic Space with Intra-class Low-rank Constraint for Cross-modal Retrieval
Kang, Peipei
Lin, Zehang
Yang, Zhenguo
Fang, Xiaozhao
Li, Qing
Liu, Wenyin
ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 226 - 234
[4] A semi-supervised cross-modal memory bank for cross-modal retrieval
Huang, Yingying
Hu, Bingliang
Zhang, Yipeng
Gao, Chi
Wang, Quan
NEUROCOMPUTING, 2024, 579
[5] Semi-Supervised Cross-Modal Retrieval With Label Prediction
Mandal, Devraj
Rao, Pramod
Biswas, Soma
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (09) : 2345 - 2353
[6] Semi-supervised cross-modal learning for cross modal retrieval and image annotation
Fuhao Zou
Xingqiang Bai
Chaoyang Luan
Kai Li
Yunfei Wang
Hefei Ling
World Wide Web, 2019, 22 : 825 - 841
[7] Semi-supervised cross-modal learning for cross modal retrieval and image annotation
Zou, Fuhao
Bai, Xingqiang
Luan, Chaoyang
Li, Kai
Wang, Yunfei
Ling, Hefei
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (02): : 825 - 841
[8] Semi-supervised discrete hashing for efficient cross-modal retrieval
Wang, Xingzhi
Liu, Xin
Peng, Shu-Juan
Zhong, Bineng
Chen, Yewang
Du, Ji-Xiang
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (35-36) : 25335 - 25356
[9] Semi-Supervised Cross-Modal Retrieval Based on Discriminative Comapping
Liu, Li
Dong, Xiao
Wang, Tianshi
COMPLEXITY, 2020, 2020
[10] Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval
Zhang, Liang
Ma, Bingpeng
He, Jianfeng
Li, Guorong
Huang, Qingming
Tian, Qi
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3406 - 3412

← 1 2 3 4 5 →