Cross-modal Retrieval with Label Completion

被引：8

作者：

Xu, Xing ^{[1
]}

Shen, Fumin ^{[1
]}

Yang, Yang ^{[1
]}

Shen, Heng Tao ^{[1
,2
]}

He, Li ^{[3
]}

Song, Jingkuan ^{[4
]}

机构：

[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China

[2] Univ Queensland, Brisbane, Qld, Australia

[3] Qualcomm R&D Ctr, San Diego, CA USA

[4] Univ Trento, Trento, Italy

来源：

MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE | 2016年

关键词：

Cross-modal retrieval; label completion; IMAGES;

D O I：

10.1145/2964284.2967231

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Cross-modal retrieval has been attracting increasing attention because of the explosion of multi-modal data, e.g., texts and images. Most supervised cross-modal retrieval methods learn discriminant common subspaces minimizing the heterogeneity of different modalities by exploiting the label information. However, these methods neglect the fact that, in practice, the given labels of training data might be incomplete (i.e., some of their labels are missing). The low-quality labels result in less effective subspace and consequent unsatisfactory retrieval performance. To tackle this, we propose a novel model that simultaneously performs label completion and cross-modal retrieval. Specifically, we assume the to-be-learned common subspace can be jointly derived through two aspects: 1) linear projection from modality-specific features and 2) enriching mapping from the incomplete labels. We thus formulate the subspace learning problem as a co-regularized learning framework based on multi-modal features and incomplete labels. Extensive experiments on two large-scale multi-modal datasets demonstrate the superiority of our model for both label completion and cross-modal retrieval over the state-of-the-arts.

引用

页码：302 / 306

页数：5

共 50 条

[31] Deep Noisy Multi-label Learning for Robust Cross-Modal Retrieval [J].

Pu, Ruitao ;

Peng, Dezhong ;

Hua, Fujun .

PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 :304-317

[32] Multi-label adversarial fine-grained cross-modal retrieval [J].

Sun, Chunpu ;

Zhang, Huaxiang ;

Liu, Li ;

Liu, Dongmei ;

Wang, Lin .

SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 117

[33] Adaptive Label-Aware Graph Convolutional Networks for Cross-Modal Retrieval [J].

Qian, Shengsheng ;

Xue, Dizhan ;

Fang, Quan ;

Xu, Changsheng .

IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 :3520-3532

[34] Pseudo-label driven deep hashing for unsupervised cross-modal retrieval [J].

XianHua Zeng ;

Ke Xu ;

YiCai Xie .

International Journal of Machine Learning and Cybernetics, 2023, 14 :3437-3456

[35] Scalable multi-label canonical correlation analysis for cross-modal retrieval [J].

Shu, Xin ;

Zhao, Guoying .

PATTERN RECOGNITION, 2021, 115

[36] Cross-modal retrieval with dual optimization [J].

Xu, Qingzhen ;

Liu, Shuang ;

Qiao, Han ;

Li, Miao .

MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (05) :7141-7157

[37] Active Supervised Cross-Modal Retrieval [J].

Zhang, Huaiwen ;

Yang, Yang ;

Qi, Fan ;

Qian, Shengsheng ;

Xu, Changsheng .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (06) :5112-5126

[38] CROSS-MODAL RETRIEVAL WITH NOISY LABELS [J].

Mandal, Devraj ;

Biswas, Soma .

2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, :2326-2330

[39] Soft Contrastive Cross-Modal Retrieval [J].

Song, Jiayu ;

Hu, Yuxuan ;

Zhu, Lei ;

Zhang, Chengyuan ;

Zhang, Jian ;

Zhang, Shichao .

APPLIED SCIENCES-BASEL, 2024, 14 (05)

[40] A Graph Model for Cross-modal Retrieval [J].

Wang, Shixun ;

Pan, Peng ;

Lu, Yansheng .

PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA TECHNOLOGY (ICMT-13), 2013, 84 :1090-1097

← 1 2 3 4 5 →