Cross-modal Retrieval with Label Completion

被引：8

作者：

Xu, Xing ^{[1
]}

Shen, Fumin ^{[1
]}

Yang, Yang ^{[1
]}

Shen, Heng Tao ^{[1
,2
]}

He, Li ^{[3
]}

Song, Jingkuan ^{[4
]}

机构：

[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China

[2] Univ Queensland, Brisbane, Qld, Australia

[3] Qualcomm R&D Ctr, San Diego, CA USA

[4] Univ Trento, Trento, Italy

来源：

MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE | 2016年

关键词：

Cross-modal retrieval; label completion; IMAGES;

D O I：

10.1145/2964284.2967231

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Cross-modal retrieval has been attracting increasing attention because of the explosion of multi-modal data, e.g., texts and images. Most supervised cross-modal retrieval methods learn discriminant common subspaces minimizing the heterogeneity of different modalities by exploiting the label information. However, these methods neglect the fact that, in practice, the given labels of training data might be incomplete (i.e., some of their labels are missing). The low-quality labels result in less effective subspace and consequent unsatisfactory retrieval performance. To tackle this, we propose a novel model that simultaneously performs label completion and cross-modal retrieval. Specifically, we assume the to-be-learned common subspace can be jointly derived through two aspects: 1) linear projection from modality-specific features and 2) enriching mapping from the incomplete labels. We thus formulate the subspace learning problem as a co-regularized learning framework based on multi-modal features and incomplete labels. Extensive experiments on two large-scale multi-modal datasets demonstrate the superiority of our model for both label completion and cross-modal retrieval over the state-of-the-arts.

引用

页码：302 / 306

页数：5

共 50 条

[41] Semantics Disentangling for Cross-Modal Retrieval [J].

Wang, Zheng ;

Xu, Xing ;

Wei, Jiwei ;

Xie, Ning ;

Yang, Yang ;

Shen, Heng Tao .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 :2226-2237

[42] Cross-modal retrieval with dual optimization [J].

Qingzhen Xu ;

Shuang Liu ;

Han Qiao ;

Miao Li .

Multimedia Tools and Applications, 2023, 82 :7141-7157

[43] FedCMR: Federated Cross-Modal Retrieval [J].

Zong, Linlin ;

Xie, Qiujie ;

Zhou, Jiahui ;

Wu, Peiran ;

Zhang, Xianchao ;

Xu, Bo .

SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, :1672-1676

[44] Adversarial cross-modal retrieval based on dictionary learning [J].

Shang, Fei ;

Zhang, Huaxiang ;

Zhu, Lei ;

Sun, Jiande .

NEUROCOMPUTING, 2019, 355 :93-104

[45] UNSUPERVISED CROSS-MODAL RETRIEVAL THROUGH ADVERSARIAL LEARNING [J].

He, Li ;

Xu, Xing ;

Lu, Huimin ;

Yang, Yang ;

Shen, Fumin ;

Shen, Heng Tao .

2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, :1153-1158

[46] Deep adversarial metric learning for cross-modal retrieval [J].

Xu, Xing ;

He, Li ;

Lu, Huimin ;

Gao, Lianli ;

Ji, Yanli .

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (02) :657-672

[47] Combining Generic and Specific Information for Cross-modal Retrieval [J].

Thi Quynh Nhi Tran ;

Le Borgne, Nerve ;

Crucianu, Michel .

ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, :551-554

[48] Semantically-enhanced kernel canonical correlation analysis: a multi-label cross-modal retrieval [J].

Jia, Yuhua ;

Bai, Liang ;

Liu, Shuang ;

Wang, Peng ;

Guo, Jinlin ;

Xie, Yuxiang .

MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (10) :13169-13188

[49] Label-wise Deep Semantic-Alignment Hashing for Cross-Modal Retrieval [J].

Li, Liang ;

Sun, Weiwei .

PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, :416-424

[50] Coding self-representative and label-relaxed hashing for cross-modal retrieval [J].

Jiang, Lin ;

Wu, Jigang ;

Zhao, Shuping ;

Li, Jiaxing .

PATTERN RECOGNITION LETTERS, 2024, 185 :1-7

← 1 2 3 4 5 →