Cross-modal Retrieval with Label Completion

Cited by: 8
Authors
Xu, Xing [1]
Shen, Fumin [1]
Yang, Yang [1]
Shen, Heng Tao [1, 2]
He, Li [3]
Song, Jingkuan [4]
Affiliations
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Univ Queensland, Brisbane, Qld, Australia
[3] Qualcomm R&D Ctr, San Diego, CA USA
[4] Univ Trento, Trento, Italy
Source
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE | 2016
Keywords
Cross-modal retrieval; label completion; IMAGES;
DOI
10.1145/2964284.2967231
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Cross-modal retrieval has been attracting increasing attention because of the explosion of multi-modal data, e.g., texts and images. Most supervised cross-modal retrieval methods exploit label information to learn discriminant common subspaces that minimize the heterogeneity between modalities. However, these methods neglect the fact that, in practice, the given labels of the training data might be incomplete (i.e., some of the labels are missing). Such low-quality labels result in a less effective subspace and, consequently, unsatisfactory retrieval performance. To tackle this, we propose a novel model that simultaneously performs label completion and cross-modal retrieval. Specifically, we assume the to-be-learned common subspace can be jointly derived from two aspects: 1) a linear projection from modality-specific features and 2) an enriching mapping from the incomplete labels. We thus formulate the subspace learning problem as a co-regularized learning framework based on multi-modal features and incomplete labels. Extensive experiments on two large-scale multi-modal datasets demonstrate the superiority of our model over state-of-the-art methods for both label completion and cross-modal retrieval.
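The abstract gives no formulas, so the following is only an illustrative sketch of the general idea, not the paper's actual objective or algorithm. It assumes the common subspace is the label space itself, reached either by a ridge-regularized linear projection of each modality (`Wx`, `Wt`) or by an "enriching" mapping `C` applied to the incomplete label matrix, with `C` kept close to the identity. The objective, the alternating closed-form updates, the hyperparameters `lam` and `gamma`, and the toy data generator are all assumptions made for illustration.

```python
import numpy as np

def objective(X, T, Y, Wx, Wt, C, lam, gamma):
    """Assumed co-regularized loss (illustrative, not from the paper)."""
    V = Y @ C  # enriched labels act as the shared representation
    fit = np.sum((X @ Wx - V) ** 2) + np.sum((T @ Wt - V) ** 2)
    reg = lam * (np.sum(Wx ** 2) + np.sum(Wt ** 2))
    return fit + reg + gamma * np.sum((C - np.eye(C.shape[0])) ** 2)

def fit_shared_subspace(X, T, Y, lam=1.0, gamma=1.0, n_iter=20):
    """Alternate exact minimization over (Wx, Wt) and over C."""
    c = Y.shape[1]
    C = np.eye(c)  # start from "no enrichment"
    history = []
    for _ in range(n_iter):
        V = Y @ C
        # Ridge regression of each modality onto the enriched labels.
        Wx = np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ V)
        Wt = np.linalg.solve(T.T @ T + lam * np.eye(T.shape[1]), T.T @ V)
        # Closed-form update of C: (2 Y'Y + gamma I) C = Y'(X Wx + T Wt) + gamma I.
        rhs = Y.T @ (X @ Wx + T @ Wt) + gamma * np.eye(c)
        C = np.linalg.solve(2.0 * Y.T @ Y + gamma * np.eye(c), rhs)
        history.append(objective(X, T, Y, Wx, Wt, C, lam, gamma))
    return Wx, Wt, C, history

def rank_texts_for_image(x_query, T_db, Wx, Wt):
    """Rank text items by cosine similarity to an image query in the shared space."""
    q = x_query @ Wx
    D = T_db @ Wt
    sims = (D @ q) / (np.linalg.norm(D, axis=1) * np.linalg.norm(q) + 1e-12)
    return np.argsort(-sims)

def make_toy_data(seed=0, n=60, c=5, dx=12, dt=8, drop=0.3):
    """Synthetic paired features with randomly dropped positive labels."""
    rng = np.random.default_rng(seed)
    Y_full = (rng.random((n, c)) < 0.3).astype(float)
    X = Y_full @ rng.normal(size=(c, dx)) + 0.1 * rng.normal(size=(n, dx))
    T = Y_full @ rng.normal(size=(c, dt)) + 0.1 * rng.normal(size=(n, dt))
    Y_obs = Y_full * (~(rng.random((n, c)) < drop))  # simulate missing labels
    return X, T, Y_full, Y_obs
```

In this sketch, `Y_obs @ C` can be read as completed label scores, and the same projections serve retrieval in both directions. Because each block update is an exact minimizer of a convex subproblem, the recorded loss does not increase across iterations.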
Pages: 302-306
Page count: 5