Cross-modal Retrieval with Label Completion

被引:8
|
作者
Xu, Xing [1 ]
Shen, Fumin [1 ]
Yang, Yang [1 ]
Shen, Heng Tao [1 ,2 ]
He, Li [3 ]
Song, Jingkuan [4 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Univ Queensland, Brisbane, Qld, Australia
[3] Qualcomm R&D Ctr, San Diego, CA USA
[4] Univ Trento, Trento, Italy
来源
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE | 2016年
关键词
Cross-modal retrieval; label completion; IMAGES;
D O I
10.1145/2964284.2967231
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-modal retrieval has been attracting increasing attention because of the explosion of multi-modal data, e.g., texts and images. Most supervised cross-modal retrieval methods learn discriminant common subspaces minimizing the heterogeneity of different modalities by exploiting the label information. However, these methods neglect the fact that, in practice, the given labels of training data might be incomplete (i.e., some of their labels are missing). The low-quality labels result in less effective subspace and consequent unsatisfactory retrieval performance. To tackle this, we propose a novel model that simultaneously performs label completion and cross-modal retrieval. Specifically, we assume the to-be-learned common subspace can be jointly derived through two aspects: 1) linear projection from modality-specific features and 2) enriching mapping from the incomplete labels. We thus formulate the subspace learning problem as a co-regularized learning framework based on multi-modal features and incomplete labels. Extensive experiments on two large-scale multi-modal datasets demonstrate the superiority of our model for both label completion and cross-modal retrieval over the state-of-the-arts.
引用
收藏
页码:302 / 306
页数:5
相关论文
共 50 条
  • [31] Pseudo-label driven deep hashing for unsupervised cross-modal retrieval
    XianHua Zeng
    Ke Xu
    YiCai Xie
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 3437 - 3456
  • [32] Adaptive Label-Aware Graph Convolutional Networks for Cross-Modal Retrieval
    Qian, Shengsheng
    Xue, Dizhan
    Fang, Quan
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3520 - 3532
  • [33] Scalable multi-label canonical correlation analysis for cross-modal retrieval
    Shu, Xin
    Zhao, Guoying
    PATTERN RECOGNITION, 2021, 115
  • [34] Generalized Semantic Preserving Hashing for n-Label Cross-Modal Retrieval
    Mandal, Devraj
    Chaudhury, Kunal N.
    Biswas, Soma
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2633 - 2641
  • [35] Deep Noisy Multi-label Learning for Robust Cross-Modal Retrieval
    Pu, Ruitao
    Peng, Dezhong
    Hua, Fujun
    PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 304 - 317
  • [36] DEEP PAIRWISE RANKING WITH MULTI-LABEL INFORMATION FOR CROSS-MODAL RETRIEVAL
    Jian, Yangwo
    Xiao, Jing
    Cao, Yang
    Khan, Asad
    Zhu, Jia
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1810 - 1815
  • [37] Adaptive multi-label structure preserving network for cross-modal retrieval
    Zhu, Jie
    Zhang, Hui
    Chen, Junfen
    Xie, Bojun
    Liu, Jianan
    Zhang, Junsan
    INFORMATION SCIENCES, 2024, 682
  • [38] Label guided correlation hashing for large-scale cross-modal retrieval
    Dong, Guohua
    Zhang, Xiang
    Lan, Long
    Wang, Shiwei
    Luo, Zhigang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (21) : 30895 - 30922
  • [39] Soft Contrastive Cross-Modal Retrieval
    Song, Jiayu
    Hu, Yuxuan
    Zhu, Lei
    Zhang, Chengyuan
    Zhang, Jian
    Zhang, Shichao
    APPLIED SCIENCES-BASEL, 2024, 14 (05):
  • [40] Probabilistic Embeddings for Cross-Modal Retrieval
    Chun, Sanghyuk
    Oh, Seong Joon
    de Rezende, Rafael Sampaio
    Kalantidis, Yannis
    Larlus, Diane
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8411 - 8420