Multi-Label Cross-modal Retrieval

被引:187
|
作者
Ranjan, Viresh [1 ]
Rasiwasia, Nikhil [2 ]
Jawahar, C. V. [3 ]
机构
[1] Virginia Tech, Blacksburg, VA 24061 USA
[2] Snapdeal Com, Delhi, India
[3] IIIT Hyderabad, CVIT, Hyderabad, Andhra Pradesh, India
来源
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2015年
关键词
D O I
10.1109/ICCV.2015.466
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we address the problem of cross-modal retrieval in presence of multi-label annotations. In particular, we introduce multi-label Canonical Correlation Analysis (ml-CCA), an extension of CCA, for learning shared subspaces taking into account the high level semantic information in the form of multi-label annotations. Unlike CCA, ml-CCA does not rely on explicit pairings between the modalities, instead it uses the multi-label information to establish correspondences. This results in a discriminative subspace which is better suited for cross-modal retrieval tasks. We also present Fast ml-CCA, a computationally efficient version of ml-CCA, which is able to handle large scale datasets. We show the efficacy of our approach by conducting extensive cross-modal retrieval experiments on three standard benchmark datasets. The results show that the proposed approach achieves state-of-the-art retrieval performance on the three datasets.
引用
收藏
页码:4094 / 4102
页数:9
相关论文
共 50 条
  • [1] Graph Convolutional Multi-Label Hashing for Cross-Modal Retrieval
    Shen, Xiaobo
    Chen, Yinfan
    Liu, Weiwei
    Zheng, Yuhui
    Sun, Quan-Sen
    Pan, Shirui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [2] Hypergraph clustering based multi-label cross-modal retrieval
    Guo, Shengtang
    Zhang, Huaxiang
    Liu, Li
    Liu, Dongmei
    Lu, Xu
    Li, Liujian
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 103
  • [3] Deep Noisy Multi-label Learning for Robust Cross-Modal Retrieval
    Pu, Ruitao
    Peng, Dezhong
    Hua, Fujun
    PATTERN RECOGNITION AND COMPUTER VISION, PT V, PRCV 2024, 2025, 15035 : 304 - 317
  • [4] DEEP PAIRWISE RANKING WITH MULTI-LABEL INFORMATION FOR CROSS-MODAL RETRIEVAL
    Jian, Yangwo
    Xiao, Jing
    Cao, Yang
    Khan, Asad
    Zhu, Jia
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1810 - 1815
  • [5] Multi-label adversarial fine-grained cross-modal retrieval
    Sun, Chunpu
    Zhang, Huaxiang
    Liu, Li
    Liu, Dongmei
    Wang, Lin
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 117
  • [6] Multi-label double-layer learning for cross-modal retrieval
    He, Jianfeng
    Ma, Bingpeng
    Wang, Shuhui
    Liu, Yugui
    Huang, Qingming
    NEUROCOMPUTING, 2018, 275 : 1893 - 1902
  • [7] Adaptive multi-label structure preserving network for cross-modal retrieval
    Zhu, Jie
    Zhang, Hui
    Chen, Junfen
    Xie, Bojun
    Liu, Jianan
    Zhang, Junsan
    INFORMATION SCIENCES, 2024, 682
  • [8] Scalable multi-label canonical correlation analysis for cross-modal retrieval
    Shu, Xin
    Zhao, Guoying
    PATTERN RECOGNITION, 2021, 115
  • [9] Deep Class-Guided Hashing for Multi-Label Cross-Modal Retrieval
    Chen, Hao
    Zou, Zhuoyang
    Liu, Yiqiang
    Zhu, Xinghui
    APPLIED SCIENCES-BASEL, 2025, 15 (06):
  • [10] Deep robust multilevel semantic hashing for multi-label cross-modal retrieval
    Song, Ge
    Tan, Xiaoyang
    Zhao, Jun
    Yang, Ming
    PATTERN RECOGNITION, 2021, 120