Deep Cross-Modal Hashing Based on Semantic Consistent Ranking

被引:21
|
作者
Liu, Xiaoqing [1 ]
Zeng, Huanqiang [1 ,2 ]
Shi, Yifan [2 ]
Zhu, Jianqing [2 ]
Hsia, Chih-Hsien [3 ]
Ma, Kai-Kuang [4 ]
机构
[1] Huaqiao Univ, Sch Informat Sci & Engn, Quanzhou 362021, Peoples R China
[2] Huaqiao Univ, Sch Engn, Quanzhou 362021, Peoples R China
[3] Ilan Univ, Dept Comp Sci & Informat Engn, Yilan City 260, Taiwan
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, Nanyang 639798, Singapore
关键词
Cross-modal hashing; rank learning; heterogeneous gap; intra-modal similarity; IMAGE RETRIEVAL; REPRESENTATIONS; NETWORK; CODES;
D O I
10.1109/TMM.2023.3254199
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The amount of multi-modal data available on the Internet is enormous. Cross-modal hash retrieval maps heterogeneous cross-modal data into a single Hamming space to offer fast and flexible retrieval services. However, existing cross-modal methods mainly rely on the feature-level similarity between multi-modal data and ignore the relationship between relative rankings and label-level fine-grained similarity of neighboring instances. To overcome these issues, we propose a novel Deep Cross-modal Hashing based on Semantic Consistent Ranking (DCH-SCR) that comprehensively investigates the intra-modal semantic similarity relationship. Firstly, to the best of our knowledge, it is an early attempt to preserve semantic similarity for cross-modal hashing retrieval by combining label-level and feature-level information. Secondly, the inherent gap between modalities is narrowed by developing a ranking alignment loss function. Thirdly, the compact and efficient hash codes are optimized based on the common semantic space. Finally, we use the gradient to specify the optimization direction and introduce the Normalized Discounted Cumulative Gain (NDCG) to achieve varying optimization strengths for data pairs with different similarities. Extensive experiments on three real-world image-text retrieval datasets demonstrate the superiority of DCH-SCR over several state-of-the-art cross-modal retrieval methods.
引用
收藏
页码:9530 / 9542
页数:13
相关论文
共 50 条
  • [31] Semantic consistency hashing for cross-modal retrieval
    Yao, Tao
    Kong, Xiangwei
    Fu, Haiyan
    Tian, Qi
    NEUROCOMPUTING, 2016, 193 : 250 - 259
  • [32] Semantic embedding based online cross-modal hashing method
    Zhang, Meijia
    Li, Junzheng
    Zheng, Xiyuan
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [33] Semantic embedding based online cross-modal hashing method
    Meijia Zhang
    Junzheng Li
    Xiyuan Zheng
    Scientific Reports, 14
  • [34] CROSS-MODAL HASHING THROUGH RANKING SUBSPACE LEARNING
    Li, Kai
    Qi, Guojun
    Ye, Jun
    Hua, Kien A.
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [35] Linear Subspace Ranking Hashing for Cross-Modal Retrieval
    Li, Kai
    Qi, Guo-Jun
    Ye, Jun
    Hua, Kien A.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (09) : 1825 - 1838
  • [36] Semantic-consistent cross-modal hashing for large-scale image retrieval
    Gu, Xuesong
    Dong, Guohua
    Zhang, Xiang
    Lan, Long
    Luo, Zhigang
    NEUROCOMPUTING, 2021, 433 : 181 - 198
  • [37] Label consistent locally linear embedding based cross-modal hashing
    Zeng, Hui
    Zhang, Huaxiang
    Zhu, Lei
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [38] LABEL CONSISTENT MATRIX FACTORIZATION BASED HASHING FOR CROSS-MODAL RETRIEVAL
    Mandal, Devraj
    Biswas, Soma
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 2901 - 2905
  • [39] DEEP SEMANTIC ADVERSARIAL HASHING BASED ON AUTOENCODER FOR LARGE-SCALE CROSS-MODAL RETRIEVAL
    Li, Mingyong
    Wang, Hongya
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,
  • [40] Multilevel Deep Semantic Feature Asymmetric Network for Cross-Modal Hashing Retrieval
    Jiang, Xiaolong
    Fan, Jiabao
    Zhang, Jie
    Lin, Ziyong
    Li, Mingyong
    IEEE LATIN AMERICA TRANSACTIONS, 2024, 22 (08) : 621 - 631