Discrete online cross-modal hashing with consistency preservation

被引:0
作者
Kang, Xiao [1 ]
Liu, Xingbo [2 ]
Xue, Wen [1 ]
Zhang, Xuening [1 ]
Nie, Xiushan [2 ,3 ]
Yin, Yilong
机构
[1] Shandong Univ, Sch Software, Jinan, Peoples R China
[2] Shandong Jianzhu Univ, Sch Comp Sci & Technol, Jinan, Peoples R China
[3] Shandong Yunhai Guochuang Cloud Comp Equipment Ind, Jinan, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal retrieval; Supervised online hashing; Continuous semantic embedding; Fine-grained similarity preserving; Modality deviation calibration;
D O I
10.1016/j.patcog.2024.110688
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online cross-modal hashing has attracted widespread attention with the rapid expansion of large-scale streaming data, which can reduce storage requirements and enhance efficiency for online cross-modal retrieval. However, despite promising progress, existing methods still suffer from defective accuracy in a way, primarily attributed to two issues: insufficient semantic information exploitation and mismatched training-retrieval process. To address these challenges, we propose a novel supervised hashing method with dual consistency preservation, called Discrete Online Cross-Modal Hashing (DOCMH). On the one hand, we design more informative continuous semantic labels and fine-grained similarity graphs to preserve semantic consistency across different streaming data chunks and modality representations. On the other hand, we propose an effective modality deviation calibration mechanism for preserving learning process consistency between the training and retrieval phases. Extensive experiments on three widely used benchmark datasets demonstrate the superior performance of the proposed DOCMH under various scenarios.
引用
收藏
页数:12
相关论文
共 41 条
  • [1] Cakir F, 2015, IEEE IMAGE PROC, P2606, DOI 10.1109/ICIP.2015.7351274
  • [2] Chen XX, 2017, CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017)
  • [3] Collaborative multiview hashing
    Chen, Zhixiang
    Zhou, Jie
    [J]. PATTERN RECOGNITION, 2018, 75 : 149 - 160
  • [4] MS2GAH: Multi-label semantic supervised graph attention hashing for robust cross-modal retrieval
    Duan, Youxiang
    Chen, Ning
    Zhang, Peiying
    Kumar, Neeraj
    Chang, Lunjie
    Wen, Wu
    [J]. PATTERN RECOGNITION, 2022, 128
  • [5] Golub G., 2009, Matrix Computations, P392
  • [6] RIDGE REGRESSION - APPLICATIONS TO NONORTHOGONAL PROBLEMS
    HOERL, AE
    KENNARD, RW
    [J]. TECHNOMETRICS, 1970, 12 (01) : 69 - &
  • [7] Huang L., 2013, IJCAI 2013, P1422
  • [8] Huiskes M.J., 2008, P 1 ACM INT C MULT I, P39, DOI DOI 10.1145/1460096.1460104
  • [9] The segmented and annotated IAPR TC-12 benchmark
    Jair Escalante, Hugo
    Hernandez, Carlos A.
    Gonzalez, Jesus A.
    Lopez-Lopez, A.
    Montes, Manuel
    Morales, Eduardo F.
    Sucar, L. Enrique
    Villasenor, Luis
    Grubinger, Michael
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2010, 114 (04) : 419 - 428
  • [10] Deep Cross-Modal Hashing
    Jiang, Qing-Yuan
    Li, Wu-Jun
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3270 - 3278