Discrete online cross-modal hashing with consistency preservation

被引：0

作者：

Kang, Xiao ^{[1
]}

Liu, Xingbo ^{[2
]}

Xue, Wen ^{[1
]}

Zhang, Xuening ^{[1
]}

Nie, Xiushan ^{[2
,3
]}

Yin, Yilong

机构：

[1] Shandong Univ, Sch Software, Jinan, Peoples R China

[2] Shandong Jianzhu Univ, Sch Comp Sci & Technol, Jinan, Peoples R China

[3] Shandong Yunhai Guochuang Cloud Comp Equipment Ind, Jinan, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 155卷

基金：

中国国家自然科学基金;

关键词：

Cross-modal retrieval; Supervised online hashing; Continuous semantic embedding; Fine-grained similarity preserving; Modality deviation calibration;

D O I：

10.1016/j.patcog.2024.110688

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Online cross-modal hashing has attracted widespread attention with the rapid expansion of large-scale streaming data, which can reduce storage requirements and enhance efficiency for online cross-modal retrieval. However, despite promising progress, existing methods still suffer from defective accuracy in a way, primarily attributed to two issues: insufficient semantic information exploitation and mismatched training-retrieval process. To address these challenges, we propose a novel supervised hashing method with dual consistency preservation, called Discrete Online Cross-Modal Hashing (DOCMH). On the one hand, we design more informative continuous semantic labels and fine-grained similarity graphs to preserve semantic consistency across different streaming data chunks and modality representations. On the other hand, we propose an effective modality deviation calibration mechanism for preserving learning process consistency between the training and retrieval phases. Extensive experiments on three widely used benchmark datasets demonstrate the superior performance of the proposed DOCMH under various scenarios.

引用

页数：12

共 41 条

[1] Cakir F, 2015, IEEE IMAGE PROC, P2606, DOI 10.1109/ICIP.2015.7351274
[2] Chen XX, 2017, CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017)
[3] Collaborative multiview hashing
Chen, Zhixiang
Zhou, Jie
[J]. PATTERN RECOGNITION, 2018, 75 : 149 - 160
[4] MS2GAH: Multi-label semantic supervised graph attention hashing for robust cross-modal retrieval
Duan, Youxiang
Chen, Ning
Zhang, Peiying
Kumar, Neeraj
Chang, Lunjie
Wen, Wu
[J]. PATTERN RECOGNITION, 2022, 128
[5] Golub G., 2009, Matrix Computations, P392
[6] RIDGE REGRESSION - APPLICATIONS TO NONORTHOGONAL PROBLEMS
HOERL, AE
KENNARD, RW
[J]. TECHNOMETRICS, 1970, 12 (01) : 69 - &
[7] Huang L., 2013, IJCAI 2013, P1422
[8] Huiskes M.J., 2008, P 1 ACM INT C MULT I, P39, DOI DOI 10.1145/1460096.1460104
[9] The segmented and annotated IAPR TC-12 benchmark
Jair Escalante, Hugo
Hernandez, Carlos A.
Gonzalez, Jesus A.
Lopez-Lopez, A.
Montes, Manuel
Morales, Eduardo F.
Sucar, L. Enrique
Villasenor, Luis
Grubinger, Michael
[J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2010, 114 (04) : 419 - 428
[10] Deep Cross-Modal Hashing
Jiang, Qing-Yuan
Li, Wu-Jun
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3270 - 3278

← 1 2 3 4 5 →