Pseudo-label driven deep hashing for unsupervised cross-modal retrieval

被引:0
作者
XianHua Zeng
Ke Xu
YiCai Xie
机构
[1] Chongqing University of Posts and Telecommunications,Chongqing Key Laboratory of Image Cognition, College of Computer Science and Technology
来源
International Journal of Machine Learning and Cybernetics | 2023年 / 14卷
关键词
Hashing; Cross-modal retrieval; Unsupervised learning; Clustering;
D O I
暂无
中图分类号
学科分类号
摘要
With the rapid development of big data and the Internet, cross-modal retrieval has become a popular research topic. Cross-modal hashing is an important research direction in cross-modal retrieval, due to its highly efficiency and small memory consumption. Recently, many unsupervised cross-modal hashing methods achieved great results on cross-modal retrieval tasks. However, how to narrow the heterogeneous gap between different modalities and generate more discriminative hash codes are still the main problems of unsupervised hashing. In this paper, we propose a novel unsupervised cross-modal hashing method Pseudo-label Driven Deep Hashing to solve aforementioned problems. We introduce clustering into our modal to obtain initialized semantical information called pseudo-label, and we propose a novel adjusting method that uses pseudo-labels to adjust joint-semantic similarity matrix. We construct a similarity consistency loss function that focuses on the heterogeneity gap between different modalities, and a real values and binary codes fine-tuning strategy for closing the gap between real value space and Hamming space. We conduct experiments on five datasets including three natural datasets which have larger inter-class distances and two medical datasets which have smaller inter-class distances, the results demonstrate the superiority of our method compared with several unsupervised cross-modal hashing methods.
引用
收藏
页码:3437 / 3456
页数:19
相关论文
共 42 条
[1]  
Fang X(2021)Discrete matrix factorization hashing for cross-modal retrieval Int J Mach Learn Cybern 12 3023-3036
[2]  
Liu Z(2018)Deep binary reconstruction for cross-modal hashing IEEE Trans Multimedia 21 973-985
[3]  
Han N(2019)Multimodal adversarial network for cross-modal retrieval Knowl-Based Syst 180 38-50
[4]  
Hu D(2015)Learning consistent feature representation for cross-modal multimedia retrieval IEEE Trans Multimedia 17 370-381
[5]  
Nie F(2016)Cross-view retrieval via probability-based semantics-preserving hashing IEEE Trans Cybern 47 4342-4355
[6]  
Li X(2020)Mask cross-modal hashing networks IEEE Trans Multimedia 23 550-558
[7]  
Hu P(2019)Cyclematch: a cycle-consistent embedding network for image-text matching Pattern Recogn 93 365-379
[8]  
Peng D(2019)Efficient discrete latent semantic hashing for scalable cross-modal retrieval Signal Process 154 217-231
[9]  
Wang X(2013)On the role of correlation and abstraction in cross-modal multimedia retrieval IEEE Trans Pattern Anal Mach Intell 36 521-535
[10]  
Kang C(2015)Cross-modal learning to rank via latent joint representation IEEE Trans Image Process 24 1497-1509