Deep hashing with mutual information: A comprehensive strategy for image retrieval

被引:4
作者
Chen, Yinqi [1 ]
Lu, Zhiyi [2 ]
Zheng, Yangting [1 ]
Li, Peiwen [1 ]
Luo, Weijian [1 ]
Kang, Shuo [1 ]
机构
[1] Jihua Lab, Foshan 528200, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Jinan 510000, Peoples R China
基金
国家重点研发计划;
关键词
Hashing; Image retrieval; Deep learning; Mutual information;
D O I
10.1016/j.eswa.2024.125880
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While deep learning has significantly enhanced image retrieval, several challenges persist. These include inherent information loss during the hash-coding layer, the adverse impact of hard samples during training, and the complexity of generating hash centers within the real number field. To address these issues, we introduce a novel approach termed Deep Hashing with Mutual Information (DHMI), leveraging principles of mutual information. DHMI employs mutual information theory to entire deep hashing to address the aforementioned challenges, respectively: 1) Multi-stage hash-coding layer construction: DHMI constructs the hash-coding layer as multi-stage process (high-dimensional real space-* high-dimensional Hamming space-* low-dimensional Hamming space) that preserves the mutual information throughout the spatial transformation, thereby reducing the information loss during hash-coding. 2) Resilient training process: DHMI constrains the mutual information between the output hash code and the pre-generated hash centers, and develops a loss function capable of automatically adjusting the weight of hard samples, making the training process more resilient. 3) Efficient hash center generation: DHMI utilizes his own hash-coding layer as the center generation network. By inputting a Hadamard matrix instead of image feature, DHMI avoids local optima associated with various discrete optimization algorithms and generates linearly independent hash centers. Leveraging mutual information, DHMI surpasses state-of-the-art (SOTA) deep hashing methods, demonstrating exceptional retrieval performance.
引用
收藏
页数:15
相关论文
共 49 条
[1]  
Alberto L.-G., 1994, Probability and random processes for electrical engineering
[2]  
[Anonymous], 2015, International Journal of Computer Vision
[3]  
[Anonymous], 2009, P ACM INT C IM VID R, DOI [10.1145/1646396.1646452, DOI 10.1145/1646396.1646452]
[4]  
[Anonymous], 2008, Principles of Digital Communication
[5]   Hashing with Mutual Information [J].
Cakir, Fatih ;
He, Kun ;
Bargal, Sarah Adel ;
Sclaroff, Stan .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (10) :2424-2437
[6]  
[曹玉东 Cao Yudong], 2016, [计算机应用研究, Application Research of Computers], V33, P1693
[7]   Deep Cauchy Hashing for Hamming Space Retrieval [J].
Cao, Yue ;
Long, Mingsheng ;
Liu, Bin ;
Wang, Jianmin .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1229-1237
[8]   HashNet: Deep Learning to Hash by Continuation [J].
Cao, Zhangjie ;
Long, Mingsheng ;
Wang, Jianmin ;
Yu, Philip S. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5609-5618
[9]   Deep internally connected transformer hashing for image retrieval [J].
Chao, Zijian ;
Cheng, Shuli ;
Li, Yongming .
KNOWLEDGE-BASED SYSTEMS, 2023, 279
[10]   SCRATCH: A Scalable Discrete Matrix Factorization Hashing Framework for Cross-Modal Retrieval [J].
Chen, Zhen-Duo ;
Li, Chuan-Xiang ;
Luo, Xin ;
Nie, Liqiang ;
Zhang, Wei ;
Xu, Xin-Shun .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) :2262-2275