Deep hashing with mutual information: A comprehensive strategy for image retrieval

被引：4

作者：

Chen, Yinqi ^{[1
]}

Lu, Zhiyi ^{[2
]}

Zheng, Yangting ^{[1
]}

Li, Peiwen ^{[1
]}

Luo, Weijian ^{[1
]}

Kang, Shuo ^{[1
]}

机构：

[1] Jihua Lab, Foshan 528200, Peoples R China

[2] Chinese Acad Sci, Aerosp Informat Res Inst, Jinan 510000, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2025年 / 264卷

基金：

国家重点研发计划;

关键词：

Hashing; Image retrieval; Deep learning; Mutual information;

D O I：

10.1016/j.eswa.2024.125880

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While deep learning has significantly enhanced image retrieval, several challenges persist. These include inherent information loss during the hash-coding layer, the adverse impact of hard samples during training, and the complexity of generating hash centers within the real number field. To address these issues, we introduce a novel approach termed Deep Hashing with Mutual Information (DHMI), leveraging principles of mutual information. DHMI employs mutual information theory to entire deep hashing to address the aforementioned challenges, respectively: 1) Multi-stage hash-coding layer construction: DHMI constructs the hash-coding layer as multi-stage process (high-dimensional real space-* high-dimensional Hamming space-* low-dimensional Hamming space) that preserves the mutual information throughout the spatial transformation, thereby reducing the information loss during hash-coding. 2) Resilient training process: DHMI constrains the mutual information between the output hash code and the pre-generated hash centers, and develops a loss function capable of automatically adjusting the weight of hard samples, making the training process more resilient. 3) Efficient hash center generation: DHMI utilizes his own hash-coding layer as the center generation network. By inputting a Hadamard matrix instead of image feature, DHMI avoids local optima associated with various discrete optimization algorithms and generates linearly independent hash centers. Leveraging mutual information, DHMI surpasses state-of-the-art (SOTA) deep hashing methods, demonstrating exceptional retrieval performance.

引用

页数：15

共 49 条

[1]

Alberto L.-G., 1994, Probability and random processes for electrical engineering

[2]

[Anonymous], 2015, International Journal of Computer Vision

[3]

[Anonymous], 2009, P ACM INT C IM VID R, DOI [10.1145/1646396.1646452, DOI 10.1145/1646396.1646452]

[4]

[Anonymous], 2008, Principles of Digital Communication

[5] Hashing with Mutual Information [J].

Cakir, Fatih ;

He, Kun ;

Bargal, Sarah Adel ;

Sclaroff, Stan .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (10) :2424-2437

[6]

[曹玉东 Cao Yudong], 2016, [计算机应用研究, Application Research of Computers], V33, P1693

[7] Deep Cauchy Hashing for Hamming Space Retrieval [J].

Cao, Yue ;

Long, Mingsheng ;

Liu, Bin ;

Wang, Jianmin .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1229-1237

[8] HashNet: Deep Learning to Hash by Continuation [J].

Cao, Zhangjie ;

Long, Mingsheng ;

Wang, Jianmin ;

Yu, Philip S. .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5609-5618

[9] Deep internally connected transformer hashing for image retrieval [J].

Chao, Zijian ;

Cheng, Shuli ;

Li, Yongming .

KNOWLEDGE-BASED SYSTEMS, 2023, 279

[10] SCRATCH: A Scalable Discrete Matrix Factorization Hashing Framework for Cross-Modal Retrieval [J].

Chen, Zhen-Duo ;

Li, Chuan-Xiang ;

Luo, Xin ;

Nie, Liqiang ;

Zhang, Wei ;

Xu, Xin-Shun .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) :2262-2275

← 1 2 3 4 5 →