Ensemble learning framework for image retrieval via deep hash ranking

被引:14
作者
Li, Donggen [1 ]
Dai, Dawei [1 ]
Chen, Jiancu [2 ]
Xia, Shuyin [1 ]
Wang, Guoyin [1 ]
机构
[1] Chongqing Univ Telecommun & Posts, Coll Chongqing Key Lab Computat Intelligence, Chongqing, Peoples R China
[2] Chongqing Three Gorges Univ, Key Lab Intelligent Informat Proc & Control, Chongqing, Peoples R China
关键词
Image retrieval; Hash coding; Ensemble learning; Convolutional neural network; Transformer;
D O I
10.1016/j.knosys.2022.110128
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep hashing combines feature extraction or representation with hash coding jointly, which can significantly improve the speed of large-scale image retrieval. However, we notice that compared with traditional retrieval methods, due to the reduction of dimension and information loss, the retrieval performance of binaryhash coding has declined to a certain extent. Most hash retrieval algorithms focus on the semantic similarity between image pairs, and ignore the ranking information between the returned samples. The returned samples should not only match the retrieved samples, but also rank the correct samples in front of the returned list. In addition, the performance difference of the deep model used in deep hash retrieval will also limit the efficiency of retrieval. To address such problem, we proposed an ensemble deep neural model robust framework for image retrieval, which can learn compact hash codes containing rich semantic information through hash constraints. The ensemble strategy is introduced, and the weighted voting is applied to integrate the ranking list. Comprehensive experiments on three benchmark datasets show that the proposed method achieves very competitive results. Codes are available at https://github.com/lidonggen-123/Ensemble_Deephash_Image_Retrieval.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 52 条
[1]   Semantic content-based image retrieval: A comprehensive study [J].
Alzu'bi, Ahmad ;
Amira, Abbes ;
Ramzan, Naeem .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 32 :20-54
[2]  
[Anonymous], 2011, P 28 INT C MACH LEAR
[3]  
[Anonymous], 2009, Advances in Neural Information Processing Systems
[4]   Neighborhood-Exact Nearest Neighbor Search for face retrieval [J].
Chen, Fanglin ;
Pei, Wenjie ;
Lu, Guangming .
KNOWLEDGE-BASED SYSTEMS, 2022, 248
[5]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[6]   A novel image retrieval model based on the most relevant features [J].
ElAlami, M. E. .
KNOWLEDGE-BASED SYSTEMS, 2011, 24 (01) :23-32
[7]   A knowledge-based component library for high-level computer vision tasks [J].
Fernandez-Lopez, D. ;
Cabido, R. ;
Sierra-Alonso, A. ;
Montemayor, A. S. ;
Pantrigo, J. J. .
KNOWLEDGE-BASED SYSTEMS, 2014, 70 :407-419
[8]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[9]   Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval [J].
Gong, Yunchao ;
Lazebnik, Svetlana ;
Gordo, Albert ;
Perronnin, Florent .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (12) :2916-2929
[10]   Joint learning based deep supervised hashing for large-scale image retrieval [J].
Gu, Guanghua ;
Liu, Jiangtao ;
Li, Zhuoyi ;
Huo, Wenhua ;
Zhao, Yao .
NEUROCOMPUTING, 2020, 385 :348-357