Fast and accurate image retrieval using knowledge distillation from multiple deep pre-trained networks

Cited by: 0
Authors
Hasan Salman
Amir Hossein Taherinia
Davood Zabihzadeh
Affiliations
[1] Ferdowsi University of Mashhad, Computer Engineering Department, Faculty of Engineering
[2] Hakim Sabzevari University, Department of Computer Engineering
Source
Multimedia Tools and Applications | 2023 / Vol. 82
Keywords
Information retrieval; Knowledge distillation; Model quantization; Semantic hash coding; Attention mechanism
DOI
Not available
Abstract
Content-based image retrieval systems aim to retrieve images similar to a query image from a large dataset. The feature extractor and the similarity measure play key roles in these systems. Hand-crafted feature descriptors such as SURF, SIFT, and GIST extract patterns suitable for measuring the similarity between images. Recently, deep learning, which performs feature extraction and similarity learning simultaneously, has received much attention in this field. Several studies show that feature vectors extracted from pre-trained networks carry richer information than class labels for classification and retrieval. This paper presents an effective method, Deep Multi-teacher Transfer Hash (DMTH), which uses knowledge from several complex models to teach a simple one. Because many pre-trained models are available and their extracted features are diverse, we use an attention mechanism to combine them into richer features, which then teach a simple model through an appropriate knowledge distillation loss. We test our method on the widely used CIFAR-10 and CIFAR-100 datasets and compare it with other state-of-the-art methods. The experimental results show that DMTH improves image retrieval performance by learning better features, obtained through an attention mechanism over multiple teachers, without increasing evaluation time. Specifically, the proposed multi-teacher model surpasses the best individual teacher by 2% in accuracy on CIFAR-10. Meanwhile, our knowledge transfer mechanism boosts the performance of the student model by more than 4%.
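The abstract does not give the exact attention formulation or distillation loss, so the following is only a minimal sketch of the general idea: several teacher feature vectors are combined with attention weights, and a student feature is pulled toward the combined target. Everything here is an assumption for illustration, not the authors' implementation: the scaled dot-product scoring, the `query` vector (which would be learned in practice), the mean-squared-error loss, the requirement that all teachers emit features of the same dimension, and the function names `fuse_teachers` and `distillation_loss`.

```python
import math
from typing import List, Sequence, Tuple

Vector = List[float]


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_teachers(teacher_feats: Sequence[Vector],
                  query: Vector) -> Tuple[Vector, List[float]]:
    """Combine teacher features into one target using attention weights.

    Assumption: each teacher is scored by a scaled dot product between a
    hypothetical query vector and that teacher's feature vector.
    """
    dim = len(query)
    scores = [
        sum(q * f for q, f in zip(query, feat)) / math.sqrt(dim)
        for feat in teacher_feats
    ]
    weights = softmax(scores)
    # Weighted sum of the teacher features, one coordinate at a time.
    fused = [
        sum(w * feat[i] for w, feat in zip(weights, teacher_feats))
        for i in range(dim)
    ]
    return fused, weights


def distillation_loss(student_feat: Vector, fused_feat: Vector) -> float:
    """Mean squared error between the student feature and the fused target.

    Assumption: MSE stands in for the paper's distillation loss.
    """
    return sum((s - t) ** 2
               for s, t in zip(student_feat, fused_feat)) / len(fused_feat)


# Toy example: three teachers, two-dimensional features.
teachers = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [1.0, 1.0]
fused, weights = fuse_teachers(teachers, query)
student = [0.2, 0.4]
print("attention weights:", weights)
print("fused teacher feature:", fused)
print("distillation loss:", distillation_loss(student, fused))
```

In this sketch only the student would be run at retrieval time, which is consistent with the abstract's claim that using multiple teachers does not increase evaluation time.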
Pages: 33937-33959
Page count: 22