Object-oriented convolutional features for fine-grained image retrieval in large surveillance datasets

被引:24
作者
Ahmad, Jamil [1 ]
Muhammad, Khan [1 ]
Bakshi, Sambit [2 ]
Baik, Sung Wook [1 ]
机构
[1] Sejong Univ, Digital Contents Res Inst, Seoul, South Korea
[2] Natl Inst Technol, Dept Comp Sci & Engn, Rourkela, India
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2018年 / 81卷
基金
新加坡国家研究基金会;
关键词
Image retrieval; Object-oriented features; Convolutional neural network; Fine-grained retrieval; REPRESENTATION; QUANTIZATION;
D O I
10.1016/j.future.2017.11.002
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large scale visual surveillance generates huge volumes of data at a rapid pace, giving rise to massive image repositories. Efficient and reliable access to relevant data in these ever growing databases is a highly challenging task due to the complex nature of surveillance objects. Furthermore, inter-class visual similarity between vehicles requires extraction of fine-grained and highly discriminative features. In recent years, features from deep convolutional neural networks (CNN) have exhibited state-of-the-art performance in image retrieval. However, these features have been used without regard to their sensitivity to objects of a particular class. In this paper, we propose an object-oriented feature selection mechanism for deep convolutional features from a pre-trained CNN. Convolutional feature maps from a deep layer are selected based on the analysis of their responses to surveillance objects. The selected features serve to represent semantic features of surveillance objects and their parts with minimal influence of the background, effectively eliminating the need for background removal procedure prior to features extraction. Layer-wise mean activations from the selected features maps form the discriminative descriptor for each object. These object-oriented convolutional features (OOCF) are then projected onto low-dimensional hamming space using locality sensitive hashing approaches. The resulting compact binary hash codes allow efficient retrieval within large scale datasets. Results on five challenging datasets reveal that OOCF achieves better precision and recall than the full feature set for objects with varying backgrounds. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:314 / 330
页数:17
相关论文
共 58 条
[1]   Embedded deep vision in smart cameras for multi-view objects representation and retrieval [J].
Ahmad, Jamil ;
Mehmood, Irfan ;
Rho, Seungmin ;
Chilamkurti, Naveen ;
Baik, Sung Wook .
COMPUTERS & ELECTRICAL ENGINEERING, 2017, 61 :297-311
[2]   SiNC: Saliency-injected neural codes for representation and efficient retrieval of medical radiographs [J].
Ahmad, Jamil ;
Sajjad, Muhammad ;
Mehmood, Irfan ;
Baik, Sung Wook .
PLOS ONE, 2017, 12 (08)
[3]   Efficient object-based surveillance image search using spatial pooling of convolutional features [J].
Ahmad, Jamil ;
Mehmood, Irfan ;
Baik, Sung Wook .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 45 :62-76
[4]   Saliency-weighted graphs for efficient visual content description and their applications in real-time image retrieval systems [J].
Ahmad, Jamil ;
Sajjad, Muhammad ;
Mehmood, Irfan ;
Rho, Seungmin ;
Baik, Sung Wook .
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2017, 13 (03) :431-447
[5]   Multi-scale local structure patterns histogram for describing visual contents in social image retrieval systems [J].
Ahmad, Jamil ;
Sajjad, Muhammad ;
Rho, Seungmin ;
Baik, Sung Wook .
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (20) :12669-12692
[6]  
Alharthi N., 2017, Scientific Modelling and Research, V2, P9, DOI 10.20448/808.2.1.9.18
[7]  
[Anonymous], 2014, PR MACH LEARN RES
[8]  
[Anonymous], 2007, P INT WORKSHOP WORKS
[9]  
[Anonymous], 2011, INT C MULT, DOI DOI 10.1145/2072298.2072354
[10]  
[Anonymous], LECT NOTES INFORM TH