Embedded deep vision in smart cameras for multi-view objects representation and retrieval

被引:10
作者
Ahmad, Jamil [1 ]
Mehmood, Irfan [1 ]
Rho, Seungmin [2 ]
Chilamkurti, Naveen [3 ]
Baik, Sung Wook [1 ]
机构
[1] Sejong Univ, Coll Software & Convergence Technol, Seoul, South Korea
[2] Sungkyul Univ, Dept Software, Anyang, South Korea
[3] La Trobe Univ, Comp Sci & IT, Melbourne, Vic, Australia
基金
新加坡国家研究基金会;
关键词
Embedded processing; Convolutional neural network; Transfer learning; Image retrieval; POSE ESTIMATION; SYSTEM;
D O I
10.1016/j.compeleceng.2017.05.033
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Active large scale surveillance of indoor and outdoor environments with multiple cameras is becoming an undeniable necessity in today's connected world. Enhanced computational and storage capabilities in smart cameras establish them as promising platforms for implementing intelligent and autonomous surveillance networks. However, poor resolution, limited number of samples per object, and pose variation in multi-view surveillance streams, make the task of efficient image representation highly challenging. To address these issues, we propose an efficient and powerful convolutional neural network (CNN) based framework for features extraction using embedded processing on smart cameras. Efficient, high performance, pre-trained CNNs are separately fine-tuned on persons and vehicles to obtain discriminative, low dimensional features from segmented surveillance objects. Furthermore, multi-view queries of surveillance objects are used to improve retrieval performance. Experiments reveal better efficiency and retrieval performance in different surveillance datasets. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:297 / 311
页数:15
相关论文
共 25 条
[1]   Efficient object-based surveillance image search using spatial pooling of convolutional features [J].
Ahmad, Jamil ;
Mehmood, Irfan ;
Baik, Sung Wook .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 45 :62-76
[2]   Multi-scale local structure patterns histogram for describing visual contents in social image retrieval systems [J].
Ahmad, Jamil ;
Sajjad, Muhammad ;
Rho, Seungmin ;
Baik, Sung Wook .
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (20) :12669-12692
[3]  
[Anonymous], C AUT ROB SYST
[4]  
[Anonymous], J REAL TIME IMAGE PR
[5]  
[Anonymous], J VIS COMMUN IMAGE R
[6]  
[Anonymous], 2014, P INT C INT MULT COM
[7]  
[Anonymous], 2014, Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM), 2014 International Conference on
[8]  
[Anonymous], 2010, Asian Conference on Computer Vision
[9]  
Baltieri D., 2011, P 2011 JOINT ACM WOR, P59
[10]   FPGA-Based Multimodal Embedded Sensor System Integrating Low- and Mid-Level Vision [J].
Botella, Guillermo ;
Antonio Martin H, Jose ;
Santos, Matilde ;
Meyer-Baese, Uwe .
SENSORS, 2011, 11 (08) :8164-8179