Optimized MobileNet plus SSD: a real-time pedestrian detection on a low-end edge device

被引:18
作者
Murthy, Chintakindi Balaram [1 ]
Hashmi, Mohammad Farukh [1 ]
Keskar, Avinash G. [2 ]
机构
[1] Natl Inst Technol, Dept Elect & Commun Engn, Warangal, Andhra Pradesh, India
[2] Visvesvaraya Natl Inst Technol, Dept Elect & Commun Engn, Nagpur, Maharashtra, India
关键词
Pedestrian detection; Computer vision (CV); Optimized MobileNet plus SSD Network; Caltech pedestrian dataset; Jetson Nano board;
D O I
10.1007/s13735-021-00212-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the most fundamental challenges in computer vision is pedestrian detection since it involves both the classification and localization of pedestrians at a location. To achieve real-time pedestrian detection without having any loss in detection accuracy, an Optimized MobileNet + SSD network is proposed. There are four important components in pedestrian detection: feature extraction, deformation, occlusion handling and classification. The existing methods design these components either independently or in a sequential format, and the interaction among these components has not been explored yet. The proposed network lets the components work in coordination in such a manner that their strengths are improved and the number of parameters is decreased compared to recent detection architectures. We propose a concatenation feature fusion module for adding contextual information in the Optimized MobileNet + SSD network to improve the detection accuracy of pedestrians. The proposed model achieved 80.4% average precision with a detection speed of 34.01 frames per second (fps) when tested on the Jetson Nano board, which is much faster compared to standard video speed (30 fps). Experimental results have shown that the proposed network has a better detection effect during low light conditions and for darker pictures. Therefore, the proposed network is well suited for low-end edge devices.
引用
收藏
页码:171 / 184
页数:14
相关论文
共 32 条
[21]   Jointly Learning Deep Features, Deformable Parts, Occlusion and Classification for Pedestrian Detection [J].
Ouyang, Wanli ;
Zhou, Hui ;
Li, Hongsheng ;
Li, Quanquan ;
Yan, Junjie ;
Wang, Xiaogang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (08) :1874-1887
[22]  
Redmon J., 2015, YOU ONLY LOOK ONCE U, DOI [10.1109/CVPR.2016.91, DOI 10.1109/CVPR.2016.91]
[23]  
Redmon J, 2018, Arxiv, DOI arXiv:1804.02767
[24]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149
[25]  
Sadeghi MA, 2014, LECT NOTES COMPUT SC, V8689, P65, DOI 10.1007/978-3-319-10590-1_5
[26]  
Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
[27]  
Szarvas M, 2005, 2005 IEEE INTELLIGENT VEHICLES SYMPOSIUM PROCEEDINGS, P224
[28]  
Viola P, 2003, NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, P734
[29]   Single Shot Multibox Detector With Kalman Filter for Online Pedestrian Detection in Video [J].
Yang, Fan ;
Chen, Houjin ;
Li, Jupeng ;
Li, Feng ;
Wang, Lei ;
Yan, Xiaomiao .
IEEE ACCESS, 2019, 7 :15478-15488
[30]   Occluded Pedestrian Detection Through Guided Attention in CNNs [J].
Zhang, Shanshan ;
Yang, Jian ;
Schiele, Bernt .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6995-7003