Vehicle and Pedestrian Detection Algorithm Based on Lightweight YOLOv3-Promote and Semi-Precision Acceleration

被引:58
作者
Xu, He [1 ,2 ]
Guo, Mingtao [1 ,2 ]
Nedjah, Nadia [3 ]
Zhang, Jindan [4 ]
Li, Peng [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China
[2] Jiangsu High Technol Res Key Lab Wireless Sensor, Nanjing 210023, Jiangsu, Peoples R China
[3] Univ Estado Rio De Janeiro, Dept Elect Engn & Telecommun, BR-20550013 Rio De Janeiro, Brazil
[4] Xianyang Vocat Tech Coll, Xianyang 712000, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolution; Kernel; Computational modeling; Real-time systems; Object detection; Feature extraction; Telecommunications; Target detection; lightweight model; model prune; model deployment; semi-precision acceleration;
D O I
10.1109/TITS.2021.3137253
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Aiming at the shortcomings of the current YOLOv3 model, such as large size, slow response speed, and difficulty in deploying to real devices, this paper reconstructs the target detection model YOLOv3, and proposes a new lightweight target detection network YOLOv3-promote: Firstly, the G-Module combined with the Depth-Wise convolution is used to construct the backbone network of the entire model, and the attention mechanism is introduced and added to perform weighting operations on each channel to get more key features and remove redundant features, thereby strengthening the identification ability of feature network model's to distinguish target objects among background; Secondly, in order to delete some less important channels to achieve the effect of compressing the model size and improving the calculation speed, the size of the scaling factor gamma in the batch normalization layer is used; Finally, based on NVIDIA's TensorRT framework model conversion and half-precision acceleration were carried out, and the accelerated model was successfully deployed on the embedded platform Jetson Nano. The performed KITTI experimental results show that the inference speed of our proposed method is about 5 times that of the original model, the parameter volume is reduced to one tenth, the mAP is increased from 86.1% of the original model to 93.1%, and the FPS reaches 25.5fps, realizing the requirements of real-time detection with high precision.
引用
收藏
页码:19760 / 19771
页数:12
相关论文
共 40 条
[1]   Architecture of Fog-Enabled and Cloud-Enhanced Internet of Things Applications [J].
Ahuja, Sanjay P. ;
Wheeler, Nathan .
INTERNATIONAL JOURNAL OF CLOUD APPLICATIONS AND COMPUTING, 2020, 10 (01) :1-10
[2]   IoT transaction processing through cooperative concurrency control on fog-cloud computing environment [J].
Al-Qerem, Ahmad ;
Alauthman, Mohammad ;
Almomani, Ammar ;
Gupta, B. B. .
SOFT COMPUTING, 2020, 24 (08) :5695-5711
[3]  
Bochkovskiy A., 2020, PREPRINT
[4]   Vision meets robotics: The KITTI dataset [J].
Geiger, A. ;
Lenz, P. ;
Stiller, C. ;
Urtasun, R. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237
[5]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[6]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[7]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[8]   An overview of Internet of Things (IoT): Architectural aspects, challenges, and protocols [J].
Gupta, B. B. ;
Quamara, Megha .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (21)
[9]   GhostNet: More Features from Cheap Operations [J].
Han, Kai ;
Wang, Yunhe ;
Tian, Qi ;
Guo, Jianyuan ;
Xu, Chunjing ;
Xu, Chang .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1577-1586
[10]  
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830