EfficientLiteDet: a real-time pedestrian and vehicle detection algorithm

被引:0
作者
Chintakindi Balaram Murthy
Mohammad Farukh Hashmi
Avinash G. Keskar
机构
[1] National Institute of Technology,Department of Electronics and Communication Engineering
[2] National Institute of Technology,Department of Electronics and Communication Engineering
[3] Visvesvaraya National Institute of Technology,undefined
来源
Machine Vision and Applications | 2022年 / 33卷
关键词
Computer vision (CV); EfficientLiteDet; Light-weight; Pedestrian and vehicle detection; Tiny-YOLOv4;
D O I
暂无
中图分类号
学科分类号
摘要
Since safety plays a crucial role and the top priority, in both unmanned and driver-assistance driving systems, there is a need of efficient and accurate detection of captured objects by object detection algorithms in real-time. Directly applying existing models to tackle real-time pedestrian and vehicle detection tasks captured by high speed moving vehicle scenarios has two problems. First, the target scale varies drastically because the vehicle speed changes greatly. Second, captured images contain both tiny targets and high density targets, which brings in occlusion between targets. To solve the two issues, an efficient light weight real-time detection algorithm is proposed, which is referred to as EfficientLiteDet. Based on Tiny-YOLOv4, one more prediction head is introduced in the proposed model to detect multi-scale targets effectively. In order to detect tiny and occluded denser targets, we used Transformer Prediction Heads (TPH) instead of original anchor detection heads in our model. To explore the potential of self-attention mechanism in TPH, the proposed model integrates “convolutional block attention model” to locate crucial attention region on scenarios with denser targets. Further to improve the detection performance of our model, we applied various data augmentation strategies such as mosaic, mix-up, multi-scale, and random-horizontal-flip during the model training. Extensive experiments are conducted on five challenging pedestrian and vehicle datasets shows that the EfficientLiteDet model has better performance in real-time scenarios. On Pascal Voc-2007, Highway and Udacity datasets, the proposed model achieves mean average precision (mAP) 87.3%, 80.1% and 77.8%, respectively, which is quite better than Tiny-YOLOv4 state-of-the-art algorithm by + 2.4%, 1.8% and + 2.4%, respectively.
引用
收藏
相关论文
共 77 条
[11]  
Tian Y(2018)Scale-aware fast R-CNN for pedestrian detection IEEE Trans. Multimed. 20 985-947
[12]  
Yang G(2019)Small-scale pedestrian detection based on deep neural network IEEE Trans. Intell. Transp. Syst. 21 3046-23
[13]  
Wang Z(2020)Pedestrian detection algorithm for intelligent vehicles in complex scenarios Sensors 20 3646-16
[14]  
Wang H(2020)Ratio-and-scale-aware YOLO for pedestrian detection IEEE Trans. Image Process. 30 934-110236
[15]  
Li E(2019)An improved tiny-yolov3 pedestrian detection algorithm Optik 183 17-419
[16]  
Liang Z(2019)Vision-based vehicle detection and counting system using deep learning in highway scenes Eur. Transp. Res. Rev. 11 1-undefined
[17]  
He K(2020)Data-driven based Tiny-YOLOv3 method for front vehicle detection inducing SPP-Net IEEE Access 8 110227-undefined
[18]  
Zhang X(2020)Refning yolov4 for vehicle detection Int. J. Adv. Res. Eng. Technol. (IJARET) 11 409-undefined
[19]  
Ren S(2021)Weighted boxes fusion: Ensembling boxes from different object detection models Image Vis. Comput. 107 104117-undefined
[20]  
Sun J(undefined)undefined undefined undefined undefined-undefined