EfficientLiteDet: a real-time pedestrian and vehicle detection algorithm

被引:0
作者
Chintakindi Balaram Murthy
Mohammad Farukh Hashmi
Avinash G. Keskar
机构
[1] National Institute of Technology,Department of Electronics and Communication Engineering
[2] National Institute of Technology,Department of Electronics and Communication Engineering
[3] Visvesvaraya National Institute of Technology,undefined
来源
Machine Vision and Applications | 2022年 / 33卷
关键词
Computer vision (CV); EfficientLiteDet; Light-weight; Pedestrian and vehicle detection; Tiny-YOLOv4;
D O I
暂无
中图分类号
学科分类号
摘要
Since safety plays a crucial role and the top priority, in both unmanned and driver-assistance driving systems, there is a need of efficient and accurate detection of captured objects by object detection algorithms in real-time. Directly applying existing models to tackle real-time pedestrian and vehicle detection tasks captured by high speed moving vehicle scenarios has two problems. First, the target scale varies drastically because the vehicle speed changes greatly. Second, captured images contain both tiny targets and high density targets, which brings in occlusion between targets. To solve the two issues, an efficient light weight real-time detection algorithm is proposed, which is referred to as EfficientLiteDet. Based on Tiny-YOLOv4, one more prediction head is introduced in the proposed model to detect multi-scale targets effectively. In order to detect tiny and occluded denser targets, we used Transformer Prediction Heads (TPH) instead of original anchor detection heads in our model. To explore the potential of self-attention mechanism in TPH, the proposed model integrates “convolutional block attention model” to locate crucial attention region on scenarios with denser targets. Further to improve the detection performance of our model, we applied various data augmentation strategies such as mosaic, mix-up, multi-scale, and random-horizontal-flip during the model training. Extensive experiments are conducted on five challenging pedestrian and vehicle datasets shows that the EfficientLiteDet model has better performance in real-time scenarios. On Pascal Voc-2007, Highway and Udacity datasets, the proposed model achieves mean average precision (mAP) 87.3%, 80.1% and 77.8%, respectively, which is quite better than Tiny-YOLOv4 state-of-the-art algorithm by + 2.4%, 1.8% and + 2.4%, respectively.
引用
收藏
相关论文
共 77 条
[1]  
Viola P(2005)Detecting pedestrians using patterns of motion and appearance Int. J. Comput. Vision 63 153-161
[2]  
Jones MJ(2020)Investigations of object detection in images/videos using various deep learning techniques and embedded platforms-A comprehensive review Appl. Sci. 10 3280-1760
[3]  
Snow D(2015)Filtered channel features for pedestrian detection Proc. CVPR 1 1751-426
[4]  
Murthy CB(2019)Apple detection during different growth stages in orchards using the improved YOLO-V3 model Comput. Electron. Agric. 157 417-1916
[5]  
Hashmi MF(2015)Spatial pyramid pooling in deep convolutional networks for visual recognition IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 37 1904-184
[6]  
Bokde ND(2021)Optimized MobileNet+ SSD: a real-time pedestrian detection on a low-end edge device Int. J. Multimed. Inf. Retr. 3 171-10609
[7]  
Geem ZW(2020)DenseLightNet: a light-weight vehicle detection network for autonomous driving IEEE Trans. Ind. Electron. 67 10600-37
[8]  
Zhang S(2021)Little-YOLO-SPP: A delicate real-time vehicle detection algorithm Optik 225 165818-1887
[9]  
Benenson R(2019)Efficient and robust pedestrian detection using deep learning for human-aware navigation Robot. Auton. Syst. 113 23-996
[10]  
Schiele B(2018)Joint on IEEE Trans. Pattern Anal. Mach. Intell. 40 1874-3055