A lightweight multiple object detection algorithm for roadside perspective based on improved YOLOv4

Cited by: 0

Authors:

Jin, Li-Sheng [1,2]

Zhang, Shun-Ran [2]

Guo, Bai-Cang [1,2]

Wang, Huan-Huan [1]

Han, Zhuo-Tong [1]

Liu, Xing-Chen [1]

Affiliations:

[1] College of Vehicles and Energy, Yanshan University, Qinhuangdao

[2] College of Transportation, Jilin University, Changchun

Source:

Kongzhi yu Juece/Control and Decision | 2024 / Vol. 39 / No. 09

Keywords:

autonomous vehicle; deep learning; environmental perception; multiple object detection; roadside perspective; YOLOv4

DOI:

10.13195/j.kzyjc.2023.0545

Abstract:

Road traffic scenes require detecting vehicles of many categories and widely varying scales. Constructing structured data at low computational cost, so as to perceive beyond the line of sight and overcome the sight-distance limit of a single vehicle, is one of the important open problems in environment perception for autonomous vehicles. In this paper, we propose a lightweight multi-object detection algorithm for the roadside perspective that balances accuracy and real-time performance. First, an inverted residual network structure with an embedded channel-domain attention mechanism serves as the backbone, replacing the feature extraction network of the single-stage detection algorithm and using depthwise separable convolution to reduce the number of feature extraction parameters. Second, spatial pyramid pooling (SPP) processes the output feature map of the deep layers; feature maps of different depths are then selected from the lightweight backbone as outputs, and a path aggregation network (PANet) fuses deep semantic information with shallow low-level information to form the neck of the detection model. Finally, at appropriate network depths, the head of the detection model outputs feature maps of three different sizes to regress targets of different sizes within the same image. Together these components form the lightweight detection model M3-YOLOv4. Experimental results show that M3-YOLOv4 reaches an mAP of 0.906 on the RS-UA dataset, 1.1 % lower than YOLOv4. The parameter count of M3-YOLOv4 is reduced to 10 % of that of YOLOv4, and its forward inference speed on the same platform is also significantly faster. © 2024 Northeast University. All rights reserved.
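
The abstract attributes part of the parameter reduction to depthwise separable convolution. The following is a minimal sketch of why that substitution shrinks a layer, not the authors' implementation: the function names and the layer sizes (256 input channels, 512 output channels, 3x3 kernel) are hypothetical and are not taken from M3-YOLOv4, and bias terms are ignored.

```python
def standard_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a depthwise k x k convolution followed by a 1 x 1 pointwise convolution."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


if __name__ == "__main__":
    # Hypothetical layer sizes chosen only for illustration.
    c_in, c_out, k = 256, 512, 3
    std = standard_conv_params(c_in, c_out, k)
    sep = depthwise_separable_params(c_in, c_out, k)
    print(f"standard conv:            {std} weights")
    print(f"depthwise separable conv: {sep} weights")
    print(f"ratio:                    {sep / std:.3f}")
```

For a single layer the ratio equals 1/c_out + 1/k^2, so a 3x3 layer with many output channels keeps roughly one ninth of its weights. The whole-model figure reported in the abstract also depends on the rest of the architecture and cannot be derived from this one layer.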

Pages: 2885-2893

Page count: 8
