A lightweight vehicles detection network model based on YOLOv5

被引:238
作者
Dong, Xudong [1 ]
Yan, Shuai [2 ]
Duan, Chaoqun [1 ]
机构
[1] Shanghai Univ, Sch Mechatron Engn & Automat, 99 Shangda Rd, Shanghai 200444, Peoples R China
[2] ShanghaiTech Univ, Sch Life Sci & Technol, Shanghai 201210, Peoples R China
基金
中国国家自然科学基金;
关键词
Vehicle detection; Deep learning; Attention mechanism; GhostBottleneck; Loss function;
D O I
10.1016/j.engappai.2022.104914
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Vehicle detection technology is of great significance for realizing automatic monitoring and AI-assisted driving systems. The state-of-the-art object detection method, namely, a class of YOLOv5, has often been used to detect vehicles. However, it suffers some challenges, such as a high computational load and undesirable detection rate. To address these issues, an improved lightweight YOLOv5 method is proposed for vehicle detection in this paper. In the presented method, C3Ghost and Ghost modules are introduced into the YOLOv5 neck network to reduce the floating-point operations (FLOPs) in the feature channel fusion process and enhance the feature expression performance. A convolutional block attention module (CBAM) is introduced to the YOLOv5 backbone network to select the information critical to the vehicle detection task and suppress uncritical information, thus improving the detection accuracy of the algorithm. Furthermore, CIoU_Loss is considered the bounding box regression loss function to accelerate the bounding box regression rate and improve the localization accuracy of the algorithm. To verify the performance of the proposed approach, we tested our model via two case studies, i.e., the PASCAL VOC dataset and MS COCO dataset. The results show that the detection precision of the proposed model increased 3.2%, the FLOPs decreased 15.24%, and the number of model parameters decreased 19.37% compared with those of the existing YOLOv5. Through case studies and comparisons, the effectiveness and superiority of the presented approach are demonstrated.
引用
收藏
页数:14
相关论文
共 54 条
[1]   Lightweight PVIDNet: A Priority Vehicles Detection Network Model Based on Deep Learning for Intelligent Traffic Lights [J].
Barbosa, Rodrigo Carvalho ;
Ayub, Muhammad Shoaib ;
Rosa, Renata Lopes ;
Rodriguez, Demostenes Zegarra ;
Wuttisittikulkij, Lunchakorn .
SENSORS, 2020, 20 (21)
[2]  
Bochkovskiy A., 2020, PREPRINT
[3]   Daytime Preceding Vehicle Brake Light Detection Using Monocular Vision [J].
Chen, Hua-Tsung ;
Wu, Yi-Chien ;
Hsu, Chun-Chieh .
IEEE SENSORS JOURNAL, 2016, 16 (01) :120-131
[4]   A self-constructing cascade classifier with AdaBoost and SVM for pedestrian detection [J].
Cheng, Wen-Chang ;
Jhan, Ding-Mao .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (03) :1016-1028
[5]  
Cui LS, 2020, Arxiv, DOI arXiv:1805.07009
[6]  
Fu CY, 2017, PREPRINT
[7]   Road pollution estimation from vehicle tracking in surveillance videos by deep convolutional neural networks [J].
Garcia-Gonzalez, Jorge ;
Molina-Cabello, Miguel A. ;
Luque-Baena, Rafael M. ;
Ortiz-de-Lazcano-Lobato, Juan M. ;
Lopez-Rubio, Ezequiel .
APPLIED SOFT COMPUTING, 2021, 113
[8]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[9]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[10]   GhostNet: More Features from Cheap Operations [J].
Han, Kai ;
Wang, Yunhe ;
Tian, Qi ;
Guo, Jianyuan ;
Xu, Chunjing ;
Xu, Chang .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1577-1586