Enhanced YOLO Network for Improving the Efficiency of Traffic Sign Detection

被引：7

作者：

Cui, Yang ^{[1
]}

Guo, Dong ^{[1
]}

Yuan, Hao ^{[1
]}

Gu, Hengzhi ^{[1
]}

Tang, Hongbo ^{[1
]}

机构：

[1] Changan Univ, Sch Automobile, Xian 710064, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 02期

关键词：

traffic sign detection; feature fusion; object detection;

D O I：

10.3390/app14020555

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

One important task for autonomous driving is the precise detection and recognition of road traffic signs. This research focuses on a comprehensive set of 72 distinct traffic signs that are prevalent on urban roads in China, with the goal of developing an enhanced You Only Look Once (YOLO) network model tailored for this specific task. The modifications include the omission of the terminal convolution module and Conv3 (C3) module within the backbone network. Additionally, the 32-fold downsampling is replaced with a 16-fold downsampling, and a feature fusion module with dimensions of 152 x 152 is introduced in the feature layer. To capture a more encompassing context, a novel hybrid space pyramid pooling module, referred to as Hybrid Spatial Pyramid Pooling Fast (H-SPPF), is introduced. Furthermore, a channel attention mechanism is integrated into the framework, combined with three other improved methodologies. Upon evaluation, the enhanced algorithm demonstrates impressive results, achieving a precision rate of 91.72%, a recall rate of 91.77%, and a mean average precision (mAP) of 93.88% at an intersection over union (IoU) threshold of 0.5. Additionally, the method also achieves an mAP of 75.81% for a variety of IoU criteria between 0.5 and 0.95. These achievements are validated on an augmented dataset established for this study.

引用

页数：15

共 31 条

[1] Ahmad T., 2021, P 2021 IEEE 6 INT C, DOI [10.1109/icccbda51879.2021.9442501, DOI 10.1109/ICCCBDA51879.2021.9442501]
[2] Cui LS, 2020, Arxiv, DOI arXiv:1805.07009
[3] Context-Aware Block Net for Small Object Detection
Cui, Lisha
Lv, Pei
Jiang, Xiaoheng
Gao, Zhimin
Zhou, Bing
Zhang, Luming
Shao, Ling
Xu, Mingliang
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (04) : 2300 - 2313
[4] Res2Net: A New Multi-Scale Backbone Architecture
Gao, Shang-Hua
Cheng, Ming-Ming
Zhao, Kai
Zhang, Xin-Yu
Yang, Ming-Hsuan
Torr, Philip
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (02) : 652 - 662
[5] Traffic Sign Recognition and Classification Using YOLOv2, Faster RCNN and SSD
Garg, Priya
Chowdhury, Debapriyo Roy
More, Vidya N.
[J]. 2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
[6] Fast R-CNN
Girshick, Ross
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
[7] Rich feature hierarchies for accurate object detection and semantic segmentation
Girshick, Ross
Donahue, Jeff
Darrell, Trevor
Malik, Jitendra
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
[8] He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[9] Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) : 1904 - 1916
[10] Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]

← 1 2 3 4 →