Enhanced YOLO Network for Improving the Efficiency of Traffic Sign Detection

Cited by: 7
Authors
Cui, Yang [1]
Guo, Dong [1]
Yuan, Hao [1]
Gu, Hengzhi [1]
Tang, Hongbo [1]
Affiliation
[1] Changan Univ, Sch Automobile, Xian 710064, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2024, Vol. 14, Issue 02
Keywords
traffic sign detection; feature fusion; object detection
DOI
10.3390/app14020555
Chinese Library Classification (CLC) number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Precise detection and recognition of road traffic signs is an important task for autonomous driving. This research focuses on a set of 72 distinct traffic signs that are prevalent on urban roads in China, with the goal of developing an enhanced You Only Look Once (YOLO) network model tailored to this task. The modifications include removing the terminal convolution module and the Conv3 (C3) module from the backbone network. In addition, the 32-fold downsampling is replaced with 16-fold downsampling, and a feature fusion module with dimensions of 152 × 152 is introduced in the feature layer. To capture broader context, a novel hybrid spatial pyramid pooling module, referred to as Hybrid Spatial Pyramid Pooling Fast (H-SPPF), is introduced. A channel attention mechanism is also integrated into the framework, in combination with the three other improvements. On evaluation, the enhanced algorithm achieves a precision of 91.72%, a recall of 91.77%, and a mean average precision (mAP) of 93.88% at an intersection over union (IoU) threshold of 0.5. It also achieves an mAP of 75.81% across a range of IoU thresholds between 0.5 and 0.95. These results are validated on an augmented dataset established for this study.
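As a reading aid for the IoU thresholds quoted in the abstract, the following is a minimal Python sketch of how intersection over union is typically computed for two axis-aligned boxes, and of which thresholds in the 0.5 to 0.95 range a given prediction would satisfy. It is not taken from the paper: the function names, the `(x1, y1, x2, y2)` box format, and the 0.05 threshold step (a common convention) are all assumptions made for illustration.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2).

    The corner-coordinate box format is an assumption for this sketch.
    """
    # Corners of the overlapping rectangle (if any).
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Clamp to zero so that disjoint boxes yield no intersection.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def iou_thresholds(step_count=10):
    """Thresholds 0.50, 0.55, ..., 0.95 (the 0.05 step is assumed)."""
    return [round(0.5 + 0.05 * i, 2) for i in range(step_count)]


def satisfied_thresholds(pred_box, truth_box):
    """Thresholds at which the prediction would count as a match."""
    value = iou(pred_box, truth_box)
    return [t for t in iou_thresholds() if value >= t]
```

For example, `satisfied_thresholds((0, 0, 10, 10), (0, 0, 9, 8))` lists the thresholds a prediction with an IoU of 0.72 would pass. A detector scored only at the 0.5 threshold counts such a prediction as correct, while averaging over stricter thresholds penalizes loose localization, which is consistent with the lower figure reported for the 0.5 to 0.95 range.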
Pages: 15