Traffic Sign Detection and Recognition Using Multi-Scale Fusion and Prime Sample Attention

被引:19
|
作者
Cao, Jinghao [1 ]
Zhang, Junju [1 ]
Huang, Wei [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
关键词
Traffic sign detection; multi-scale; prime sample attention; features extract;
D O I
10.1109/ACCESS.2020.3047414
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traffic sign detection, though one of the key technologies in intelligent transportation, still has bottleneck in accuracy due to the small size and diversity of traffic signs. To solve this problem, we proposed a two-stage CNN object detection algorithm based on multi-scale feature fusion and prime sample attention. We improved the original Faster R-cnn model in terms of feature extraction and sampling strategy. For feature extraction, to elevate the ability of neural networks to detect small objects, we adopted HRNet as the feature extractor. There are four stages in HRNet - a series of high resolution subnets as the starting point with repeated adding parallel high to low resolution subnets to form other stages. In the whole process, the information in the parallel multi-resolution sub-network is repeatedly exchanged to perform repeated multi-scale fusion. For sampling strategy, we adopted a simple and effective sampling and learning strategy called Prime Sample Attention (PISA), consisting of Importance-based Sample Reweighting (ISR) and Classification Aware Regression Loss (CARL). PISA proposed the concepts of IoU Hierarchical Partial Sorting (IoU-HLR) and Hierarchical Partial Score Sorting (Score-HLR), which sort the importance of positive samples and negative samples in mini-batch respectively. With the proposed method, the training process is focusing on prime samples rather than evenly treat all ones. The algorithm complexity of our method is lower than that of other state-of-the-art. After experiments by TT100K dataset, our method can attain a comparable or even better detection accuracy and robustness.
引用
收藏
页码:3579 / 3591
页数:13
相关论文
共 50 条
  • [31] Adaptive feature fusion with attention mechanism for multi-scale target detection
    Moran Ju
    Jiangning Luo
    Zhongbo Wang
    Haibo Luo
    Neural Computing and Applications, 2021, 33 : 2769 - 2781
  • [32] Pyramid attention object detection network with multi-scale feature fusion
    Chen, Xiu
    Li, Yujie
    Nakatoh, Yoshihisa
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [33] Hierarchical Feature Fusion With Text Attention For Multi-scale Text Detection
    Liu, Chao
    Zou, Yuexian
    Guan, Wenjie
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [34] Adaptive feature fusion with attention mechanism for multi-scale target detection
    Ju, Moran
    Luo, Jiangning
    Wang, Zhongbo
    Luo, Haibo
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (07): : 2769 - 2781
  • [35] EMDFNet: Efficient Multi-scale and Diverse Feature Network for Traffic Sign Detection
    Li, Pengyu
    Liu, Chenhe
    Li, Tengfei
    Wang, Xinyu
    Zhang, Shihui
    Yu, Dongyang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 120 - 136
  • [36] Lightweight Blueberry Fruit Recognition Based on Multi-Scale and Attention Fusion NCBAM
    Yang, Wenji
    Ma, Xinxin
    Hu, Wenchao
    Tang, Pengjie
    AGRONOMY-BASEL, 2022, 12 (10):
  • [37] Research on traffic sign recognition method based on multi-scale convolution neural network
    Wei T.
    Chen X.
    Yin Y.
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2021, 39 (04): : 891 - 900
  • [38] Fast Traffic Sign Recognition Algorithm Based on Multi-scale Convolutional Neural Network
    Zhao, Cai
    Zheng, Wen
    2020 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2020), 2020, : 125 - 130
  • [39] Isolated Sign Language Recognition with Multi-scale Features using LSTM
    Mercanoglu Sincan, Ozge
    Tur, Anil Osman
    Yalim Keles, Hacer
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [40] Multi-scale local-temporal similarity fusion for continuous sign language recognition
    Xie, Pan
    Cui, Zhi
    Du, Yao
    Zhao, Mengyi
    Cui, Jianwei
    Wang, Bin
    Hu, Xiaohui
    PATTERN RECOGNITION, 2023, 136