Traffic Sign Detection and Recognition Using Multi-Scale Fusion and Prime Sample Attention

被引:19
|
作者
Cao, Jinghao [1 ]
Zhang, Junju [1 ]
Huang, Wei [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
关键词
Traffic sign detection; multi-scale; prime sample attention; features extract;
D O I
10.1109/ACCESS.2020.3047414
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traffic sign detection, though one of the key technologies in intelligent transportation, still has bottleneck in accuracy due to the small size and diversity of traffic signs. To solve this problem, we proposed a two-stage CNN object detection algorithm based on multi-scale feature fusion and prime sample attention. We improved the original Faster R-cnn model in terms of feature extraction and sampling strategy. For feature extraction, to elevate the ability of neural networks to detect small objects, we adopted HRNet as the feature extractor. There are four stages in HRNet - a series of high resolution subnets as the starting point with repeated adding parallel high to low resolution subnets to form other stages. In the whole process, the information in the parallel multi-resolution sub-network is repeatedly exchanged to perform repeated multi-scale fusion. For sampling strategy, we adopted a simple and effective sampling and learning strategy called Prime Sample Attention (PISA), consisting of Importance-based Sample Reweighting (ISR) and Classification Aware Regression Loss (CARL). PISA proposed the concepts of IoU Hierarchical Partial Sorting (IoU-HLR) and Hierarchical Partial Score Sorting (Score-HLR), which sort the importance of positive samples and negative samples in mini-batch respectively. With the proposed method, the training process is focusing on prime samples rather than evenly treat all ones. The algorithm complexity of our method is lower than that of other state-of-the-art. After experiments by TT100K dataset, our method can attain a comparable or even better detection accuracy and robustness.
引用
收藏
页码:3579 / 3591
页数:13
相关论文
共 50 条
  • [21] Small Object Detection using Multi-scale Feature Fusion and Attention
    Liu, Baokai
    Du, Shiqiang
    Li, Jiacheng
    Wang, Jianhua
    Liu, Wenjie
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7246 - 7251
  • [22] Traffic Sign Detection in Complex Environment based on Multi-Scale Feature Enhancement and Group Attention
    Fu, Jinfei
    Zhou, Yinghua
    6TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE, ICIAI2022, 2022, : 137 - 142
  • [23] Localized Traffic Sign Detection with Multi-scale Deconvolution Networks
    Pei, Songwen
    Tang, Fuwu
    Ji, Yanfei
    Fan, Jing
    Ning, Zhong
    2018 IEEE 42ND ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2018, : 355 - 360
  • [24] MAFormer: A transformer network with multi-scale attention fusion for visual recognition
    Sun, Huixin
    Wang, Yunhao
    Wang, Xiaodi
    Zhang, Bin
    Xin, Ying
    Zhang, Baochang
    Cao, Xianbin
    Ding, Errui
    Han, Shumin
    NEUROCOMPUTING, 2024, 595
  • [25] A Novel Network Traffic Anomaly Detection Based on Multi-scale Fusion
    Cheng, Guozhen
    Cheng, Dongnian
    Lei, He
    MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION, PTS 1 AND 2, 2011, 48-49 : 102 - 105
  • [26] Text Detection Algorithm Based on Multi-Scale Attention Feature Fusion
    She, Xiangyang
    Liu, Zhe
    Dong, Lihong
    Computer Engineering and Applications, 2024, 60 (01) : 198 - 206
  • [27] JAMFN: Joint Attention Multi-Scale Fusion Network for Depression Detection
    Zhou, Li
    Liu, Zhenyu
    Shangguan, Zixuan
    Yuan, Xiaoyan
    Li, Yutong
    Hu, Bin
    INTERSPEECH 2023, 2023, : 3417 - 3421
  • [28] Multi-Scale Feature Attention Fusion for Image Splicing Forgery Detection
    Liang, Enji
    Zhang, Kuiyuan
    Hua, Zhongyun
    Jia, Xiaohua
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2025, 21 (01)
  • [29] Multi-scale Information Fusion Combined with Residual Attention for Text Detection
    Zhao, Wenxiu
    Dongye, Changlei
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT II, 2024, 14448 : 506 - 518
  • [30] JAMFN: Joint Attention Multi-Scale Fusion Network for Depression Detection
    Zhou, Li
    Liu, Zhenyu
    Shangguan, Zixuan
    Yuan, Xiaoyan
    Li, Yutong
    Hu, Bin
    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023, 2023-August : 3417 - 3421