A Cascaded R-CNN With Multiscale Attention and Imbalanced Samples for Traffic Sign Detection

被引:244
作者
Zhang, Jianming [1 ]
Xie, Zhipeng [1 ,2 ]
Sun, Juan [1 ,2 ]
Zou, Xin [1 ,2 ]
Wang, Jin [1 ]
机构
[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China
[2] Changsha Univ Sci & Technol, Hunan Prov Key Lab Intelligent Proc Big Data Tran, Changsha 410114, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
基金
中国国家自然科学基金;
关键词
Traffic sign detection; convolutional neural network; attention; object detection; Multiscale; MODEL; NETWORKS;
D O I
10.1109/ACCESS.2020.2972338
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the deep learning is applied to the field of traffic sign detection methods which achieves excellent performance. However, there are two main challenges in traffic sign detection to be solve urgently. For one thing, some traffic signs of small size are more difficult to detect than those of large size so that the small traffic signs are undetected. For another, some false signs are always detected because of interferences caused by the illumination variation, bad weather and some signs similar to the true traffic signs. Therefore, to solve the undetection and false detection, we first propose a cascaded R-CNN to obtain the multiscale features in pyramids. Each layer of the cascaded network except the first layer fuses the output bounding box of the previous one layer for joint training. This method contributes to the traffic sign detection. Then, we propose a multiscale attention method to obtain the weighted multiscale features by dot-product and softmax, which is summed to fine the features to highlight the traffic sign features and improve the accuracy of the traffic sign detection. Finally, we increase the number of difficult negative samples for dataset balance and data augmentation in the training to relieve the interference by complex environment and similar false traffic signs. The data augment method expands the German traffic sign training dataset by simulation of complex environment changes. We conduct numerous experiments to verify the effectiveness of our proposed algorithm. The accuracy and recall rate of our method are 98.7; and 90.5; in GTSDB, 99.7; and 83.62; in CCTSDB and 98.9; and 85.6; in Lisa dataset respectively.
引用
收藏
页码:29742 / 29754
页数:13
相关论文
共 49 条
[1]  
[Anonymous], 2017, P IEEE INT C COMPUTE
[2]   Evaluation of deep neural networks for traffic sign detection systems [J].
Arcos-Garcia, Alvaro ;
Alvarez-Garcia, Juan A. ;
Soria-Morillo, Luis M. .
NEUROCOMPUTING, 2018, 316 :332-344
[3]  
Aziz MJ, 2018, INT BHURBAN C APPL S, P579, DOI 10.1109/IBCAST.2018.8312283
[4]  
Cao Y., 2019, P IEEE CVF INT C COM, DOI [10.1109/ICCVW.2019.00246, DOI 10.1109/ICCVW.2019.00246]
[5]   A novel online incremental and decremental learning algorithm based on variable support vector machine [J].
Chen, Yuantao ;
Xiong, Jie ;
Xu, Weihong ;
Zuo, Jingwen .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3) :S7435-S7445
[6]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]  
Hendrycks Dan, 2018, Benchmarking neural network robustness to common corruptions and perturbations
[9]  
Houben S, 2013, IEEE INT C INTELL TR, P7, DOI 10.1109/ITSC.2013.6728595
[10]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]