A Traffic-Sign Detection Algorithm Based on Improved Sparse R-cnn

被引:35
作者
Cao, Jinghao [1 ]
Zhang, Junju [1 ]
Jin, Xin [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Jiangsu, Peoples R China
关键词
Feature extraction; Proposals; Object detection; Task analysis; Heuristic algorithms; Head; Detection algorithms; Deep learning; object detection; traffic-sign detection; self-attention mechanism; improved Sparse R-cnn;
D O I
10.1109/ACCESS.2021.3109606
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic traffic-sign detection is a hot topic in computer vision and one of the critical technologies of intelligent transportation. The Transformer structure has recently become a research hotspot due to its excellent performance. We hope to apply this structure to the design of traffic sign detection algorithms. Therefore, we make some improvements to Sparse R-cnn, a neural network model inspired by Transformer. Sparse R-cnn is a novel model, and its core idea is to replace hundreds of thousands of candidate anchors in the RPN network with a small set of proposal boxes. The experiments in our paper proved that the performance of the Sparse R-cnn model is better than other existing general object detection models. Based on the original Sparse R-cnn inspiration, an improved Sparse R-cnn model is proposed. First, a novel backbone for the task of traffic-sign detection is proposed. Multi-scale fusion structure is the essential method of improving the algorithm for small target detection, so improving the multi-scale capability of the backbone is a required method for designing traffic sign detection. So, we made further improvements to the existing backbone ResNest. We enhanced the multi-scale representation ability of the backbone by constructing hierarchical residual-like connections within each single radix block in the original ResNest. We call the improved backbone Res2Nest. The novel backbone proposed by us shows better performance without introducing excessive computational costs to the model. In addition, the attention mechanism is also an effective method to improve the detection of traffic signs, so we set up a branch network for recalibrating the channel feature response adaptively through the Global Average Pooling (GAP) operation and a fully connected layer. It can also be seen as the implementation of the cross-channel self-attention mechanism. After experiments by TT100K dataset, our method would attain a better accuracy and robustness.
引用
收藏
页码:122774 / 122788
页数:15
相关论文
共 52 条
[1]  
[Anonymous], 2014, Comput. Sci.
[2]   Traffic Sign Recognition Using Evolutionary Adaboost Detection and Forest-ECOC Classification [J].
Baro, Xavier ;
Escalera, Sergio ;
Vitria, Jordi ;
Pujol, Oriol ;
Radeva, Petia .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2009, 10 (01) :113-126
[3]   SURF: Speeded up robust features [J].
Bay, Herbert ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :404-417
[4]   Traffic Sign Detection and Recognition Using Multi-Scale Fusion and Prime Sample Attention [J].
Cao, Jinghao ;
Zhang, Junju ;
Huang, Wei .
IEEE ACCESS, 2021, 9 :3579-3591
[5]   Prime Sample Attention in Object Detection [J].
Cao, Yuhang ;
Chen, Kai ;
Loy, Chen Change ;
Lin, Dahua .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11580-11588
[6]  
Chutian D., 2020, P INT C COMP DAT SCI, P19, DOI [10.1109/CDS49703.2020.00010, DOI 10.1109/CDS49703.2020.00010]
[7]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[8]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9]  
Dubey A. R., 2020, DETECTION CLASSICATI, V766, DOI [10.1007/978-981-13-9683-0_6, DOI 10.1007/978-981-13-9683-0_6]
[10]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448