VSSA-NET: Vertical Spatial Sequence Attention Network for Traffic Sign Detection

被引:145
作者
Yuan, Yuan [1 ,2 ]
Xiong, Zhitong [1 ,2 ]
Wang, Qi [1 ,2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Ctr Opt IMagery Anal & Learning Optimal, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Traffic sign detection; context modeling; small object; sequence attention model;
D O I
10.1109/TIP.2019.2896952
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although traffic sign detection has been studied for years and great progress has been made with the rise of deep learning technique, there are still many problems remaining to be addressed. For complicated real-world traffic scenes, there are two main challenges. First, traffic signs are usually small-sized objects, which makes them more difficult to detect than large ones; second, it is hard to distinguish false targets which resemble real traffic signs in complex street scenes without context information. To handle these problems, we propose a novel end-to-end deep learning method for traffic sign detection in complex environments. Our contributions are as follows: 1) we propose a multi-resolution feature fusion network architecture which exploits densely connected deconvolution layers with skip connections, and can learn more effective features for a small-size object and 2) we frame the traffic sign detection as a spatial sequence classification and regression task, and propose a vertical spatial sequence attention module to gain more context information for better detection performance. To comprehensively evaluate the proposed method, we experiment on several traffic sign datasets as well as the general object detection dataset, and the results have shown the effectiveness of our proposed method.
引用
收藏
页码:3423 / 3434
页数:12
相关论文
共 41 条
[1]  
[Anonymous], 2005, PROC CVPR IEEE
[2]   Real-time speed sign detection using the radial symmetry detector [J].
Barnes, Nick ;
Zelinsky, Alexander ;
Fletcher, Luke S. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2008, 9 (02) :322-332
[3]  
Bell S., 2015, Inside-outside net: Detecting objects in context with skip pooling and recurrent neural networks
[4]   A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection [J].
Cai, Zhaowei ;
Fan, Quanfu ;
Feris, Rogerio S. ;
Vasconcelos, Nuno .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :354-370
[5]   Feature-Fused SSD: Fast Detection for Small Objects [J].
Cao, Guimei ;
Xie, Xuemei ;
Yang, Wenzhe ;
Liao, Quan ;
Shi, Guangming ;
Wu, Jinjian .
NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
[6]   Boosted road sign detection and recognition [J].
Chen, Sin-Yu ;
Hsieh, Jun-wei .
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, :3823-3826
[7]   Accurate and Efficient Traffic Sign Detection Using Discriminative AdaBoost and Support Vector Regression [J].
Chen, Tao ;
Lu, Shijian .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2016, 65 (06) :4006-4015
[8]  
Chen X., 2017, SPATIAL MEMORY CONTE
[9]  
Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036
[10]   Fast Feature Pyramids for Object Detection [J].
Dollar, Piotr ;
Appel, Ron ;
Belongie, Serge ;
Perona, Pietro .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (08) :1532-1545