Fast Detection of Multiple Objects in Traffic Scenes With a Common Detection Framework

被引:76
作者
Hu, Qichang [1 ,2 ]
Paisitkriangkrai, Sakrapee [1 ]
Shen, Chunhua [1 ,3 ]
van den Hengel, Anton [1 ,3 ]
Porikli, Fatih [2 ,3 ]
机构
[1] Univ Adelaide, Adelaide, SA 5005, Australia
[2] NICTA, Alexandria, NSW 1435, Australia
[3] Australian Ctr Robot Vis, Brisbane, Qld 4001, Australia
关键词
Traffic scene perception; traffic sign detection; car detection; cyclist detection; object subcategorization; ROAD-SIGN DETECTION; VISION; RECOGNITION; COLOR;
D O I
10.1109/TITS.2015.2496795
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Traffic scene perception (TSP) aims to extract accurate real-time on-road environment information, which involves three phases: detection of objects of interest, recognition of detected objects, and tracking of objects in motion. Since recognition and tracking often rely on the results from detection, the ability to detect objects of interest effectively plays a crucial role in TSP. In this paper, we focus on three important classes of objects: traffic signs, cars, and cyclists. We propose to detect all the three important objects in a single learning-based detection framework. The proposed framework consists of a dense feature extractor and detectors of three important classes. Once the dense features have been extracted, these features are shared with all detectors. The advantage of using one common framework is that the detection speed is much faster, since all dense features need only to be evaluated once in the testing phase. In contrast, most previous works have designed specific detectors using different features for each of these three classes. To enhance the feature robustness to noises and image deformations, we introduce spatially pooled features as a part of aggregated channel features. In order to further improve the generalization performance, we propose an object subcategorization method as a means of capturing the intraclass variation of objects. We experimentally demonstrate the effectiveness and efficiency of the proposed framework in three detection applications: traffic sign detection, car detection, and cyclist detection. The proposed framework achieves the competitive performance with state-of-the-art approaches on several benchmark data sets.
引用
收藏
页码:1002 / 1014
页数:13
相关论文
共 69 条
[1]   Learning to detect objects in images via a sparse, part-based representation [J].
Agarwal, S ;
Awan, A ;
Roth, D .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (11) :1475-1490
[2]  
Ahonen T, 2004, LECT NOTES COMPUT SC, V3021, P469
[3]  
[Anonymous], 2010, P 18 ACM INT C MULT, DOI [10.1145/1873951.1874249, 10.1145/1873951.1874249.2]
[4]  
[Anonymous], 2011, P 28 INT C MACHINE L
[5]  
Behley J, 2013, IEEE INT C INT ROBOT, P4195, DOI 10.1109/IROS.2013.6696957
[6]  
Broggi A, 2008, IEEE INT VEH SYM, P1104, DOI 10.1109/IVS.2008.4621308
[7]   Vehicle Localisation Using a Single Camera [J].
Cui, Jianzhu ;
Liu, Fuqiang ;
Li, Zhipeng ;
Jia, Zhen .
2010 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2010, :871-876
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]   Traffic sign recognition and analysis for intelligent vehicles [J].
de la Escalera, A ;
Armingol, JM ;
Mata, M .
IMAGE AND VISION COMPUTING, 2003, 21 (03) :247-258
[10]   Road traffic sign detection and classification [J].
delaEscalera, A ;
Moreno, LE ;
Salichs, MA ;
Armingol, JM .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 1997, 44 (06) :848-859