Deep Neural Network for Structural Prediction and Lane Detection in Traffic Scene

被引:321
作者
Li, Jun [1 ]
Mei, Xue [2 ]
Prokhorov, Danil [3 ]
Tao, Dacheng [1 ]
机构
[1] Univ Technol Sydney, Fac Engn & Informat, Ctr Quantum Computat Intelligent Syst, Ultimo, NSW 2007, Australia
[2] Toyota Res Inst, Future Mobil Res Dept, Ann Arbor, MI 48105 USA
[3] Toyota Res Inst, Ann Arbor, MI 48105 USA
基金
澳大利亚研究理事会;
关键词
Image recognition; pattern analysis; recurrent neural networks; LEARNING DEEP; ADAPTIVE-BEHAVIOR; ALGORITHM; TRACKING;
D O I
10.1109/TNNLS.2016.2522428
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical neural networks have been shown to be effective in learning representative image features and recognizing object classes. However, most existing networks combine the low/middle level cues for classification without accounting for any spatial structures. For applications such as understanding a scene, how the visual cues are spatially distributed in an image becomes essential for successful analysis. This paper extends the framework of deep neural networks by accounting for the structural cues in the visual signals. In particular, two kinds of neural networks have been proposed. First, we develop a multitask deep convolutional network, which simultaneously detects the presence of the target and the geometric attributes (location and orientation) of the target with respect to the region of interest. Second, a recurrent neuron layer is adopted for structured visual detection. The recurrent neurons can deal with the spatial distribution of visible cues belonging to an object whose shape or structure is difficult to explicitly define. Both the networks are demonstrated by the practical task of detecting lane boundaries in traffic scenes. The multitask convolutional neural network provides auxiliary geometric information to help the subsequent modeling of the given lane structures. The recurrent neural network automatically detects lane boundaries, including those areas containing no marks, without any explicit prior knowledge or secondary modeling.
引用
收藏
页码:690 / 703
页数:14
相关论文
共 49 条
[1]   Measuring the Objectness of Image Windows [J].
Alexe, Bogdan ;
Deselaers, Thomas ;
Ferrari, Vittorio .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2189-2202
[2]   Real time Detection of Lane Markers in Urban Streets [J].
Aly, Mohamed .
2008 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1-3, 2008, :165-170
[3]  
[Anonymous], 2006, NIPS
[4]  
[Anonymous], 2007, Predicting structured data
[5]  
[Anonymous], 2011, P 28 INT C MACH LEAR
[6]  
[Anonymous], 2014, Neural Information Processing Systems
[7]   Recent progress in road and lane detection: a survey [J].
Bar Hillel, Aharon ;
Lerner, Ronen ;
Levi, Dan ;
Raz, Guy .
MACHINE VISION AND APPLICATIONS, 2014, 25 (03) :727-745
[8]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[9]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[10]   On the Complexity of Neural Network Classifiers: A Comparison Between Shallow and Deep Architectures [J].
Bianchini, Monica ;
Scarselli, Franco .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (08) :1553-1565