HybridNet: A fast vehicle detection system for autonomous driving

被引:73
作者
Dai, Xuerui [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Elect & Informat Engn, 3 Shang Yuan Cun, Beijing 100044, Peoples R China
关键词
Vehicle detection; CNN; Two-stage; Decision refinement;
D O I
10.1016/j.image.2018.09.002
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
For autonomous driving system, vehicle detection is an import part as well as a challenging problem due to the large intra-class differences caused by occlusion, truncation and different viewpoints. The detection system should be fast and accurate enough to support real-world applications. Most of the existing deep convolution neural network (CNN) based object detection methods can be roughly categorized into two streams: single-stage and two-stage modes. These single-stage methods are usually extremely fast and easy to train with losing some precision. As for two-stage methods, they often get high performance in object detection competitions, however, they are not competitive for real-world applications because of the speed limits. The detection system with high degree of precision and fast computation speed is desirable. In this paper, a new two-stage regression based cascade object detection system is proposed. This system can be fast detection of the vehicles which concentrated the advantages of the two aforementioned methods, denoted by HybridNet. In our design, the first and the second stage are both regression modes. We add a transitional stage to map proposals (generated in the first stage) on high resolution feature maps to get exact features for decision refinement in the second stage. The challenging KITTI and PASCAL VOC2007 data sets are used to evaluate our proposed method. The experimental results show that our approach is more fast and more accurate in vehicle detection than other state-of-the-art methods.
引用
收藏
页码:79 / 88
页数:10
相关论文
共 62 条
[1]  
Abadi M., 2015, P 12 USENIX S OPERAT
[2]  
Amato Giuseppe., 2016, Proceedings of the 2016 ACM Workshop on Multimedia COMMONS, P11
[3]  
[Anonymous], 2017, Deformable convolutional networks
[4]  
[Anonymous], TRAFF SAF FACTS
[5]   Robust object detection via soft cascade [J].
Bourdev, L ;
Brandt, J .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, :236-243
[6]  
Cai Z., 2017, CASCADE R CNN DELVIN
[7]   A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection [J].
Cai, Zhaowei ;
Fan, Quanfu ;
Feris, Rogerio S. ;
Vasconcelos, Nuno .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :354-370
[8]   Learning Complexity-Aware Cascades for Deep Pedestrian Detection [J].
Cai, Zhaowei ;
Saberian, Mohammad ;
Vasconcelos, Nuno .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3361-3369
[9]   Deep MANTA: A Coarse-to-fine Many-Task Network for joint 2D and 3D vehicle analysis from monocular image [J].
Chabot, Florian ;
Chaouch, Mohamed ;
Rabarisoa, Jaonary ;
Teuliere, Celine ;
Chateau, Thierry .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1827-1836
[10]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)