Detection of Vehicles in Multisensor Data via Multibranch Convolutional Neural Networks

被引:37
作者
Schilling, Hendrik [1 ]
Bulatov, Dimitri [1 ]
Niessner, Robin [1 ]
Middelmann, Wolfgang [1 ]
Soergel, Uwe [2 ]
机构
[1] Fraunhofer Inst Optron Syst Technol & Image Explo, Dept Scene Anal, D-76131 Karlsruhe, Germany
[2] Univ Stuttgart, Inst Photogrammetry, D-70174 Stuttgart, Germany
关键词
Convolution; image classification; neural networks; object detection; remote sensing; vehicles; CAR DETECTION; IMAGES; LIDAR; SYSTEM;
D O I
10.1109/JSTARS.2018.2825099
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural networks, or CNNs, raised the bar for most computer vision problems and have an increasing impact in remote sensing. However, since they usually contain multiple pooling layers, detection of exact borders of small objects at their original resolution remains yet a challenging topic. Additionally, efforts are being made to reduce the amount of training data. In this paper, we investigate the potential of fully convolutional neural networks (FCNs) for individual vehicle detection in combined elevation and optical data using relatively few training samples. By the proposed multibranch CNN, we combine object recognition within a deep learning framework with the object segmentation at a high resolution, for which two CNN branches are employed. Data fusion is accomplished with a pseudo-Siamese approach. The pixelwise classification likelihood, also referred to as heatmap, is harmoniously post processed by a vectorization module, which is based on the minimum bounding rectangle (MBR) extraction and allows for delineation of groups of vehicles. Two methods were developed in which MBRs are supported either by pairs of parallel lines or by region growing. Our approach allows efficient training with few training samples, while delivering high-quality detection results and good computational performance. In our detailed evaluation, we investigate the benefits of data fusion and compare our approach to other state-of-the-art networks. Different datasets were used, containing optical images and elevation data, derived either from airborne laser scanning or from photogrammetric reconstruction. The obtained results are very promising with F-1 scores up to 97%.
引用
收藏
页码:4299 / 4316
页数:18
相关论文
共 71 条
[21]   Vehicle Detection in Aerial Surveillance Using Dynamic Bayesian Networks [J].
Cheng, Hsu-Yung ;
Weng, Chih-Chia ;
Chen, Yi-Ying .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (04) :2152-2159
[22]   MEAN SHIFT, MODE SEEKING, AND CLUSTERING [J].
CHENG, YZ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (08) :790-799
[23]  
D'Errico John, 2017, MINIMUM BOUNDING SUI
[24]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[25]   Toward Fast and Accurate Vehicle Detection in Aerial Images Using Coupled Region-Based Convolutional Neural Networks [J].
Deng, Zhipeng ;
Sun, Hao ;
Zhou, Shilin ;
Zhao, Juanping ;
Zou, Huanxin .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (08) :3652-3664
[26]   Convolutional Neural Network With Data Augmentation for SAR Target Recognition [J].
Ding, Jun ;
Chen, Bo ;
Liu, Hongwei ;
Huang, Mengyuan .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (03) :364-368
[27]   Deep Supervised and Contractive Neural Network for SAR Image Classification [J].
Geng, Jie ;
Wang, Hongyu ;
Fan, Jianchao ;
Ma, Xiaorui .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (04) :2442-2459
[28]  
Girshick R., 2014, PROC CVPR IEEE, DOI [DOI 10.1109/CVPR.2014.81, 10.1109/CVPR.2014.81]
[29]   Dense Multi-Stereo Matching for High Quality Digital Elevation Models [J].
Haala, Norbert ;
Rothermel, Mathias .
PHOTOGRAMMETRIE FERNERKUNDUNG GEOINFORMATION, 2012, (04) :331-343
[30]  
He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]