Detection of Vehicles in Multisensor Data via Multibranch Convolutional Neural Networks

被引:37
作者
Schilling, Hendrik [1 ]
Bulatov, Dimitri [1 ]
Niessner, Robin [1 ]
Middelmann, Wolfgang [1 ]
Soergel, Uwe [2 ]
机构
[1] Fraunhofer Inst Optron Syst Technol & Image Explo, Dept Scene Anal, D-76131 Karlsruhe, Germany
[2] Univ Stuttgart, Inst Photogrammetry, D-70174 Stuttgart, Germany
关键词
Convolution; image classification; neural networks; object detection; remote sensing; vehicles; CAR DETECTION; IMAGES; LIDAR; SYSTEM;
D O I
10.1109/JSTARS.2018.2825099
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural networks, or CNNs, raised the bar for most computer vision problems and have an increasing impact in remote sensing. However, since they usually contain multiple pooling layers, detection of exact borders of small objects at their original resolution remains yet a challenging topic. Additionally, efforts are being made to reduce the amount of training data. In this paper, we investigate the potential of fully convolutional neural networks (FCNs) for individual vehicle detection in combined elevation and optical data using relatively few training samples. By the proposed multibranch CNN, we combine object recognition within a deep learning framework with the object segmentation at a high resolution, for which two CNN branches are employed. Data fusion is accomplished with a pseudo-Siamese approach. The pixelwise classification likelihood, also referred to as heatmap, is harmoniously post processed by a vectorization module, which is based on the minimum bounding rectangle (MBR) extraction and allows for delineation of groups of vehicles. Two methods were developed in which MBRs are supported either by pairs of parallel lines or by region growing. Our approach allows efficient training with few training samples, while delivering high-quality detection results and good computational performance. In our detailed evaluation, we investigate the benefits of data fusion and compare our approach to other state-of-the-art networks. Different datasets were used, containing optical images and elevation data, derived either from airborne laser scanning or from photogrammetric reconstruction. The obtained results are very promising with F-1 scores up to 97%.
引用
收藏
页码:4299 / 4316
页数:18
相关论文
共 71 条
[1]   Deep Learning Approach for Car Detection in UAV Imagery [J].
Ammour, Nassim ;
Alhichri, Haikel ;
Bazi, Yakoub ;
Benjdira, Bilel ;
Alajlan, Naif ;
Zuair, Mansour .
REMOTE SENSING, 2017, 9 (04)
[2]  
[Anonymous], 2014, THESIS
[3]  
[Anonymous], 2016 IEEE C COMPUTER
[4]  
[Anonymous], 2017, 34 ISPRS WG
[5]  
[Anonymous], 2007, P ACM INT S ADV GEOG
[6]  
[Anonymous], 2008, ISPRS
[7]  
[Anonymous], 2017, CoRR
[8]  
Audebert N., 2016, P INT C GEOGR OBJ BA
[9]   Semantic Segmentation of Earth Observation Data Using Multimodal and Multi-scale Deep Networks [J].
Audebert, Nicolas ;
Le Saux, Bertrand ;
Lefevre, Sebastien .
COMPUTER VISION - ACCV 2016, PT I, 2017, 10111 :180-196
[10]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495