Improving public data for building segmentation from Convolutional Neural Networks (CNNs) for fused airborne lidar and image data using active contours

Cited by: 64
Authors
Griffiths, David [1]
Boehm, Jan [1]
Affiliation
[1] UCL, Dept Civil Environm & Geomat Engn, Gower St, London WC1E 6BT, England
Keywords
Deep learning; Convolutional neural networks; Segmentation; Image processing; Lidar; Aerial; Semantic segmentation; Aerial images; Point clouds; Extraction; Classification; Reconstruction
DOI
10.1016/j.isprsjprs.2019.05.013
CLC number
P9 [Physical geography]
Subject classification code
0705; 070501
Abstract
Robust and reliable automatic building detection and segmentation from aerial images and point clouds has been a prominent field of research in remote sensing, computer vision and point cloud processing for a number of decades. One of the largest issues associated with deep learning methods is the large quantity of data required for training. To help address this, we present a method to improve public GIS building footprint labels by using Morphological Geodesic Active Contours (MorphGACs). We demonstrate that, by improving the quality of building footprint labels for detection and semantic segmentation, more robust and reliable models can be obtained. We evaluate these methods over a large UK-based dataset of 24556 images containing 169835 building instances. This is achieved by training several Mask/Faster R-CNN and RetinaNet deep convolutional neural networks. Networks are supplied with both RGB and fused RGB-lidar data. We offer quantitative analysis of the benefits of including depth data for building segmentation. By employing both methods we achieve a detection accuracy of 0.92 (mAP@0.5) and a segmentation F1 score of 0.94 over 4911 test images ranging from urban to rural scenes.
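The two figures quoted in the abstract rest on standard overlap measures. The following is a minimal sketch, not the authors' evaluation code, of how a pixel-wise F1 score and an intersection-over-union (IoU) value could be computed for a pair of binary building masks. The function names, the nested-list mask representation and the use of mask IoU for the 0.5 matching threshold are all assumptions made for illustration; a full mAP@0.5 computation additionally ranks detections by confidence and averages precision over recall levels, which is not shown here.

```python
def mask_counts(pred, truth):
    """Count true positives, false positives and false negatives
    between two binary masks given as equally sized nested lists."""
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1
            elif p and not t:
                fp += 1
            elif t and not p:
                fn += 1
    return tp, fp, fn


def f1_score(pred, truth):
    """Pixel-wise F1 = 2*TP / (2*TP + FP + FN)."""
    tp, fp, fn = mask_counts(pred, truth)
    denom = 2 * tp + fp + fn
    # Two empty masks agree perfectly (assumed convention).
    return 2 * tp / denom if denom else 1.0


def iou(pred, truth):
    """Intersection over union = TP / (TP + FP + FN)."""
    tp, fp, fn = mask_counts(pred, truth)
    denom = tp + fp + fn
    return tp / denom if denom else 1.0


def is_detection_match(pred, truth, threshold=0.5):
    """Hypothetical helper: a prediction counts as a match when its
    IoU with the ground truth reaches the threshold (0.5 in mAP@0.5)."""
    return iou(pred, truth) >= threshold


# Toy example: a 4-pixel building footprint, of which 3 pixels are predicted.
truth = [[0, 1, 1],
         [0, 1, 1],
         [0, 0, 0]]
pred = [[0, 1, 1],
        [0, 1, 0],
        [0, 0, 0]]

print(f1_score(pred, truth), iou(pred, truth), is_detection_match(pred, truth))
```

In this toy case the prediction misses one footprint pixel and adds none, so it still clears the 0.5 IoU threshold while its F1 score falls below 1.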
Pages: 70-83
Page count: 14