An end-to-end shape modeling framework for vectorized building outline generation from aerial images

被引:63
作者
Chen, Qi [1 ]
Wang, Lei [2 ]
Waslander, Steven L. [2 ]
Liu, Xiuguo [1 ]
机构
[1] China Univ Geosci Wuhan, Sch Geog & Informat Engn, Wuhan, Peoples R China
[2] Univ Toronto, Inst Aerosp Studies, Toronto, ON, Canada
基金
中国国家自然科学基金;
关键词
Building segmentation; Boundary optimization; Automatic mapping; Deep learning; Shape modeling; CONVOLUTIONAL NEURAL-NETWORKS; EXTRACTION; SEGMENTATION; LIDAR; CLASSIFICATION; REFINEMENT; AREA;
D O I
10.1016/j.isprsjprs.2020.10.008
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
The identification and annotation of buildings has long been a tedious and expensive part of high-precision vector map production. The deep learning techniques such as fully convolution network (FCN) have largely promoted the accuracy of automatic building segmentation from remote sensing images. However, compared with the deep-learning-based building segmentation methods that greatly benefit from data-driven feature learning, the building boundary vector representation generation techniques mainly rely on handcrafted features and high human intervention. These techniques continue to employ manual design and ignore the opportunity of using the rich feature information that can be learned from training data to directly generate vectorized boundary descriptions. Aiming to address this problem, we introduce PolygonCNN, a learnable end-to-end vector shape modeling framework for generating building outlines from aerial images. The framework first performs an FCN-like segmentation to extract initial building contours. Then, by encoding the vertices of the building polygons along with the pooled image features extracted from segmentation step, a modified PointNet is proposed to learn shape priors and predict a polygon vertex deformation to generate refined building vector results. Additionally, we propose 1) a simplify-and-densify sampling strategy to generate homogeneously sampled polygon with well-kept geometric signals for shape prior learning; and 2) a novel loss function for estimating shape similarity between building polygons with vastly different vertex numbers. The experiments on over 10,000 building samples verify that PolygonCNN can generate building vectors with higher vertex-based F1-score than the state-of-the-art method, and simultaneously well maintains the building segmentation accuracy achieved by the FCN-like model.
引用
收藏
页码:114 / 126
页数:13
相关论文
共 56 条
  • [1] Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks
    Alshehhi, Rasha
    Marpu, Prashanth Reddy
    Woon, Wei Lee
    Dalla Mura, Mauro
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 130 : 139 - 149
  • [2] [Anonymous], 2018, ECCV
  • [3] [Anonymous], 2018, REMOTE SENS BASEL
  • [4] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [5] Building Footprint Extraction From VHR Remote Sensing Images Combined With Normalized DSMs Using Fused Fully Convolutional Networks
    Bittner, Ksenia
    Adam, Fathalrahman
    Cui, Shiyong
    Koerner, Marco
    Reinartz, Peter
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (08) : 2615 - 2629
  • [6] A Deep Learning Approach on Building Detection from Unmanned Aerial Vehicle-Based Images in Riverbank Monitoring
    Boonpook, Wuttichai
    Tan, Yumin
    Ye, Yinghua
    Torteeka, Peerapong
    Torsri, Kritanai
    Dong, Shengxian
    [J]. SENSORS, 2018, 18 (11)
  • [7] Calkins H, 2017, J ARRYTHM, V33, P369, DOI 10.1016/j.joa.2017.08.001
  • [8] Annotating Object Instances with a Polygon-RNN
    Castrejon, Lluis
    Kundu, Kaustav
    Urtasun, Raquel
    Fidler, Sanja
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4485 - 4493
  • [9] Aerial imagery for roof segmentation: A large-scale dataset towards automatic mapping of buildings (Withdrawn Publication)
    Chen, Qi
    Wang, Lei
    Wu, Yifan
    Wu, Guangming
    Guo, Zhiling
    Waslander, Steven L.
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 147 : 42 - 55
  • [10] AN IMPROVED SNAKE MODEL FOR REFINEMENT OF LIDAR-DERIVED BUILDING ROOF CONTOURS USING AERIAL IMAGES
    Chen, Qi
    Wang, Shugen
    Liu, Xiuguo
    [J]. XXIII ISPRS CONGRESS, COMMISSION III, 2016, 41 (B3): : 583 - 589