An end-to-end shape modeling framework for vectorized building outline generation from aerial images

被引：63

作者：

Chen, Qi ^{[1
]}

Wang, Lei ^{[2
]}

Waslander, Steven L. ^{[2
]}

Liu, Xiuguo ^{[1
]}

机构：

[1] China Univ Geosci Wuhan, Sch Geog & Informat Engn, Wuhan, Peoples R China

[2] Univ Toronto, Inst Aerosp Studies, Toronto, ON, Canada

来源：

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2020年 / 170卷 / 114-126期

基金：

中国国家自然科学基金;

关键词：

Building segmentation; Boundary optimization; Automatic mapping; Deep learning; Shape modeling; CONVOLUTIONAL NEURAL-NETWORKS; EXTRACTION; SEGMENTATION; LIDAR; CLASSIFICATION; REFINEMENT; AREA;

D O I：

10.1016/j.isprsjprs.2020.10.008

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

The identification and annotation of buildings has long been a tedious and expensive part of high-precision vector map production. The deep learning techniques such as fully convolution network (FCN) have largely promoted the accuracy of automatic building segmentation from remote sensing images. However, compared with the deep-learning-based building segmentation methods that greatly benefit from data-driven feature learning, the building boundary vector representation generation techniques mainly rely on handcrafted features and high human intervention. These techniques continue to employ manual design and ignore the opportunity of using the rich feature information that can be learned from training data to directly generate vectorized boundary descriptions. Aiming to address this problem, we introduce PolygonCNN, a learnable end-to-end vector shape modeling framework for generating building outlines from aerial images. The framework first performs an FCN-like segmentation to extract initial building contours. Then, by encoding the vertices of the building polygons along with the pooled image features extracted from segmentation step, a modified PointNet is proposed to learn shape priors and predict a polygon vertex deformation to generate refined building vector results. Additionally, we propose 1) a simplify-and-densify sampling strategy to generate homogeneously sampled polygon with well-kept geometric signals for shape prior learning; and 2) a novel loss function for estimating shape similarity between building polygons with vastly different vertex numbers. The experiments on over 10,000 building samples verify that PolygonCNN can generate building vectors with higher vertex-based F1-score than the state-of-the-art method, and simultaneously well maintains the building segmentation accuracy achieved by the FCN-like model.

引用

页码：114 / 126

页数：13

共 56 条

[1] Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks
Alshehhi, Rasha
Marpu, Prashanth Reddy
Woon, Wei Lee
Dalla Mura, Mauro
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 130 : 139 - 149
[2] [Anonymous], 2018, ECCV
[3] [Anonymous], 2018, REMOTE SENS BASEL
[4] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[5] Building Footprint Extraction From VHR Remote Sensing Images Combined With Normalized DSMs Using Fused Fully Convolutional Networks
Bittner, Ksenia
Adam, Fathalrahman
Cui, Shiyong
Koerner, Marco
Reinartz, Peter
[J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (08) : 2615 - 2629
[6] A Deep Learning Approach on Building Detection from Unmanned Aerial Vehicle-Based Images in Riverbank Monitoring
Boonpook, Wuttichai
Tan, Yumin
Ye, Yinghua
Torteeka, Peerapong
Torsri, Kritanai
Dong, Shengxian
[J]. SENSORS, 2018, 18 (11)
[7] Calkins H, 2017, J ARRYTHM, V33, P369, DOI 10.1016/j.joa.2017.08.001
[8] Annotating Object Instances with a Polygon-RNN
Castrejon, Lluis
Kundu, Kaustav
Urtasun, Raquel
Fidler, Sanja
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4485 - 4493
[9] Aerial imagery for roof segmentation: A large-scale dataset towards automatic mapping of buildings (Withdrawn Publication)
Chen, Qi
Wang, Lei
Wu, Yifan
Wu, Guangming
Guo, Zhiling
Waslander, Steven L.
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 147 : 42 - 55
[10] AN IMPROVED SNAKE MODEL FOR REFINEMENT OF LIDAR-DERIVED BUILDING ROOF CONTOURS USING AERIAL IMAGES
Chen, Qi
Wang, Shugen
Liu, Xiuguo
[J]. XXIII ISPRS CONGRESS, COMMISSION III, 2016, 41 (B3): : 583 - 589

← 1 2 3 4 5 6 →