A Deep Learning-Based Framework for Automated Extraction of Building Footprint Polygons from Very High-Resolution Aerial Imagery

被引:34
作者
Li, Ziming [1 ]
Xin, Qinchuan [1 ,2 ]
Sun, Ying [1 ]
Cao, Mengying [1 ]
机构
[1] Sun Yat Sen Univ, Sch Geog & Planning, Guangdong Key Lab Urbanizat & Geosimulat, Guangzhou 510275, Peoples R China
[2] Chinese Acad Sci, Res Ctr Ecol & Environm Cent Asia, State Key Lab Desert & Oasis Ecol, Urumqi 830011, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
building footprint; map vectorization; convolutional neural network; semantic segmentation; PERFORMANCE EVALUATION; SEGMENTATION; CLASSIFICATION; GENERATION; MULTISCALE;
D O I
10.3390/rs13183630
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate building footprint polygons provide essential data for a wide range of urban applications. While deep learning models have been proposed to extract pixel-based building areas from remote sensing imagery, the direct vectorization of pixel-based building maps often leads to building footprint polygons with irregular shapes that are inconsistent with real building boundaries, making it difficult to use them in geospatial analysis. In this study, we proposed a novel deep learning-based framework for automated extraction of building footprint polygons (DLEBFP) from very high-resolution aerial imagery by combining deep learning models for different tasks. Our approach uses the U-Net, Cascade R-CNN, and Cascade CNN deep learning models to obtain building segmentation maps, building bounding boxes, and building corners, respectively, from very high-resolution remote sensing images. We used Delaunay triangulation to construct building footprint polygons based on the detected building corners with the constraints of building bounding boxes and building segmentation maps. Experiments on the Wuhan University building dataset and ISPRS Vaihingen dataset indicate that DLEBFP can perform well in extracting high-quality building footprint polygons. Compared with the other semantic segmentation models and the vector map generalization method, DLEBFP is able to achieve comparable mapping accuracies with semantic segmentation models on a pixel basis and generate building footprint polygons with concise edges and vertices with regular shapes that are close to the reference data. The promising performance indicates that our method has the potential to extract accurate building footprint polygons from remote sensing images for applications in geospatial analysis.
引用
收藏
页数:25
相关论文
共 66 条
[1]  
[Anonymous], 2016, C COMPUTER VISION PA, DOI DOI 10.1109/CVPR.2016.90
[2]   An Effective Data-Driven Method for 3-D Building Roof Reconstruction and Robust Change Detection [J].
Awrangjeb, Mohammad ;
Gilani, Syed Ali Naqi ;
Siddiqui, Fasahat Ullah .
REMOTE SENSING, 2018, 10 (10)
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310
[5]   An end-to-end shape modeling framework for vectorized building outline generation from aerial images [J].
Chen, Qi ;
Wang, Lei ;
Waslander, Steven L. ;
Liu, Xiuguo .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 170 (114-126) :114-126
[6]   Aerial imagery for roof segmentation: A large-scale dataset towards automatic mapping of buildings (Withdrawn Publication) [J].
Chen, Qi ;
Wang, Lei ;
Wu, Yifan ;
Wu, Guangming ;
Guo, Zhiling ;
Waslander, Steven L. .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 147 :42-55
[7]   An adaptive spatial clustering algorithm based on delaunay triangulation [J].
Deng, Min ;
Liu, Qiliang ;
Cheng, Tao ;
Shi, Yan .
COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2011, 35 (04) :320-332
[8]   Outlier detection and robust plane fitting for building roof extraction from LiDAR data [J].
Dey, Emon Kumar ;
Awrangjeb, Mohammad ;
Stantic, Bela .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2020, 41 (16) :6325-6354
[9]  
Douglas D.H., 1973, INT J GEOGR INF GEOV, V10, P112, DOI [10.3138/FM57-6770-U75U-7727, DOI 10.3138/FM57-6770-U75U-7727]
[10]   Semantic classification of urban buildings combining VHR image and GIS data: An improved random forest approach [J].
Du, Shihong ;
Zhang, Fangli ;
Zhang, Xiuyuan .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2015, 105 :107-119