Building Footprint Extraction from High Resolution Aerial Images Using Generative Adversarial Network (GAN) Architecture

Cited by: 56
Authors
Abdollahi, Abolfazl [1]
Pradhan, Biswajeet [1, 2, 3]
Gite, Shilpa [4]
Alamri, Abdullah [5]
Affiliations
[1] Univ Technol Sydney, Fac Engn & IT, Ctr Adv Modeling & Geospatial Informat Syst CAMGI, Sydney, NSW 2007, Australia
[2] Sejong Univ, Dept Energy & Mineral Resources Engn, Seoul 05006, South Korea
[3] Univ Kebangsaan Malaysia, Inst Climate Change, Earth Observat Ctr, Bangi 43600, Selangor, Malaysia
[4] Symbiosis Int Deemed Univ, Symbiosis Inst Technol, Comp Sci & Informat Technol Dept, Pune 412115, Maharashtra, India
[5] King Saud Univ, Coll Sci, Dept Geol & Geophys, Riyadh 11451, Saudi Arabia
Keywords
Training; Image segmentation; Buildings; Semantics; Generative adversarial networks; Feature extraction; Gallium nitride; Building extraction; GAN; Remote sensing; SegNet; Road extraction
DOI
10.1109/ACCESS.2020.3038225
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Extracting buildings with high accuracy from high-resolution remotely sensed imagery using semantic segmentation has a wide range of applications, such as urban planning, geospatial database updating, and disaster management. However, automatically extracting buildings with a noise-free segmentation map and accurate boundary information remains a major challenge for most popular deep learning methods because of obstacles such as cars, vegetation cover, and tree shadows in high-resolution remote sensing imagery. To tackle these issues, we introduce an end-to-end convolutional neural network based on a Generative Adversarial Network (GAN) in this study. For the generative model, we used a SegNet model with Bi-directional Convolutional LSTM (BConvLSTM) to generate the segmentation map from the Massachusetts building dataset, which contains high-resolution aerial imagery. BConvLSTM combines encoded features (containing more local information) and decoded features (containing more semantic information) to improve the model's performance even in the presence of complex backgrounds and obstacles. The adversarial training method enforces long-range spatial label vicinity to address building objects being covered by occlusions such as trees, cars, and shadows, and achieves high-quality building segmentation in complex areas. The proposed technique obtained an average F1-score of 96.81%, which shows that, by detecting and correcting differences between the segmentation model output and the reference map, it can achieve better results than other state-of-the-art approaches such as the autoencoder method with 91.36%, SegNet+BConvLSTM with 95.96%, FCN-CRFs with 95.36%, SegNet with 94.77%, and the GAN-SCA model with 96.36% accuracy.
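The abstract reports results as an F1-score. As a rough illustration only, the sketch below shows one common way to compute a pixel-wise F1-score between a predicted binary building mask and a reference mask. The function name `f1_score`, the mask encoding (1 = building, 0 = background), and the toy masks are assumptions made for this example; they are not taken from the paper, whose exact evaluation protocol is not given in this record.

```python
from typing import Sequence


def f1_score(pred: Sequence[Sequence[int]], ref: Sequence[Sequence[int]]) -> float:
    """Pixel-wise F1 for binary masks (assumed encoding: 1 = building, 0 = background)."""
    tp = fp = fn = 0
    for pred_row, ref_row in zip(pred, ref):
        for p, r in zip(pred_row, ref_row):
            if p == 1 and r == 1:
                tp += 1  # building pixel correctly predicted
            elif p == 1 and r == 0:
                fp += 1  # background pixel predicted as building
            elif p == 0 and r == 1:
                fn += 1  # building pixel missed
    if tp == 0:
        return 0.0  # no true positives: define F1 as 0 to avoid division by zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy 3x3 masks with made-up values (hypothetical, not data from the paper).
predicted = [[1, 1, 0], [0, 1, 0], [0, 0, 0]]
reference = [[1, 1, 0], [0, 1, 1], [0, 0, 0]]
score = f1_score(predicted, reference)
```

In this toy case there are three true positives, no false positives, and one missed building pixel, so precision is 1.0, recall is 0.75, and the F1-score is 6/7.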
Pages: 209517-209527
Number of pages: 11