Deep Learning-Based Improved Automatic Building Extraction from Open-Source High Resolution Unmanned Aerial Vehicle (UAV) Imagery

Cited by: 0
Authors
Maniyar, Chintan B. [1]
Kumar, Minakshi [1]
Affiliations
[1] Indian Inst Remote Sensing, Photogrammetry & Remote Sensing Dept, Dehra Dun 248001, Uttarakhand, India
Source
PROCEEDINGS OF UASG 2021: WINGS 4 SUSTAINABILITY | 2023 / Vol. 304
Keywords
Transfer learning; Fully convolutional networks; Image segmentation; Building extraction; CLASSIFICATION; AREAS
DOI
10.1007/978-3-031-19309-5_5
Chinese Library Classification
V [Aviation, Aerospace]
Discipline classification codes
08; 0825
Abstract
Automatically extracting buildings from remotely sensed imagery has always been a challenging task, given the spectral similarity between buildings and non-building features as well as the structural diversity within an image. Traditional machine learning (ML) based methods rely heavily on a large number of samples and are best suited to medium-resolution images. Unmanned aerial vehicle (UAV) imagery offers the distinct advantage of very high spatial resolution, which helps improve building extraction by characterizing patterns and structures. However, as the level of detail increases, the number of images in a UAV dataset also increases manyfold, which requires robust processing algorithms. Deep learning algorithms, specifically Fully Convolutional Networks (FCNs), have greatly improved the results of building extraction from such high-resolution remotely sensed imagery compared to traditional methods. This study proposes a deep learning-based segmentation approach that extracts buildings by transferring the learning of a deep Residual Network (ResNet) to the segmentation-based FCN U-Net. This combined dense architecture of ResNet and U-Net (Res-U-Net) is trained and tested for building extraction on the open-source Inria Aerial Image Labelling (IAIL) dataset. The dataset contains 360 orthorectified images with a tile size of 1500 m² each, at 30 cm spatial resolution with red, green and blue bands, covering a total area of 805 km² in select US and Austrian cities. Quantitative assessments show that the proposed methodology outperforms current deep learning-based building extraction methods. Compared with a standalone U-Net model for building extraction on the IAIL dataset, the proposed Res-U-Net model improves the overall accuracy from 92.85% to 96.5%, the mean F1-score from 0.83 to 0.88 and the mean IoU from 0.71 to 0.80. These results show that combining two deep learning architectures greatly improves building extraction accuracy compared to a single architecture.
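The three metrics reported in the abstract (overall accuracy, F1-score and IoU) can be illustrated with a minimal sketch for one pair of binary building masks. This is not the authors' evaluation code: the function name `segmentation_metrics`, the toy masks and the zero-denominator convention are assumptions made for illustration, and the abstract does not state how the "mean" values are averaged (per image, per class or otherwise).

```python
def segmentation_metrics(pred, truth):
    """Compute overall accuracy, F1 and IoU for the building class.

    pred and truth are equally sized 2D lists of 0 (non-building) and
    1 (building). Hypothetical helper, not taken from the paper.
    """
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p == 1 and t == 1:
                tp += 1          # building predicted, building present
            elif p == 1 and t == 0:
                fp += 1          # building predicted, none present
            elif p == 0 and t == 1:
                fn += 1          # building missed
            else:
                tn += 1          # background correctly rejected

    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    # F1 = 2TP / (2TP + FP + FN); assumed to be 0.0 when undefined.
    f1_den = 2 * tp + fp + fn
    f1 = 2 * tp / f1_den if f1_den else 0.0
    # IoU = TP / (TP + FP + FN); assumed to be 0.0 when undefined.
    iou_den = tp + fp + fn
    iou = tp / iou_den if iou_den else 0.0
    return {"accuracy": accuracy, "f1": f1, "iou": iou}


# Toy 4x4 masks (made up for this example, not from the IAIL dataset).
pred_mask = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
]
truth_mask = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 1],
]

scores = segmentation_metrics(pred_mask, truth_mask)
print(scores)
```

On these toy masks there are 5 true positives, 1 false positive, 1 false negative and 9 true negatives, so accuracy is 14/16, F1 is 10/12 and IoU is 5/7. Accuracy is the highest of the three because it also counts the correctly labelled background pixels. Whether the same effect explains the gap between the accuracy and IoU figures in the abstract is not stated there.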
Pages: 51-66
Number of pages: 16