Dilated-ResUnet: A novel deep learning architecture for building extraction from medium resolution multi-spectral satellite imagery

被引:36
作者
Dixit, Mayank [1 ,2 ]
Chaurasia, Kuldeep [1 ]
Mishra, Vipul Kumar [1 ,3 ]
机构
[1] Bennett Univ, Sch Engn & Appl Sci, Dept Comp Sci Engn, Greater Noida, India
[2] Galgotias Coll Engn & Technol, Dept Comp Sci & Engn, Greater Noida, India
[3] Bennett Univ, Dept Comp Sci & Engn, Greater Noida, India
关键词
Sentinel-2; Building Extraction; Dilated Convolution; Residual block; Satellite Images; Deep learning; SEMANTIC SEGMENTATION; URBAN; AREAS;
D O I
10.1016/j.eswa.2021.115530
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In today's world, satellite images are being utilized for the identification of built-up area, urban planning, disaster management, insurance & tax assessment in an area, and many other social-economic activities. The extraction of the accurate building footprints in densely populated urban areas from medium resolution satellite images is still a challenging task which requires the development of the new methods to solve such problem. In this paper, a novel Dilated-ResUnet deep learning architecture for building extraction from Sentinel-2 satellite images has been proposed. The proposed model has been tested on three novel building datasets that are prepared for three densely populated cities of India (viz. Delhi, Hyderabad and Bengaluru) using Sentinel-2 satellite images and Planet OSM. First FCC (false colour composite) dataset prepared by merging NIR, Red, Green bands, second FCC dataset prepared by merging NIR, Red, Green and Blue bands and third is TCC (true colour composite) dataset by merging red, green and blue bands. The proposed architecture is applied to both the FCC datasets and TCC dataset separately; it has been identified that the proposed model has obtained better building extraction results using FCC (NIR, Red, Green) dataset. The input satellite image enhancement and extensive experimentations to identify the optimal deep learning hyper-parameters using FCC spatial dataset have also been carried out to further improve the performance of the proposed model. The results of the experimentations reveal that the proposed model has out-performed the state of the art models available in literature by achieving the F1-score of 0.4718 and Mean IoU of 0.582 for building extraction from Sentinel-2 satellite images. The outcome of the research work can be utilized for urban planning and management, generate more ground truths for Sentinel-2 satellite images which further can be useful for other societal applications.
引用
收藏
页数:16
相关论文
共 69 条
[1]   An ensemble architecture of deep convolutional Segnet and Unet networks for building semantic segmentation from high-resolution aerial images [J].
Abdollahi, Abolfazl ;
Pradhan, Biswajeet ;
Alamri, Abdullah M. .
GEOCARTO INTERNATIONAL, 2022, 37 (12) :3355-3370
[2]  
Agarap A. F., 2019, DEEP LEARNING USING
[3]   An efficient and improved scheme for handwritten digit recognition based on convolutional neural network [J].
Ali, Saqib ;
Shaukat, Zeeshan ;
Azeem, Muhammad ;
Sakhawat, Zareen ;
Mahmood, Tariq ;
Rehman, Khalil Ur .
SN APPLIED SCIENCES, 2019, 1 (09)
[4]  
[Anonymous], 2019, APPL CHEM IND, DOI DOI 10.16581/J.CNKI.ISSN1671-3206.20190311.023
[5]  
[Anonymous], 2017, BUILDING DETECTION S
[6]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[7]  
Bramhe V. S., 2018, REMOTE SENSING SPATI, V42, P79
[8]   Convolutional low-resolution fine-grained classification [J].
Cai, Dingding ;
Chen, Ke ;
Qian, Yanlin ;
Kamarainen, Joni-Kristian .
PATTERN RECOGNITION LETTERS, 2019, 119 :166-171
[9]   Building damage annotation on post-hurricane satellite imagery based on convolutional neural networks [J].
Cao, Quoc Dung ;
Choe, Youngjun .
NATURAL HAZARDS, 2020, 103 (03) :3357-3376
[10]  
Chaurasia A, 2017, 2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP)