Building Extraction Based on U-Net with an Attention Block and Multiple Losses

被引:140
作者
Guo, Mingqiang [1 ]
Liu, Heng [1 ]
Xu, Yongyang [1 ]
Huang, Ying [2 ]
机构
[1] China Univ Geosci, Sch Geog & Informat Engn, Wuhan 430074, Peoples R China
[2] Wuhan Zondy Cyber Technol Co Ltd, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
building extraction; attention block; multiple losses; semantic segmentation; remote sensing images; RESOLUTION; NETWORK;
D O I
10.3390/rs12091400
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Semantic segmentation of high-resolution remote sensing images plays an important role in applications for building extraction. However, the current algorithms have some semantic information extraction limitations, and these can lead to poor segmentation results. To extract buildings with high accuracy, we propose a multiloss neural network based on attention. The designed network, based on U-Net, can improve the sensitivity of the model by the attention block and suppress the background influence of irrelevant feature areas. To improve the ability of the model, a multiloss approach is proposed during training the network. The experimental results show that the proposed model offers great improvement over other state-of-the-art methods. For the public Inria Aerial Image Labeling dataset, the F1 score reached 76.96% and showed good performance on the Aerial Imagery for Roof Segmentation dataset.
引用
收藏
页数:17
相关论文
共 54 条
[1]   Automatic urban building boundary extraction from high resolution aerial images using an innovative model of active contours [J].
Ahmadi, Salman ;
Zoej, M. J. Valadan ;
Ebadi, Hamid ;
Moghaddam, Hamid Abrishami ;
Mohammadzadeh, Ali .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2010, 12 (03) :150-157
[2]  
Allen-Zhu Z., 2018, P 36 INT C MACH LEAR, P242
[3]   Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks [J].
Alshehhi, Rasha ;
Marpu, Prashanth Reddy ;
Woon, Wei Lee ;
Dalla Mura, Mauro .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 130 :139-149
[4]  
[Anonymous], 2014, Advances in neural information processing systems
[5]  
[Anonymous], 2018, ARXIV180304953
[6]  
Bischke B, 2019, IEEE IMAGE PROC, P1480, DOI [10.1109/icip.2019.8803050, 10.1109/ICIP.2019.8803050]
[7]  
Bluche Theodore, 2016, ADV NEURAL INFORM PR, P838, DOI DOI 10.5555/3157096.3157190
[8]  
Cao C., 2015, P IEEE INT C COMPUTE, P2964
[9]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[10]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848