Building Footprint Extraction from Remote Sensing Images with Residual Attention Multi-Scale Aggregation Fully Convolutional Network

被引：2

作者：

Ahmadian, Nima ^{[1
]}

Sedaghat, Amin ^{[1
]}

Mohammadi, Nazila ^{[1
]}

机构：

[1] Univ Tabriz, Fac Civil Engn, Dept Geomat Engn, Tabriz, Iran

来源：

JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING | 2024年 / 52卷 / 11期

关键词：

Deep learning; Residual networks; Attention; Building footprint extraction; Fully convolutional networks; AERIAL IMAGES;

D O I：

10.1007/s12524-024-01961-8

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Building footprint extraction is crucial for various applications, including disaster management, change detection, and 3D modeling. Satellite and aerial images, when combined with deep learning techniques, offer an effective means for this task. The Multi-scale Aggregation Fully Convolutional Network (MA-FCN) is an encoder-decoder model that emphasizes scale information, producing the final segmentation map by concatenating four feature maps from different stages of the decoder. To enhance segmentation accuracy, we propose two novel deep learning models: Attention MA-FCN and Residual Attention MA-FCN. Attention MA-FCN incorporates attention gates in the skip connections to emphasize relevant features, directing the model's focus to essential areas. Residual Attention MA-FCN further integrates residual blocks into the architecture, using both attention mechanisms and residual blocks to improve stability against gradient vanishing and overfitting, thereby enabling deeper training. These models were evaluated on the WHU, Massachusetts, and Jinghai District datasets, showing superior performance compared to the original MA-FCN. Specifically, Residual Attention MA-FCN outperformed MA-FCN and Attention MA-FCN by 3.6% and 0.92% on the WHU dataset, and by 5.51% and 0.91% on the Massachusetts dataset in terms of the Intersection Over Union (IOU) metric. Additionally, Residual Attention MA-FCN surpassed MA-FCN, Attention MA-FCN, Mask-RCNN, and U-Net models on the Jinghai District dataset. Due to the significance of building footprint extraction in various applications, the results of this study indicates that the proposed methods are more accurate than the MA-FCN model with better performances in IOU and F1-score metrics.

引用

页码：2417 / 2429

页数：13

共 32 条

[1] Automatic coastline extraction through enhanced sea-land segmentation by modifying Standard U-Net
Aghdami-Nia, Mohammad
Shah-Hosseini, Reza
Rostami, Amirhossein
Homayouni, Saeid
[J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 109
[2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[3] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Chen, Liang-Chieh
Papandreou, George
Kokkinos, Iasonas
Murphy, Kevin
Yuille, Alan L.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
[4] Feature Extraction and Object Detection Using Fast-Convolutional Neural Network for Remote Sensing Satellite Image
Devi, N. Bharatha
Kavida, A. Celine
Murugan, R.
[J]. JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2022, 50 (06) : 961 - 973
[5] Building extraction from VHR remote sensing imagery by combining an improved deep convolutional encoder-decoder architecture and historical land use vector map
Feng, Wenqing
Sui, Haigang
Hua, Li
Xu, Chuan
Ma, Guorui
Huang, Weiming
[J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2020, 41 (17) : 6595 - 6617
[6] Building Footprint Semantic Segmentation using Bi-Channel Bi-Spatial (B2-CS) LinkNet
Giftlin, C. Jenifer Grace
Jenicka, S.
Juliet, S. Ebenezer
[J]. JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2022, 50 (10) : 1841 - 1854
[7] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[8] He K, 2017, IEEE INT WORKSH MULT
[9] CMGFNet: A deep cross-modal gated fusion network for building extraction from very high-resolution remote sensing images
Hosseinpour, Hamidreza
Samadzadegan, Farhad
Javan, Farzaneh Dadrass
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 184 : 96 - 115
[10] FSAU-Net: a network for extracting buildings from remote sensing imagery using feature self-attention
Hu, Minghong
Li, Jiatian
Xiaohui, A.
Zhao, Yunfei
Lu, Mei
Li, Wen
[J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (05) : 1643 - 1664

← 1 2 3 4 →