Building Footprint Extraction from Remote Sensing Images with Residual Attention Multi-Scale Aggregation Fully Convolutional Network

被引:2
作者
Ahmadian, Nima [1 ]
Sedaghat, Amin [1 ]
Mohammadi, Nazila [1 ]
机构
[1] Univ Tabriz, Fac Civil Engn, Dept Geomat Engn, Tabriz, Iran
关键词
Deep learning; Residual networks; Attention; Building footprint extraction; Fully convolutional networks; AERIAL IMAGES;
D O I
10.1007/s12524-024-01961-8
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Building footprint extraction is crucial for various applications, including disaster management, change detection, and 3D modeling. Satellite and aerial images, when combined with deep learning techniques, offer an effective means for this task. The Multi-scale Aggregation Fully Convolutional Network (MA-FCN) is an encoder-decoder model that emphasizes scale information, producing the final segmentation map by concatenating four feature maps from different stages of the decoder. To enhance segmentation accuracy, we propose two novel deep learning models: Attention MA-FCN and Residual Attention MA-FCN. Attention MA-FCN incorporates attention gates in the skip connections to emphasize relevant features, directing the model's focus to essential areas. Residual Attention MA-FCN further integrates residual blocks into the architecture, using both attention mechanisms and residual blocks to improve stability against gradient vanishing and overfitting, thereby enabling deeper training. These models were evaluated on the WHU, Massachusetts, and Jinghai District datasets, showing superior performance compared to the original MA-FCN. Specifically, Residual Attention MA-FCN outperformed MA-FCN and Attention MA-FCN by 3.6% and 0.92% on the WHU dataset, and by 5.51% and 0.91% on the Massachusetts dataset in terms of the Intersection Over Union (IOU) metric. Additionally, Residual Attention MA-FCN surpassed MA-FCN, Attention MA-FCN, Mask-RCNN, and U-Net models on the Jinghai District dataset. Due to the significance of building footprint extraction in various applications, the results of this study indicates that the proposed methods are more accurate than the MA-FCN model with better performances in IOU and F1-score metrics.
引用
收藏
页码:2417 / 2429
页数:13
相关论文
共 32 条
  • [1] Automatic coastline extraction through enhanced sea-land segmentation by modifying Standard U-Net
    Aghdami-Nia, Mohammad
    Shah-Hosseini, Reza
    Rostami, Amirhossein
    Homayouni, Saeid
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 109
  • [2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [3] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [4] Feature Extraction and Object Detection Using Fast-Convolutional Neural Network for Remote Sensing Satellite Image
    Devi, N. Bharatha
    Kavida, A. Celine
    Murugan, R.
    [J]. JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2022, 50 (06) : 961 - 973
  • [5] Building extraction from VHR remote sensing imagery by combining an improved deep convolutional encoder-decoder architecture and historical land use vector map
    Feng, Wenqing
    Sui, Haigang
    Hua, Li
    Xu, Chuan
    Ma, Guorui
    Huang, Weiming
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2020, 41 (17) : 6595 - 6617
  • [6] Building Footprint Semantic Segmentation using Bi-Channel Bi-Spatial (B2-CS) LinkNet
    Giftlin, C. Jenifer Grace
    Jenicka, S.
    Juliet, S. Ebenezer
    [J]. JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2022, 50 (10) : 1841 - 1854
  • [7] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [8] He K, 2017, IEEE INT WORKSH MULT
  • [9] CMGFNet: A deep cross-modal gated fusion network for building extraction from very high-resolution remote sensing images
    Hosseinpour, Hamidreza
    Samadzadegan, Farhad
    Javan, Farzaneh Dadrass
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 184 : 96 - 115
  • [10] FSAU-Net: a network for extracting buildings from remote sensing imagery using feature self-attention
    Hu, Minghong
    Li, Jiatian
    Xiaohui, A.
    Zhao, Yunfei
    Lu, Mei
    Li, Wen
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (05) : 1643 - 1664