FSAU-Net: a network for extracting buildings from remote sensing imagery using feature self-attention

被引:6
作者
Hu, Minghong [1 ]
Li, Jiatian [1 ]
Xiaohui, A. [1 ]
Zhao, Yunfei [1 ]
Lu, Mei [1 ]
Li, Wen [1 ]
机构
[1] Kunming Univ Sci & Technol, Fac Land & Resources Engn, Kunming 650031, Peoples R China
基金
中国国家自然科学基金;
关键词
Building extraction; Features self-attention; Spatial attention; Skip connection; SHADOW;
D O I
10.1080/01431161.2023.2177125
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Convolutional neural networks (CNNs) extract semantic features from images by stacking convolutional operators, which easily causes semantic information loss and leads to hollow and edge inaccuracies in building extraction. Therefore, a features self-attention U-block network (FSAU-Net) is proposed. The network focuses on the target feature self-attention in the coding stage, and features self-attention (FSA) distinguishes buildings from nonbuilding by weighting the extracted features themselves; we introduce spatial attention (SA) in the decoder stage to focus on the spatial locations of features, and SA generates spatial location features through the spatial relationship among the features to highlight the building information area. A jump connection is used to fuse the shallow features generated in the decoder stage with the deep features generated in the encoder stage to reduce the building information loss. We validate the superiority of the method FSAU-Net on the WHU and Inria datasets with 0.3 m resolution and Massachusetts with 1.0 m resolution, experimentally showing IoU of 91.73%, 80.73% and 78.46% and precision of 93.60%, 90.71% and 86.37%, respectively. In addition, we also set up ablation experiments by adding an FSA module, Squeeze-and-Excitation (SE) module and Efficient Channel Attention (ECA) module to UNet and ResNet101, where UNet+FSA improves the IoU values by 3.15%, 2.72% and 1.77% compared to UNet, UNet+SE and UNet+ECA, respectively, and ResNet101+FSA improves the IoU values by 2.06%, 1.17% and 0.9% compared to ResNet101, ResNet101+SE and ResNet101+ECA, respectively, demonstrating the superiority of our proposed FSA module. FSAU-Net improves the IoU values by 3.18%, 2.75% and 1.80% compared to those of UNet, UNet+SE and UNet+ECA, respectively. FSAU-Net has 2.11%, 1.22%, and 0.95% IoU improvements over the IoU values of ResNet101, ResNet101+SE and ResNet101+ECA, respectively, demonstrating the superiority of our proposed FSAU-Net model. The TensorFlow implementation is available at .
引用
收藏
页码:1643 / 1664
页数:22
相关论文
共 56 条
  • [1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [2] Bhangale Kishor, 2022, Second International Conference on Image Processing and Capsule Networks: ICIPCN 2021. Lecture Notes in Networks and Systems (300), P414, DOI 10.1007/978-3-030-84760-9_36
  • [3] A Stacking Ensemble Deep Learning Model for Building Extraction from Remote Sensing Images
    Cao, Duanguang
    Xing, Hanfa
    Wong, Man Sing
    Kwan, Mei-Po
    Xing, Huaqiao
    Meng, Yuan
    [J]. REMOTE SENSING, 2021, 13 (19)
  • [4] AlexNet Convolutional Neural Network for Disease Detection and Classification of Tomato Leaf
    Chen, Hsing-Chung
    Widodo, Agung Mulyo
    Wisnujati, Andika
    Rahaman, Mosiur
    Lin, Jerry Chun-Wei
    Chen, Liukui
    Weng, Chien-Erh
    [J]. ELECTRONICS, 2022, 11 (06)
  • [5] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [6] USING POLAR GRID FOR BUILDING EXTRACTION IN TERRESTRIAL LASER SCANNING DATA
    Chen, Maolin
    Tang, Feifei
    Pan, Jianping
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1632 - 1634
  • [7] Research on Recognition of Fly Species Based on Improved RetinaNet and CBAM
    Chen, Yantong
    Zhang, Xianzhong
    Chen, Weinan
    Li, Yuyang
    Wang, Junsheng
    [J]. IEEE ACCESS, 2020, 8 (08) : 102907 - 102919
  • [8] Chunhui Zhao, 2021, 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, P5295, DOI 10.1109/IGARSS47720.2021.9554532
  • [9] Futagami T, 2019, 2019 58TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), P415, DOI [10.23919/sice.2019.8859851, 10.23919/SICE.2019.8859851]
  • [10] Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, DOI 10.48550/ARXIV.1704.04861, 10.48550/arXiv.1704.04861]