MSST-Net: A Multi-Scale Adaptive Network for Building Extraction from Remote Sensing Images Based on Swin Transformer

被引:51
|
作者
Yuan, Wei [1 ,2 ]
Xu, Wenbo [3 ]
机构
[1] Chengdu Univ, Sch Architecture & Civil Engn, Chengdu 610106, Peoples R China
[2] Chengdu Univ, Inst Higher Educ Sichuan Prov, Key Lab Pattern Recognit & Intelligent Informat P, Chengdu 610106, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Resources & Environm, Chengdu 611731, Peoples R China
关键词
deep learning; remote sensing; transformer; semantic segmentation; multi-scale adaptive; SEGMENTATION;
D O I
10.3390/rs13234743
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The segmentation of remote sensing images by deep learning technology is the main method for remote sensing image interpretation. However, the segmentation model based on a convolutional neural network cannot capture the global features very well. A transformer, whose self-attention mechanism can supply each pixel with a global feature, makes up for the deficiency of the convolutional neural network. Therefore, a multi-scale adaptive segmentation network model (MSST-Net) based on a Swin Transformer is proposed in this paper. Firstly, a Swin Transformer is used as the backbone to encode the input image. Then, the feature maps of different levels are decoded separately. Thirdly, the convolution is used for fusion, so that the network can automatically learn the weight of the decoding results of each level. Finally, we adjust the channels to obtain the final prediction map by using the convolution with a kernel of 1 x 1. By comparing this with other segmentation network models on a WHU building data set, the evaluation metrics, mIoU, F1-score and accuracy are all improved. The network model proposed in this paper is a multi-scale adaptive network model that pays more attention to the global features for remote sensing segmentation.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] MDTrans: Multi-scale and dual-branch feature fusion network based on Swin Transformer for building extraction in remote sensing images
    Diao, Kuo
    Zhu, Jinlong
    Liu, Guangjie
    Li, Meng
    IET IMAGE PROCESSING, 2024, 18 (11) : 2930 - 2942
  • [2] CSA-Net: Complex Scenarios Adaptive Network for Building Extraction for Remote Sensing Images
    Yang, Dongjie
    Gao, Xianjun
    Yang, Yuanwei
    Jiang, Minghan
    Guo, Kangliang
    Liu, Bo
    Li, Shaohua
    Yu, Shengyan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 938 - 953
  • [3] EMAFF-Net: an enhanced multi-scale attentive feature fusion network for building extraction from VHR remote sensing images
    Vijayan, Lakshmi
    Preethy Byju, Akshara
    REMOTE SENSING LETTERS, 2024, 15 (02) : 157 - 166
  • [4] ASF-Net: Adaptive Screening Feature Network for Building Footprint Extraction From Remote-Sensing Images
    Chen, Jun
    Jiang, Yuxuan
    Luo, Linbo
    Gong, Wenping
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [5] A Multi-Scale Edge Constraint Network for the Fine Extraction of Buildings from Remote Sensing Images
    Wang, Zhenqing
    Zhou, Yi
    Wang, Futao
    Wang, Shixin
    Qin, Gang
    Zou, Weijie
    Zhu, Jinfeng
    REMOTE SENSING, 2023, 15 (04)
  • [6] Multi-Scale Feature Fusion Attention Network for Building Extraction in Remote Sensing Images
    Liu, Jia
    Gu, Hang
    Li, Zuhe
    Chen, Hongyang
    Chen, Hao
    ELECTRONICS, 2024, 13 (05)
  • [7] Asymmetric Network Combining CNN and Transformer for Building Extraction from Remote Sensing Images
    Chang, Junhao
    Cen, Yuefeng
    Cen, Gang
    SENSORS, 2024, 24 (19)
  • [8] Multi-Scale Attention Network for Building Extraction from High-Resolution Remote Sensing Images
    Chang, Jing
    He, Xiaohui
    Li, Panle
    Tian, Ting
    Cheng, Xijie
    Qiao, Mengjia
    Zhou, Tao
    Zhang, Beibei
    Chang, Ziqian
    Fan, Tingwei
    SENSORS, 2024, 24 (03)
  • [9] Adaptive enhanced swin transformer with U-net for remote sensing image segmentation*
    Gu, Xingjian
    Li, Sizhe
    Ren, Shougang
    Zheng, Hengbiao
    Fan, Chengcheng
    Xu, Huanliang
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
  • [10] LiteST-Net: A Hybrid Model of Lite Swin Transformer and Convolution for Building Extraction from Remote Sensing Image
    Yuan, Wei
    Zhang, Xiaobo
    Shi, Jibao
    Wang, Jin
    REMOTE SENSING, 2023, 15 (08)