MSST-Net: A Multi-Scale Adaptive Network for Building Extraction from Remote Sensing Images Based on Swin Transformer

被引:51
作者
Yuan, Wei [1 ,2 ]
Xu, Wenbo [3 ]
机构
[1] Chengdu Univ, Sch Architecture & Civil Engn, Chengdu 610106, Peoples R China
[2] Chengdu Univ, Inst Higher Educ Sichuan Prov, Key Lab Pattern Recognit & Intelligent Informat P, Chengdu 610106, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Resources & Environm, Chengdu 611731, Peoples R China
关键词
deep learning; remote sensing; transformer; semantic segmentation; multi-scale adaptive; SEGMENTATION;
D O I
10.3390/rs13234743
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The segmentation of remote sensing images by deep learning technology is the main method for remote sensing image interpretation. However, the segmentation model based on a convolutional neural network cannot capture the global features very well. A transformer, whose self-attention mechanism can supply each pixel with a global feature, makes up for the deficiency of the convolutional neural network. Therefore, a multi-scale adaptive segmentation network model (MSST-Net) based on a Swin Transformer is proposed in this paper. Firstly, a Swin Transformer is used as the backbone to encode the input image. Then, the feature maps of different levels are decoded separately. Thirdly, the convolution is used for fusion, so that the network can automatically learn the weight of the decoding results of each level. Finally, we adjust the channels to obtain the final prediction map by using the convolution with a kernel of 1 x 1. By comparing this with other segmentation network models on a WHU building data set, the evaluation metrics, mIoU, F1-score and accuracy are all improved. The network model proposed in this paper is a multi-scale adaptive network model that pays more attention to the global features for remote sensing segmentation.
引用
收藏
页数:14
相关论文
共 50 条
[41]   A novel network for semantic segmentation of landslide areas in remote sensing images with multi-branch and multi-scale fusion [J].
Wang, Kai ;
He, Daojie ;
Sun, Qingqiang ;
Yi, Lizhi ;
Yuan, Xiaofeng ;
Wang, Yalin .
APPLIED SOFT COMPUTING, 2024, 158
[42]   Fine-Scale Urban Informal Settlements Mapping by Fusing Remote Sensing Images and Building Data via a Transformer-Based Multimodal Fusion Network [J].
Fan, Runyu ;
Li, Fengpeng ;
Han, Wei ;
Yan, Jining ;
Li, Jun ;
Wang, Lizhe .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[43]   Multi-Scale Sea-Land Segmentation Method for Remote Sensing Images Based on Res2Net [J].
Gao Hui ;
Yan Xiaodong ;
Zhang Heng ;
Niu Yiting ;
Wang Jiaqi .
ACTA OPTICA SINICA, 2022, 42 (18)
[44]   AFL-Net: Attentional Feature Learning Network for Building Extraction from Remote Sensing Images [J].
Qiu, Yue ;
Wu, Fang ;
Qian, Haizhong ;
Zhai, Renjian ;
Gong, Xianyong ;
Yin, Jichong ;
Liu, Chengyi ;
Wang, Andong .
REMOTE SENSING, 2023, 15 (01)
[45]   Multi-scale feature extraction for energy-efficient object detection in remote sensing images [J].
Wu, Di ;
Liu, Hongning ;
Xu, Jiawei ;
Xie, Fei .
IET COMPUTER VISION, 2024, :1223-1234
[46]   IRU-Net: An Efficient End-to-End Network for Automatic Building Extraction From Remote Sensing Images [J].
Sheikh, Md Abdul Alim ;
Maity, Tanmoy ;
Kole, Alok .
IEEE ACCESS, 2022, 10 :37811-37828
[47]   MC-Net: multi-scale contextual information aggregation network for image captioning on remote sensing images [J].
Huang, Haiyan ;
Shao, Zhenfeng ;
Cheng, Qimin ;
Huang, Xiao ;
Wu, Xiaoping ;
Li, Guoming ;
Tan, Li .
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (02) :4848-4866
[48]   MSCSA-Net: Multi-Scale Channel Spatial Attention Network for Semantic Segmentation of Remote Sensing Images [J].
Liu, Kuan-Hsien ;
Lin, Bo-Yen .
APPLIED SCIENCES-BASEL, 2023, 13 (17)
[49]   Multi scale feature extraction network with machine learning algorithms for water body extraction from remote sensing images [J].
Nagaraj, R. ;
Kumar, Lakshmi Sutha .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (17) :6349-6387
[50]   BOMSC-Net: Boundary Optimization and Multi-Scale Context Awareness Based Building Extraction From High-Resolution Remote Sensing Imagery [J].
Zhou, Yuan ;
Chen, Zhanlong ;
Wang, Bin ;
Li, Shuangjiang ;
Liu, Hao ;
Xu, Daozhu ;
Ma, Chao .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60