SFA-Net: Semantic Feature Adjustment Network for Remote Sensing Image Segmentation

被引:7
作者
Hwang, Gyutae [1 ]
Jeong, Jiwoo [1 ]
Lee, Sang Jun [2 ]
机构
[1] Jeonbuk Natl Univ, Div Elect Engn, Jeonju, South Korea
[2] Jeonbuk Natl Univ, Future Semicond Convergence Technol Res Ctr, Div Elect Engn, Jeonju 54896, South Korea
关键词
remote sensing image; segmentation; transformer; hybrid architecture; feature adjustment module; FUSION;
D O I
10.3390/rs16173278
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Advances in deep learning and computer vision techniques have made impacts in the field of remote sensing, enabling efficient data analysis for applications such as land cover classification and change detection. Convolutional neural networks (CNNs) and transformer architectures have been utilized in visual perception algorithms due to their effectiveness in analyzing local features and global context. In this paper, we propose a hybrid transformer architecture that consists of a CNN-based encoder and transformer-based decoder. We propose a feature adjustment module that refines the multiscale feature maps extracted from an EfficientNet backbone network. The adjusted feature maps are integrated into the transformer-based decoder to perform the semantic segmentation of the remote sensing images. This paper refers to the proposed encoder-decoder architecture as a semantic feature adjustment network (SFA-Net). To demonstrate the effectiveness of the SFA-Net, experiments were thoroughly conducted with four public benchmark datasets, including the UAVid, ISPRS Potsdam, ISPRS Vaihingen, and LoveDA datasets. The proposed model achieved state-of-the-art accuracy on the UAVid, ISPRS Vaihingen, and LoveDA datasets for the segmentation of the remote sensing images. On the ISPRS Potsdam dataset, our method achieved comparable accuracy to the latest model while reducing the number of trainable parameters from 113.8 M to 10.7 M.
引用
收藏
页数:18
相关论文
共 38 条
[1]  
Badrinarayanan V, 2016, Arxiv, DOI [arXiv:1511.00561, DOI 10.48550/ARXIV.1511.00561, 10.48550/arXiv.1511.00561]
[2]   A Spatial-Temporal Attention-Based Method and a New Dataset for Remote Sensing Image Change Detection [J].
Chen, Hao ;
Shi, Zhenwei .
REMOTE SENSING, 2020, 12 (10)
[3]  
Chen J., 2021, arXiv
[4]  
Chen YX, 2024, Arxiv, DOI arXiv:2305.12691
[5]  
Cheng HK, 2020, Arxiv, DOI arXiv:2005.02551
[6]   DeepGlobe 2018: A Challenge to Parse the Earth through Satellite Images [J].
Demir, Ilke ;
Koperski, Krzysztof ;
Lindenbaum, David ;
Pang, Guan ;
Huang, Jing ;
Bast, Saikat ;
Hughes, Forest ;
Tuia, Devis ;
Raskar, Ramesh .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :172-181
[7]   Lightweight boundary refinement module based on point supervision for semantic segmentation [J].
Dong, Zihao ;
Li, Jinping ;
Fang, Tiyu ;
Shao, Xiuli .
IMAGE AND VISION COMPUTING, 2021, 110 (110)
[8]  
Fu J, 2019, Arxiv, DOI [arXiv:1809.02983, 10.48550/arXiv.1809.02983]
[9]   Adaptive enhanced swin transformer with U-net for remote sensing image segmentation* [J].
Gu, Xingjian ;
Li, Sizhe ;
Ren, Shougang ;
Zheng, Hengbiao ;
Fan, Chengcheng ;
Xu, Huanliang .
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
[10]  
Islam MA, 2021, Arxiv, DOI arXiv:2108.07884