Local-Global Feature Capture and Boundary Information Refinement Swin Transformer Segmentor for Remote Sensing Images

被引:7
作者
Lin, Rui [1 ]
Zhang, Ying [1 ]
Zhu, Xue [1 ]
Chen, Xueyun [1 ]
机构
[1] Guangxi Univ, Sch Elect Engn, Nanning 530004, Peoples R China
基金
中国国家自然科学基金;
关键词
Boundary information; dual linear attention; feature capture; remote sensing images; semantic segmentation; SEMANTIC SEGMENTATION; EDGE FEATURE; NETWORK;
D O I
10.1109/ACCESS.2024.3350645
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semantic segmentation of urban remote sensing images is a highly challenging task. Due to the complex background, occlusion overlap, and small-scale targets in urban remote sensing images, the semantic segmentation results suffer from deficiencies such as similar target confusion, blurred target boundaries, and small-scale target omission. To solve the above problems, a local-global feature capture and boundary information refinement Swin Transformer segmentor (LGBSwin) is proposed. First, the dual linear attention module (DLAM) utilizes spatial linear attention and channel linear attention mechanisms for strengthening global modeling capabilities to improve the segmentation ability of similar targets. Second, boundary-aware enhancement (BAE) adaptively mines the boundary semantic information through the effective integration of high-level and low-level features to alleviate blurred boundaries. Finally, feature refinement aggregation (FRA) establishes information relationships between different layers, reduces the loss of local information, and enhances local-global dependence, thus significantly improving the recognition ability of small targets. Experimental results demonstrate the effectiveness of LGBSwin, with an F1 of 91.02% on the ISPRS Vaihingen dataset and 93.35% on the ISPRS Potsdam dataset.
引用
收藏
页码:6088 / 6099
页数:12
相关论文
共 47 条
  • [1] Crop and Weed Leaf Area Index Mapping Using Multi-Source Remote and Proximal Sensing
    Asad, Muhammad Hamza
    Bais, Abdul
    [J]. IEEE ACCESS, 2020, 8 (138179-138190): : 138179 - 138190
  • [2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [3] Edge-Guided Recurrent Convolutional Neural Network for Multitemporal Remote Sensing Image Building Change Detection
    Bai, Beifang
    Fu, Wei
    Lu, Ting
    Li, Shutao
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [4] Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, DOI 10.48550/ARXIV.1706.05587]
  • [5] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [6] ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks
    Ding, Xiaohan
    Guo, Yuchen
    Ding, Guiguang
    Han, Jungong
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1911 - 1920
  • [7] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [8] A Semantic Segmentation Method for Remote Sensing Images Based on the Swin Transformer Fusion Gabor Filter
    Feng, Dongdong
    Zhang, Zhihua
    Yan, Kun
    [J]. IEEE ACCESS, 2022, 10 : 77432 - 77451
  • [9] Dual Attention Network for Scene Segmentation
    Fu, Jun
    Liu, Jing
    Tian, Haijie
    Li, Yong
    Bao, Yongjun
    Fang, Zhiwei
    Lu, Hanqing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
  • [10] Semantic Segmentation of Marine Remote Sensing Based on a Cross Direction Attention Mechanism
    Gao, Hao
    Cao, Lin
    Yu, Dingfeng
    Xiong, Xuejun
    Cao, Maoyong
    [J]. IEEE ACCESS, 2020, 8 : 142483 - 142494