Local-Global Feature Capture and Boundary Information Refinement Swin Transformer Segmentor for Remote Sensing Images

被引：7

作者：

Lin, Rui ^{[1
]}

Zhang, Ying ^{[1
]}

Zhu, Xue ^{[1
]}

Chen, Xueyun ^{[1
]}

机构：

[1] Guangxi Univ, Sch Elect Engn, Nanning 530004, Peoples R China

来源：

IEEE ACCESS | 2024年 / 12卷

基金：

中国国家自然科学基金;

关键词：

Boundary information; dual linear attention; feature capture; remote sensing images; semantic segmentation; SEMANTIC SEGMENTATION; EDGE FEATURE; NETWORK;

D O I：

10.1109/ACCESS.2024.3350645

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Semantic segmentation of urban remote sensing images is a highly challenging task. Due to the complex background, occlusion overlap, and small-scale targets in urban remote sensing images, the semantic segmentation results suffer from deficiencies such as similar target confusion, blurred target boundaries, and small-scale target omission. To solve the above problems, a local-global feature capture and boundary information refinement Swin Transformer segmentor (LGBSwin) is proposed. First, the dual linear attention module (DLAM) utilizes spatial linear attention and channel linear attention mechanisms for strengthening global modeling capabilities to improve the segmentation ability of similar targets. Second, boundary-aware enhancement (BAE) adaptively mines the boundary semantic information through the effective integration of high-level and low-level features to alleviate blurred boundaries. Finally, feature refinement aggregation (FRA) establishes information relationships between different layers, reduces the loss of local information, and enhances local-global dependence, thus significantly improving the recognition ability of small targets. Experimental results demonstrate the effectiveness of LGBSwin, with an F1 of 91.02% on the ISPRS Vaihingen dataset and 93.35% on the ISPRS Potsdam dataset.

引用

页码：6088 / 6099

页数：12

共 47 条

[1] Crop and Weed Leaf Area Index Mapping Using Multi-Source Remote and Proximal Sensing
Asad, Muhammad Hamza
Bais, Abdul
[J]. IEEE ACCESS, 2020, 8 (138179-138190): : 138179 - 138190
[2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[3] Edge-Guided Recurrent Convolutional Neural Network for Multitemporal Remote Sensing Image Building Change Detection
Bai, Beifang
Fu, Wei
Lu, Ting
Li, Shutao
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[4] Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, DOI 10.48550/ARXIV.1706.05587]
[5] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, Liang-Chieh
Zhu, Yukun
Papandreou, George
Schroff, Florian
Adam, Hartwig
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
[6] ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks
Ding, Xiaohan
Guo, Yuchen
Ding, Guiguang
Han, Jungong
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1911 - 1920
[7] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[8] A Semantic Segmentation Method for Remote Sensing Images Based on the Swin Transformer Fusion Gabor Filter
Feng, Dongdong
Zhang, Zhihua
Yan, Kun
[J]. IEEE ACCESS, 2022, 10 : 77432 - 77451
[9] Dual Attention Network for Scene Segmentation
Fu, Jun
Liu, Jing
Tian, Haijie
Li, Yong
Bao, Yongjun
Fang, Zhiwei
Lu, Hanqing
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
[10] Semantic Segmentation of Marine Remote Sensing Based on a Cross Direction Attention Mechanism
Gao, Hao
Cao, Lin
Yu, Dingfeng
Xiong, Xuejun
Cao, Maoyong
[J]. IEEE ACCESS, 2020, 8 : 142483 - 142494

← 1 2 3 4 5 →