Swin-CDSA: The Semantic Segmentation of Remote Sensing Images Based on Cascaded Depthwise Convolution and Spatial Attention Mechanism

被引：2

作者：

Kang, Yuhan ^{[1
]}

Ji, Jian ^{[1
]}

Xu, Hekai ^{[1
]}

Yang, Yong ^{[1
]}

Chen, Peng ^{[1
]}

Zhao, Hui ^{[2
]}

机构：

[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China

[2] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Xian 710119, Shaanxi, Peoples R China

来源：

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS | 2024年 / 21卷

基金：

中国国家自然科学基金;

关键词：

Convolution; Remote sensing; Feature extraction; Semantic segmentation; Attention mechanisms; Transformers; Semantics; Attention mechanism; remote sensing; semantic segmentation; transformer;

D O I：

10.1109/LGRS.2024.3431638

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

As an important task in remote sensing image processing, semantic segmentation of remote sensing images has broad application prospects in many fields such as disaster warning and rescue, environmental protection, and road planning. Research on semantic segmentation of remote sensing images based on deep learning has made some progress, but there are still problems such as poor perception of small object features, loss of detailed information in deep feature extraction, and imprecise segmentation contours of small objects. To this end, we propose a new remote sensing semantic segmentation model Swin-CDSA, which copes these problems to some extent by designing cascaded deep convolutional modules (CDCMs) and spatial attention mechanisms (SAMs). CDCM extracts multiscale features by using multilayer convolutions with different layers but parallel fixed small-sized kernels, while SAM supplements the model's understanding of local and global information through a dual attention mechanism. We conducted experiments on the Potsdam and LoveDA datasets and achieved good results.

引用

页数：5

共 19 条

[1] Adversarial Instance Augmentation for Building Change Detection in Remote Sensing Images
Chen, Hao
Li, Wenyuan
Shi, Zhenwei
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[2] Chen J., 2021, arXiv
[3] Dong S., 2022, IEEE Geosci. Remote Sens. Lett., V19, P1
[4] Multiattention Network for Semantic Segmentation of Fine-Resolution Remote Sensing Images
Li, Rui
Zheng, Shunyi
Zhang, Ce
Duan, Chenxi
Su, Jianlin
Wang, Libo
Atkinson, Peter M.
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[5] A Synergistical Attention Model for Semantic Segmentation of Remote Sensing Images
Li, Xin
Xu, Feng
Liu, Fan
Lyu, Xin
Tong, Yao
Xu, Zhennan
Zhou, Jun
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[6] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Liu, Ze
Lin, Yutong
Cao, Yue
Hu, Han
Wei, Yixuan
Zhang, Zheng
Lin, Stephen
Guo, Baining
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9992 - 10002
[7] FactSeg: Foreground Activation-Driven Small Object Semantic Segmentation in Large-Scale Remote Sensing Imagery
Ma, Ailong
Wang, Junjue
Zhong, Yanfei
Zheng, Zhuo
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[8] Segmenter: Transformer for Semantic Segmentation
Strudel, Robin
Garcia, Ricardo
Laptev, Ivan
Schmid, Cordelia
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7242 - 7252
[9] An Empirical Study of Remote Sensing Pretraining
Wang, Di
Zhang, Jing
Du, Bo
Xia, Gui-Song
Tao, Dacheng
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[10] Advancing Plain Vision Transformer Toward Remote Sensing Foundation Model
Wang, Di
Zhang, Qiming
Xu, Yufei
Zhang, Jing
Du, Bo
Tao, Dacheng
Zhang, Liangpei
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61

← 1 2 →