Swin-CDSA: The Semantic Segmentation of Remote Sensing Images Based on Cascaded Depthwise Convolution and Spatial Attention Mechanism

被引:2
作者
Kang, Yuhan [1 ]
Ji, Jian [1 ]
Xu, Hekai [1 ]
Yang, Yong [1 ]
Chen, Peng [1 ]
Zhao, Hui [2 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
[2] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Xian 710119, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolution; Remote sensing; Feature extraction; Semantic segmentation; Attention mechanisms; Transformers; Semantics; Attention mechanism; remote sensing; semantic segmentation; transformer;
D O I
10.1109/LGRS.2024.3431638
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
As an important task in remote sensing image processing, semantic segmentation of remote sensing images has broad application prospects in many fields such as disaster warning and rescue, environmental protection, and road planning. Research on semantic segmentation of remote sensing images based on deep learning has made some progress, but there are still problems such as poor perception of small object features, loss of detailed information in deep feature extraction, and imprecise segmentation contours of small objects. To this end, we propose a new remote sensing semantic segmentation model Swin-CDSA, which copes these problems to some extent by designing cascaded deep convolutional modules (CDCMs) and spatial attention mechanisms (SAMs). CDCM extracts multiscale features by using multilayer convolutions with different layers but parallel fixed small-sized kernels, while SAM supplements the model's understanding of local and global information through a dual attention mechanism. We conducted experiments on the Potsdam and LoveDA datasets and achieved good results.
引用
收藏
页数:5
相关论文
共 19 条
  • [1] Adversarial Instance Augmentation for Building Change Detection in Remote Sensing Images
    Chen, Hao
    Li, Wenyuan
    Shi, Zhenwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [2] Chen J., 2021, arXiv
  • [3] Dong S., 2022, IEEE Geosci. Remote Sens. Lett., V19, P1
  • [4] Multiattention Network for Semantic Segmentation of Fine-Resolution Remote Sensing Images
    Li, Rui
    Zheng, Shunyi
    Zhang, Ce
    Duan, Chenxi
    Su, Jianlin
    Wang, Libo
    Atkinson, Peter M.
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [5] A Synergistical Attention Model for Semantic Segmentation of Remote Sensing Images
    Li, Xin
    Xu, Feng
    Liu, Fan
    Lyu, Xin
    Tong, Yao
    Xu, Zhennan
    Zhou, Jun
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [6] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
    Liu, Ze
    Lin, Yutong
    Cao, Yue
    Hu, Han
    Wei, Yixuan
    Zhang, Zheng
    Lin, Stephen
    Guo, Baining
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9992 - 10002
  • [7] FactSeg: Foreground Activation-Driven Small Object Semantic Segmentation in Large-Scale Remote Sensing Imagery
    Ma, Ailong
    Wang, Junjue
    Zhong, Yanfei
    Zheng, Zhuo
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [8] Segmenter: Transformer for Semantic Segmentation
    Strudel, Robin
    Garcia, Ricardo
    Laptev, Ivan
    Schmid, Cordelia
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7242 - 7252
  • [9] An Empirical Study of Remote Sensing Pretraining
    Wang, Di
    Zhang, Jing
    Du, Bo
    Xia, Gui-Song
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [10] Advancing Plain Vision Transformer Toward Remote Sensing Foundation Model
    Wang, Di
    Zhang, Qiming
    Xu, Yufei
    Zhang, Jing
    Du, Bo
    Tao, Dacheng
    Zhang, Liangpei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61