DBDAN: Dual-Branch Dynamic Attention Network for Semantic Segmentation of Remote Sensing Images

被引:0
作者
Che, Rui [1 ]
Ma, Xiaowen [1 ]
Hong, Tingfeng [1 ]
Wang, Xinyu [1 ]
Feng, Tian [1 ]
Zhang, Wei [1 ,2 ]
机构
[1] Zhejiang Univ, Sch Software Technol, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Innovat Ctr Yangtze River Delta, Jiaxing 314103, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IV | 2024年 / 14428卷
关键词
Remote Sensing; Semantic Segmentation; Attention Mechanism; Deep Learning;
D O I
10.1007/978-981-99-8462-6_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Attention mechanism is capable to capture long-range dependence. However, its independent calculation of correlations can hardly consider the complex background of remote sensing images, which causes noisy and ambiguous attention weights. To address this issue, we design a correlation attention module (CAM) to enhance appropriate correlations and suppress erroneous ones by seeking consensus among all correlation vectors, which facilitates feature aggregation. Simultaneously, we introduce the CAM into a local dynamic attention (LDA) branch and a global dynamic attention (GDA) branch to obtain the information on local texture details and global context, respectively. In addition, considering the different demands of complex and diverse geographical objects for both local texture details and global context, we devise a dynamic weighting mechanism to adaptively adjust the contributions of both branches, thereby constructing a more discriminative feature representation. Experimental results on three datasets suggest that the proposed dual-branch dynamic attention network (DBDAN), which integrates the CAM and both branches, can considerably improve the performance for semantic segmentation of remote sensing images and outperform representative state-of-the-art methods.
引用
收藏
页码:306 / 317
页数:12
相关论文
共 33 条
  • [1] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [2] LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images
    Ding, Lei
    Tang, Hao
    Bruzzone, Lorenzo
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (01): : 426 - 435
  • [3] Feng Y., 2020, ISPRS Ann. Photogrammetry Remote Sens. Spatial Inf. Sci., V2, P475
  • [4] Dual Attention Network for Scene Segmentation
    Fu, Jun
    Liu, Jing
    Tian, Haijie
    Li, Yong
    Bao, Yongjun
    Fang, Zhiwei
    Lu, Hanqing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
  • [5] ISNet: Integrate Image-Level and Semantic-Level Context for Semantic Segmentation
    Jin, Zhenchao
    Liu, Bin
    Chu, Qi
    Yu, Nenghai
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7169 - 7178
  • [6] Multiattention Network for Semantic Segmentation of Fine-Resolution Remote Sensing Images
    Li, Rui
    Zheng, Shunyi
    Zhang, Ce
    Duan, Chenxi
    Su, Jianlin
    Wang, Libo
    Atkinson, Peter M.
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [7] Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic Segmentation
    Li, Xin
    Xu, Feng
    Xia, Runliang
    Lyu, Xin
    Gao, Hongmin
    Tong, Yao
    [J]. REMOTE SENSING, 2021, 13 (15)
  • [8] Dual attention deep fusion semantic segmentation networks of large-scale satellite remote-sensing images
    Li, Xin
    Xu, Feng
    Lyu, Xin
    Gao, Hongmin
    Tong, Yao
    Cai, Sujin
    Li, Shengyang
    Liu, Daofang
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2021, 42 (09) : 3583 - 3610
  • [9] A ConvNet for the 2020s
    Liu, Zhuang
    Mao, Hanzi
    Wu, Chao-Yuan
    Feichtenhofer, Christoph
    Darrell, Trevor
    Xie, Saining
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11966 - 11976
  • [10] Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965