SwinTFNet: Dual-Stream Transformer With Cross Attention Fusion for Land Cover Classification

被引:5
|
作者
Ren, Bo [1 ]
Liu, Bo [1 ]
Hou, Biao [1 ]
Wang, Zhao [1 ]
Yang, Chen [1 ]
Jiao, Licheng [1 ]
机构
[1] Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Minist Educ China, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Optical imaging; Feature extraction; Optical sensors; Transformers; Adaptive optics; Optical fiber networks; Fuses; Data fusion; land cover classification (LCC); multimodality; synthetic aperture radar (SAR)-optical;
D O I
10.1109/LGRS.2024.3358899
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Land cover classification (LCC) is an important application in remote sensing data interpretation. As two common data sources, synthetic aperture radar (SAR) images can be regarded as an effective complement to optical images, which will reduce the influence caused by single-modal data. However, common LCC methods focus on designing advanced network architectures to process single-modal remote sensing data. Few works have been oriented toward improving segmentation performance through fusing multimodal data. In order to deeply integrate SAR and optical features, we propose SwinTFNet, a dual-stream deep fusion network. Through the global context modeling capability of Transformer structure, SwinTFNet models teleconnections between pixels in other regions and pixels in cloud regions for better prediction in cloud regions. In addition, a cross-attention fusion module (CAFM) is proposed to fuse features from optical and SAR data. Experimental results show that our method improves greatly in the classification of clouded images compared with other excellent segmentation methods and achieves the best performance on multimodal data. The source code of SwinTFNet is publicly available at https://github.com/XD-MG/SwinTFNet.
引用
收藏
页码:1 / 5
页数:5
相关论文
共 50 条
  • [1] A Dual-Stream Transformer With Diff-Attention for Multispectral and Panchromatic Classification
    Xu, Lin
    Zhu, Hao
    Jiao, Licheng
    Zhao, Wenhao
    Li, Xiaotong
    Hou, Biao
    Ren, Zhongle
    Ma, Wenping
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 14
  • [2] Evolutionary Dual-Stream Transformer
    Zhang, Ruohan
    Jiao, Licheng
    Li, Lingling
    Liu, Fang
    Liu, Xu
    Yang, Shuyuan
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (04) : 2166 - 2178
  • [3] Dual-stream GNN fusion network for hyperspectral classification
    Weiming Li
    Qikang Liu
    Shuaishuai Fan
    Cong’an Xu
    Hongyang Bai
    Applied Intelligence, 2023, 53 : 26542 - 26567
  • [4] Speech Emotion Recognition Using Dual-Stream Representation and Cross-Attention Fusion
    Yu, Shaode
    Meng, Jiajian
    Fan, Wenqing
    Chen, Ye
    Zhu, Bing
    Yu, Hang
    Xie, Yaoqin
    Sun, Qiuirui
    ELECTRONICS, 2024, 13 (11)
  • [5] Dual-stream GNN fusion network for hyperspectral classification
    Li, Weiming
    Liu, Qikang
    Fan, Shuaishuai
    Xu, Con'gan
    Bai, Hongyang
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26542 - 26567
  • [6] Dual-Stream Discriminative Attention Network for Cross-Scene Hyperspectral Image Classification
    Wang, Chenglong
    Guo, Yi
    Fu, Jiaojiao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [7] Dual-stream cross-modality fusion transformer for RGB-D action recognition
    Liu, Zhen
    Cheng, Jun
    Liu, Libo
    Ren, Ziliang
    Zhang, Qieshi
    Song, Chengqun
    KNOWLEDGE-BASED SYSTEMS, 2022, 255
  • [8] Dual-stream transformer-attention fusion network for short-term carbon price prediction
    Wu, Han
    Du, Pei
    ENERGY, 2024, 311
  • [9] Graph Dual-stream Convolutional Attention Fusion for precipitation nowcasting
    Vatamany, Lorand
    Mehrkanoon, Siamak
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 141
  • [10] DSSFN: A Dual-Stream Self-Attention Fusion Network for Effective Hyperspectral Image Classification
    Yang, Zian
    Zheng, Nairong
    Wang, Feng
    REMOTE SENSING, 2023, 15 (15)