A Novel Lightweight Attention-Discarding Transformer for High-Resolution SAR Image Classification

被引:7
作者
Liu, Xingyu [1 ]
Wu, Yan [1 ]
Hu, Xin [1 ]
Li, Zhikang [1 ]
Li, Ming [2 ]
机构
[1] Xidian Univ, Sch Elect Engn, Remote Sensing Image Proc & Fus Grp, Xian 710071, Peoples R China
[2] Xidian Univ, Natl Lab Radar Signal Proc, Xian 710071, Peoples R China
关键词
Radar polarimetry; Feature extraction; Transformers; Convolution; Training; Data models; Convolutional neural networks; Batch normalization and layer normalization (BLN); high-resolution synthetic aperture radar (HR SAR); image classification; lightweight attention-discarding transformer (LAD Transformer); Swin transformer;
D O I
10.1109/LGRS.2023.3279456
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Vision transformer (ViT) has been introduced in high-resolution synthetic aperture radar (HR SAR) image classification due to its excellent global feature extraction ability. However, small samples of SAR images make it difficult to fit the ViT with excessive trainable parameters, which easily results in over-fitting in training. Meanwhile, poor capability in capturing local features of ViT limits its accuracy in SAR image classification. To solve these problems, this letter proposes a new lightweight attention-discarding transformer (LAD Transformer) for the classification of HR SAR images. In the proposed model, the backbone of the advanced Swin transformer is used to model global information and extract hierarchical features. Moreover, the vital feature extraction part of the LAD transformer completely discards the self-attention mechanism and extracts local features of SAR images by introducing lighter group convolution and channel shuffle (GC-CS) block. In addition, to address the estimation shift caused by consecutive batch normalization (BN) layers, a new composite normalization method consisting of BN and layer normalization (BLN) in GC-CS block is proposed. The experiments show that the proposed network has fewer parameters and higher classification accuracy on two real HR SAR data.
引用
收藏
页数:5
相关论文
共 50 条
[31]   Dual Self-Attention Swin Transformer for Hyperspectral Image Super-Resolution [J].
Long, Yaqian ;
Wang, Xun ;
Xu, Meng ;
Zhang, Shuyu ;
Jiang, Shuguo ;
Jia, Sen .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[32]   CACFTNet: A Hybrid Cov-Attention and Cross-Layer Fusion Transformer Network for Hyperspectral Image Classification [J].
Cheng, Shuli ;
Chan, Runze ;
Du, Anyu .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :18-18
[33]   Boosting Memory Efficiency in Transfer Learning for High-Resolution Medical Image Classification [J].
Huang, Yijin ;
Cheng, Pujin ;
Tam, Roger ;
Tang, Xiaoying .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,
[34]   Multiscale CNN With Autoencoder Regularization Joint Contextual Attention Network for SAR Image Classification [J].
Wu, Zitong ;
Hou, Biao ;
Jiao, Licheng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (02) :1200-1213
[35]   LHDACT: Lightweight Hybrid Dual Attention CNN and Transformer Network for Remote Sensing Image Change Detection [J].
Song, Xinyang ;
Hua, Zhen ;
Li, Jinjiang .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[36]   Bridging CNN and Transformer With Cross-Attention Fusion Network for Hyperspectral Image Classification [J].
Xu, Fulin ;
Mei, Shaohui ;
Zhang, Ge ;
Wang, Nan ;
Du, Qian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[37]   GAFnet: Group Attention Fusion Network for PAN and MS Image High-Resolution Classification [J].
Liu, Xu ;
Li, Lingling ;
Liu, Fang ;
Hou, Biao ;
Yang, Shuyuan ;
Jiao, Licheng .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (10) :10556-10569
[38]   Lightweight Spectral-Spatial Attention Network for Hyperspectral Image Classification [J].
Cui, Ying ;
Xia, Jinbiao ;
Wang, Zhiteng ;
Gao, Shan ;
Wang, Liguo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[39]   HIGH-RESOLUTION IMAGE CLASSIFICATION WITH CONVOLUTIONAL NETWORKS [J].
Maggiori, Emmanuel ;
Tarabalka, Yuliya ;
Charpiat, Guillaume ;
Alliez, Pierre .
2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, :5157-5160
[40]   Lightweight Single Image Super-Resolution With Multi-Scale Spatial Attention Networks [J].
Soh, Jae Woong ;
Cho, Nam Ik .
IEEE ACCESS, 2020, 8 :35383-35391