RCSFN: A remote sensing image scene classification and recognition network based on rectangle convolutional self attention fusion

Cited by: 1
Authors
Hou, Jingjin [1, 2]
Zhou, Houkui [1, 2]
Yu, Huimin [3, 4]
Hu, Haoji [3]
Affiliations
[1] Zhejiang A&F Univ, Sch Math & Comp Sci, Hangzhou 311300, Peoples R China
[2] Zhejiang Prov Key Lab Forestry Intelligent Monitor, Hangzhou 311300, Peoples R China
[3] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[4] State Key Lab CAD & CG, Hangzhou 310027, Peoples R China
Keywords
Remote sensing; Scene classification; Local feature fusion; Position enhancement; Attention mechanism;
DOI
10.1007/s11760-024-03511-8
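Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Subject classification code
0808; 0809;
Abstract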
Remote sensing scene classification is a critical task in the processing and analysis of remote sensing images. Traditional methods typically use standard convolutional kernels to extract feature information. Although these methods have improved, they still struggle to fully capture distinctive local details, which limits classification accuracy. Each category of remote sensing scene has its own local details, such as the rectangular features of buildings in schools or industrial areas, or the bridges and roads in parks or squares. The most important features are often these rectangular structures and their spatial positions, which standard convolutional kernels find challenging to capture effectively. To address this issue, we propose a remote sensing scene classification method based on a Rectangle Convolution Self-Attention Fusion Network (RCSFN) architecture. In RCSFN, the Rectangle Convolution Maximum Fusion (RCMF) module operates in parallel with the first 4 x 4 convolutional layer of VanillaNet-5. The RCMF module uses two different rectangular convolutional kernels to obtain different receptive fields, enhancing the extraction of shallow local features through addition and fusion. This process, combined with the concatenation of the original input features, results in richer local detail information. Additionally, we introduce an Area Selection (AS) module that focuses on selecting feature information within local regions. The Sequential Polarisation Self-Attention (SPS) mechanism, integrated with the Mini Region Convolution (MRC) module through feature multiplication, enhances important features and improves spatial positional relationships, thereby increasing the accuracy of recognising categories with rectangular or elongated features. Experiments were carried out on the AID and NWPU-RESISC45 datasets, where the overall classification accuracy was 96.56% and 92.46%, respectively. This shows that the proposed RCSFN model is feasible and effective for classification problems with distinctive local detail features.
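The idea of two rectangular kernels with different receptive fields, fused by addition and then concatenated with the original input, can be illustrated with a minimal single-channel sketch in pure Python. This is not the authors' implementation: the abstract gives neither kernel sizes nor weights, so the 1 x 3 and 3 x 1 all-ones kernels, the zero padding, and the function names `conv2d_same` and `rectangle_fusion` are assumptions made only for illustration. A real network would use learned multi-channel kernels running in parallel with the 4 x 4 stem convolution.

```python
def conv2d_same(image, kernel):
    """Cross-correlate a 2D image with a 2D kernel, zero-padded to keep the size.

    Assumes odd kernel height and width so the output aligns with the input.
    """
    h, w = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    ph, pw = kh // 2, kw // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in range(kh):
                for dj in range(kw):
                    y, x = i + di - ph, j + dj - pw
                    if 0 <= y < h and 0 <= x < w:  # outside the image counts as zero
                        acc += image[y][x] * kernel[di][dj]
            out[i][j] = acc
    return out


def rectangle_fusion(image, k_wide, k_tall):
    """Run two rectangular kernels, add their responses, and stack with the input."""
    wide = conv2d_same(image, k_wide)
    tall = conv2d_same(image, k_tall)
    fused = [[a + b for a, b in zip(row_w, row_t)] for row_w, row_t in zip(wide, tall)]
    # "Concatenation" here means stacking along a channel axis: [input, fused].
    return [image, fused]


# Hypothetical kernels (assumed shapes, not taken from the paper).
K_WIDE = [[1.0, 1.0, 1.0]]       # 1 x 3: wide receptive field
K_TALL = [[1.0], [1.0], [1.0]]   # 3 x 1: tall receptive field

# A toy 5 x 5 image containing one horizontal, road-like stripe.
image = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]

channels = rectangle_fusion(image, K_WIDE, K_TALL)
for row in channels[1]:
    print(row)
```

In this toy input the wide kernel responds strongly along the stripe, while the tall kernel spreads a weaker response into the rows directly above and below it, so the fused map carries both the elongated structure and some of its vertical context.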
Pages: 8739-8756
Page count: 18
Related papers
50 in total
  • [31] SEMSDNet: A Multiscale Dense Network With Attention for Remote Sensing Scene Classification
    Tian, Tian
    Li, Lingling
    Chen, Weitao
    Zhou, Huabing
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 5501 - 5514
  • [32] Remote Sensing Scene Classification via Multi-Branch Local Attention Network
    Chen, Si-Bao
    Wei, Qing-Song
    Wang, Wen-Zhong
    Tang, Jin
    Luo, Bin
    Wang, Zu-Yuan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 99 - 109
  • [33] Remote Sensing Image Scene Classification Based on Head-Tail Global Joint Dual Attention Discrimination Network
    Wei, Lin
    Geng, Chao
    Yin, Yuping
    IEEE ACCESS, 2023, 11 : 88305 - 88316
  • [34] Gradient-Guided Multiscale Focal Attention Network for Remote Sensing Scene Classification
    Zhao, Yue
    Gong, Maoguo
    Qin, A. K.
    Zhang, Mingyang
    Hu, Zhuping
    Gao, Tianqi
    Pu, Yan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [35] Efficient Convolutional Neural Architecture Search for Remote Sensing Image Scene Classification
    Peng, Cheng
    Li, Yangyang
    Jiao, Licheng
    Shang, Ronghua
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (07) : 6092 - 6105
  • [36] A Multi-Branch Feature Fusion Strategy Based on an Attention Mechanism for Remote Sensing Image Scene Classification
    Shi, Cuiping
    Zhao, Xin
    Wang, Liguo
    REMOTE SENSING, 2021, 13 (10)
  • [37] Transformer-based convolutional neural network approach for remote sensing natural scene classification
    Sivasubramanian, Arrun
    Prashanth, V. R.
    Hari, Theivaprakasham
    Sowmya, V.
    Gopalakrishnan, E. A.
    Ravi, Vinayakumar
    REMOTE SENSING APPLICATIONS-SOCIETY AND ENVIRONMENT, 2024, 33
  • [38] MGFN: A Multi-Granularity Fusion Convolutional Neural Network for Remote Sensing Scene Classification
    Zeng, Zhiguo
    Chen, Xihong
    Song, Zhihua
    IEEE ACCESS, 2021, 9 : 76038 - 76046
  • [39] Scene classification of remote sensing image based on deep network and multi-scale features fusion
    Yang, Zhou
    Mu, Xiao-dong
    Zhao, Feng-an
    OPTIK, 2018, 171 : 287 - 293
  • [40] Remote Sensing Scene Classification Using Spatial Transformer Fusion Network
    Tong, Shun
    Qi, Kunlun
    Guan, Qingfeng
    Zhu, Qiqi
    Yang, Chao
    Zheng, Jie
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020 : 549 - 552