Reparameterized Feature Aggregation Convolutional Neural Network for Remote Sensing Scene Image Classification

被引:0
作者
Shi, Cuiping [1 ]
Ding, Mengxiang [2 ]
Wang, Liguo [3 ]
机构
[1] Huzhou Univ, Coll Informat Engn, Huzhou 313000, Peoples R China
[2] Qiqihar Univ, Coll Commun & Elect Engn, Qiqihar 161000, Peoples R China
[3] Dalian Nationalities Univ, Coll Informat & Commun Engn, Dalian 116000, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Convolution; Transformers; Convolutional neural networks; Remote sensing; Computer architecture; Computational modeling; Attention mechanisms; Kernel; Scene classification; Convolutional neural networks (CNNs); multilevel features; remote sensing scene classification (RSSC); reparameterization; transformer; WAVELET;
D O I
10.1109/JSTARS.2025.3568280
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the advancement of deep learning techniques, transformer has been introduced into remote sensing scene classification (RSSC). Although transformer performs well in building long-range dependencies, the computational complexity of its self-attention mechanism is proportional to the square of the input sequence length, which can lead to high computational costs and resource consumption when processing large-scale or high-resolution remote sensing images. In this study, a re-parameterized feature aggregation convolutional neural network (RepFACNN) is proposed. This is a novel network architecture that combines the advantages of convolutional neural networks (CNNs) and transformers, effectively reducing computational complexity by replacing the self-attention module with a reparametrized transformer (RepFormer). First, a RepFormer is constructed to extract multilevel features. Then, a multihead hybrid convolution module is designed to extract spatial features across various scales, enhancing the ability of the model to perceive intricate details and broader contexts simultaneously. Finally, a feature fusion module is introduced, adeptly amalgamating the features from the dual branches to facilitate more accurate and robust classification. To illustrate the effectiveness of the RepFACNN method, numerous experiments were conducted on three commonly used RSSC datasets: UC-Merced, AID, and NWPU. The experimental outcomes demonstrate that RepFACNN outperforms some state-of-the-art scene classification approaches by a large margin.
引用
收藏
页码:12603 / 12615
页数:13
相关论文
共 62 条
[1]  
Cao JM, 2020, Arxiv, DOI arXiv:2006.12030
[2]   Combining the Convolution and Transformer for Classification of Smoke-Like Scenes in Remote Sensing Images [J].
Chen, Shikun ;
Li, Weixuan ;
Cao, Yichao ;
Lu, Xiaobo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[3]   Remote Sensing Scene Classification via Multi-Branch Local Attention Network [J].
Chen, Si-Bao ;
Wei, Qing-Song ;
Wang, Wen-Zhong ;
Tang, Jin ;
Luo, Bin ;
Wang, Zu-Yuan .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :99-109
[4]   GCSANet: A Global Context Spatial Attention Deep Learning Network for Remote Sensing Scene Classification [J].
Chen, Weitao ;
Ouyang, Shubing ;
Tong, Wei ;
Li, Xianju ;
Zheng, Xiongwei ;
Wang, Lizhe .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 :1150-1162
[5]  
Chen Y., 2019, arXiv, DOI DOI 10.1109/ICCV.2019.00353
[6]   Dynamic Convolution: Attention over Convolution Kernels [J].
Chen, Yinpeng ;
Dai, Xiyang ;
Liu, Mengchen ;
Chen, Dongdong ;
Yuan, Lu ;
Liu, Zicheng .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11027-11036
[7]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883
[8]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415
[9]  
Choromanski K. M., 2021, Rethinking attention with performers
[10]   A Deep Neural Network Combined With Context Features for Remote Sensing Scene Classification [J].
Deng, Peifang ;
Huang, Hong ;
Xu, Kejie .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19