GLFFNet model for remote sensing image scene classification

被引:0
作者
Wang W. [1 ]
Deng J. [1 ]
Wang X. [1 ]
Li Z. [2 ]
Yuan P. [3 ]
机构
[1] School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha
[2] Hunan Shenfan Science and Technology Limited Company, Changsha
[3] Changsha Jingwang Information and Technology Limited Company, Changsha
来源
Cehui Xuebao/Acta Geodaetica et Cartographica Sinica | 2023年 / 52卷 / 10期
关键词
convolutional neural network; GLFFNet model; remote sensing image; scene classification; Transformer;
D O I
10.11947/j.AGCS.2023.20220286
中图分类号
学科分类号
摘要
Traditional scene classification models cannot perform multi-scale key feature extraction in remote sensing images in a lightweight and efficient manner. Deep learning methods generally have shortcomings such as large amount of calculation and slow convergence speed. In view of the above problems, this paper makes full use of the ability of CNN structure and Transformer structure to extract features at different scales, and proposes a feature extract module, named global and local features fused (GLFF) block. Based on this module, a lightweight remote sensing image scene classification model, GLFFNet, is designed, which has better local information and global information extraction ability. In order to verify the effectiveness of GLFFNet, this paper uses the open-source remote sensing image datasets RSSCN7 and SIRI-WHU to verify the complexity and recognition ability of GLFFNet and other deep learning networks. Finally, GLFFNet achieves recognition accuracy of up to 94.82% and 95.83% on RSSCN7 and SIRI-WHU datasets, respectively, which is better than other state-of-the-art models. © 2023 SinoMaps Press. All rights reserved.
引用
收藏
页码:1693 / 1702
页数:9
相关论文
共 37 条
[1]  
GONG Jianya, ZHANG Mi, HU Xiangyun, Et al., The design of deep learning framework and model for intelligent remote sensing, Acta Geodaetica et Cartographica Sinica, 51, 4, pp. 475-487, (2022)
[2]  
QIAO Xingxing, SHI Wenzao, LIU Yuanxi, Et al., Remote sensing image scene classification based on ResNet and dual attention mechanism, Computer Systems cV Applications, 30, 8, pp. 243-248, (2021)
[3]  
WANG Wang, LI Shaochun, WANG Wang, Et al., A simple deep learning network for classification of 3D mobile Li-DAR point clouds, Journal of Geodesy and Geoinformation Science, 4, 3, pp. 49-59, (2021)
[4]  
BIAN Xiaoyong, FEI Xiongjun, MU Nan, Remote sensing image scene classification based on scale-attention network, Journal of Computer Applications, 40, 3, pp. 872-877, (2020)
[5]  
XU Suhui, MU Xiaodong, ZHAO Peng, Et al., Scene classification of remote sensing image based on multi-scale feature and deep neural network[J], Acta Geodaetica et Cartographica Sinica, 45, 7, pp. 834-840, (2016)
[6]  
CAI Zhiling, WENG Qian, YE Shaozhen, Et al., Remote sensing image scene classification based on Inception-V3 [J], Remote Sensing for Land & Resources, 32, 3, pp. 80-89, (2020)
[7]  
PAN S J, YANG Qiang, A survey on transfer learning, IEEE Transactions on Knowledge and Data Engineering, 22, 10, pp. 1345-1359, (2010)
[8]  
SZEGEDY C, VANHOUCKE V, IOFFE S, Et al., Rethinking the inception architecture for computer vision, Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818-2826, (2016)
[9]  
VASWANI A, SHAZEER N, PARMAR N, Et al., Attention is all you need, Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 6000-6010, (2017)
[10]  
SWALPA K, ANKUR D, DANFENG H, Et al., Multimodal fusion transformer for remote sensing image classification