A Scene Classification Model Based on Global-Local Features and Attention in Lie Group Space

被引:1
作者
Xu, Chengjun [1 ,2 ]
Shu, Jingqian [1 ]
Wang, Zhenghan [1 ]
Wang, Jialin [1 ]
机构
[1] Jiangxi Normal Univ, Sch Software, Nanchang 330022, Peoples R China
[2] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
attention mechanism; feature fusion; global feature; Lie Group; local feature; remote sensing scene classification; CONVOLUTIONAL NEURAL-NETWORKS; RECOGNITION; EXTRACTION; FUSION;
D O I
10.3390/rs16132323
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The efficient fusion of global and local multi-scale features is quite important for remote sensing scene classification (RSSC). The scenes in high-resolution remote sensing images (HRRSI) contain many complex backgrounds, intra-class diversity, and inter-class similarities. Many studies have shown that global features and local features are helpful for RSSC. The receptive field of a traditional convolution kernel is small and fixed, and it is difficult to capture global features in the scene. The self-attention mechanism proposed in transformer effectively alleviates the above shortcomings. However, such models lack local inductive bias, and the calculation is complicated due to the large number of parameters. To address these problems, in this study, we propose a classification model of global-local features and attention based on Lie Group space. The model is mainly composed of three independent branches, which can effectively extract multi-scale features of the scene and fuse the above features through a fusion module. Channel attention and spatial attention are designed in the fusion module, which can effectively enhance the crucial features in the crucial regions, to improve the accuracy of scene classification. The advantage of our model is that it extracts richer features, and the global-local features of the scene can be effectively extracted at different scales. Our proposed model has been verified on publicly available and challenging datasets, taking the AID as an example, the classification accuracy reached 97.31%, and the number of parameters is 12.216 M. Compared with other state-of-the-art models, it has certain advantages in terms of classification accuracy and number of parameters.
引用
收藏
页数:22
相关论文
共 84 条
[1]   Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification [J].
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
van de Weijer, Joost ;
Molinier, Matthieu ;
Laaksonen, Jorma .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 138 :74-85
[2]  
Aral RA, 2018, IEEE INT CONF BIG DA, P2058, DOI 10.1109/BigData.2018.8622212
[3]   Remote Sensing Image Scene Classification Using Multiscale Feature Fusion Covariance Network With Octave Convolution [J].
Bai, Lin ;
Liu, Qingxin ;
Li, Cuiling ;
Ye, Zhen ;
Hui, Meng ;
Jia, Xiuping .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[4]   A Lightweight and Multiscale Network for Remote Sensing Image Scene Classification [J].
Bai, Lin ;
Liu, Qingxin ;
Li, Cuiling ;
Zhu, Chunlin ;
Ye, Zhen ;
Xi, Meng .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[5]   Vision Transformers for Remote Sensing Image Classification [J].
Bazi, Yakoub ;
Bashmal, Laila ;
Rahhal, Mohamad M. Al ;
Dayil, Reham Al ;
Ajlan, Naif Al .
REMOTE SENSING, 2021, 13 (03) :1-20
[6]   APDC-Net: Attention Pooling-Based Convolutional Network for Aerial Scene Classification [J].
Bi, Qi ;
Qin, Kun ;
Zhang, Han ;
Xie, Jiafen ;
Li, Zhili ;
Xu, Kai .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (09) :1603-1607
[7]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784
[8]   Grad-CAM plus plus : Generalized Gradient-based Visual Explanations for Deep Convolutional Networks [J].
Chattopadhay, Aditya ;
Sarkar, Anirban ;
Howlader, Prantik ;
Balasubramanian, Vineeth N. .
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :839-847
[9]   Remote Sensing Scene Classification via Multi-Branch Local Attention Network [J].
Chen, Si-Bao ;
Wei, Qing-Song ;
Wang, Wen-Zhong ;
Tang, Jin ;
Luo, Bin ;
Wang, Zu-Yuan .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :99-109
[10]   Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities [J].
Cheng, Gong ;
Xie, Xingxing ;
Han, Junwei ;
Guo, Lei ;
Xia, Gui-Song .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 :3735-3756