Multi-Scale Spatial Perception Attention Network for Few-Shot Hyperspectral Image Classification

被引:0
作者
Li, Yang [1 ]
Luo, Jian [2 ]
Long, Haoyu [1 ]
Jin, Qianqian [1 ]
机构
[1] China West Normal Univ, Sch Elect & Informat Engn, Nanchong 637002, Peoples R China
[2] China West Normal Univ, Sch Comp Sci, Nanchong 637002, Peoples R China
关键词
Feature extraction; Transformers; Convolutional neural networks; Training; Decoding; Convolution; Data mining; Kernel; Hyperspectral imaging; Semantics; Hyperspectral image (HSI) classification; few-shot learning; multi-scale; attention; fully convolutional network (FCN); transformer;
D O I
10.1109/ACCESS.2024.3501412
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In hyperspectral image (HSI) classification, combining the strengths of convolutional neural networks (CNNs) and Transformers can significantly enhance classification performance and model robustness. However, neural networks that combine CNNs and Transformers face classification accuracy and generalization limitations when dealing with imbalanced class samples, particularly in few-shot training scenarios. To solve the above problems, we propose a multi-scale spatial perception attention network (Ms-SPA) for few-shot HSI classification in this article. This method is based on an encoder-decoder fully convolutional network (FCN) architecture, where the encoder combines a convolutional neural network (CNN) with a Transformer module to extract local and global spatial-spectral joint features simultaneously. In the encoder, the spatial contraction perception Transformer (SCPFormer) is first proposed to improve the model's capacity for perceiving global-local joint features. Next, the multi-scale spatial attention (MSSA) module is proposed to capture spatial information at different convolution kernel scales and cascade them to form a more comprehensive representation structure. In the decoder, adaptive residual aggregation (ARA) is proposed to embed high-level semantic information into low-level features using a residual structure, thereby enhancing the perception of contextual information. A weighted CL-MixedLoss function (CL-MixedLoss) is proposed to solve the problem of imbalanced heterogeneous pixels in HSIs. Experimental results on three renowned HSI datasets indicate that our model achieves optimal classification performance, exceeding 95%, even when trained with a limited number of class samples.
引用
收藏
页码:173076 / 173090
页数:15
相关论文
共 41 条
[1]  
Ahmad M, 2024, Arxiv, DOI arXiv:2404.14955
[2]   A Fast and Compact 3-D CNN for Hyperspectral Image Classification [J].
Ahmad, Muhammad ;
Khan, Adil Mehmood ;
Mazzara, Manuel ;
Distefano, Salvatore ;
Ali, Mohsin ;
Sarfraz, Muhammad Shahzad .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[3]   Classification of Hyperspectral Images With Regularized Linear Discriminant Analysis [J].
Bandos, Tatyana V. ;
Bruzzone, Lorenzo ;
Camps-Valls, Gustavo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (03) :862-873
[4]   The Lovasz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks [J].
Berman, Maxim ;
Triki, Amal Rannen ;
Blaschko, Matthew B. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4413-4421
[5]   A novel transductive SVM for semisupervised classification of remote-sensing images [J].
Bruzzone, Lorenzo ;
Chi, Mingmin ;
Marconcini, Mattia .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2006, 44 (11) :3363-3373
[6]   Unsupervised Hybrid Network of Transformer and CNN for Blind Hyperspectral and Multispectral Image Fusion [J].
Cao, Xuheng ;
Lian, Yusheng ;
Wang, Kaixuan ;
Ma, Chao ;
Xu, Xianqing .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :1-15
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]   Semi-Supervised Adaptive Pseudo-Label Feature Learning for Hyperspectral Image Classification in Internet of Things [J].
Chen, Huayue ;
Ru, Jie ;
Long, Haoyu ;
He, Jialin ;
Chen, Tao ;
Deng, Wu .
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (19) :30754-30768
[9]  
Chen HY, 2024, IEEE T GEOSCI REMOTE, V62, DOI [10.1109/TGRS.2024.3380087, 10.1109/TGRS.2024.3417253]
[10]   Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks [J].
Chen, Yushi ;
Jiang, Hanlu ;
Li, Chunyang ;
Jia, Xiuping ;
Ghamisi, Pedram .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (10) :6232-6251