Integrating Multiscale Spatial-Spectral Shuffling Convolution With 3-D Lightweight Transformer for Hyperspectral Image Classification

被引:3
作者
Wu, Qinggang [1 ]
He, Mengkun [1 ]
Chen, Qiqiang [1 ]
Sun, Le [2 ,3 ]
Ma, Chao [1 ]
机构
[1] Zhengzhou Univ Light Ind, Coll Comp Sci & Technol, Zhengzhou 450000, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Transformers; Convolution; Data mining; Training; Three-dimensional displays; Complexity theory; Mathematical models; Kernel; Adaptation models; Convolutional neural networks (CNNs); hyperspectral image (HSI) classification; multimetric adaptive learning rate (MALR); 3-D lightweight transformer (3DLT); DISCRIMINANT-ANALYSIS; CLASSIFIERS; NETWORK;
D O I
10.1109/JSTARS.2025.3533211
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The combination of convolutional neural networks and vision transformers has garnered considerable attention in hyperspectral image (HSI) classification due to their abilities to enhance the classification accuracy by concurrently extracting local and global features. However, these accuracy improvements come at the cost of significant demands on storage resources, computational overhead, and extensive training samples. To address these challenges, this article proposes a multiscale spatial-spectral shuffling convolution integrated with a 3-D lightweight transformer (MSC-3DLT) for HSI classification. This network directly captures 3-D structural features throughout the entire feature extraction process, thereby enhancing HSI classification performance even at small sampling rates within a lightweight framework. Specifically, we first design a multiscale spatial-spectral shuffling convolution to comprehensively refine spatial-spectral feature granularities and enhance feature interactions by shuffling multiscale features across different groups. Second, to maximize the exploitation of limited training samples, we rethink transformers from the 3-D structural perspective of HSI data and propose a novel 3-D lightweight transformer (3DLT). Different from the slicing operation employed in classical transformers, the 3DLT directly extracts the inherent 3-D structural features from the HSI and mitigates quadratic complexity through a lightweight spatial-spectral pooling cross-attention mechanism. Finally, a novel training strategy is designed to adaptively adjust the learning rate based on multimetric feedback during the model training process, significantly accelerating the model fitting speed. Extensive experiments demonstrate that the proposed MSC-3DLT method remains highly competitive compared with state-of-the-art methods in terms of classification accuracy, model parameters, and floating point and operations under small sampling rates.
引用
收藏
页码:5378 / 5394
页数:17
相关论文
共 65 条
[1]  
Ardouin JP, 2007, 2007 PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOLS 1-4, P1632
[2]   3-D Deep Learning Approach for Remote Sensing Image Classification [J].
Ben Hamida, Amina ;
Benoit, Alexandre ;
Lambert, Patrick ;
Ben Amar, Chokri .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (08) :4420-4434
[3]   Quantitative Detection of Settled Dust Over Green Canopy Using Sparse Unmixing of Airborne Hyperspectral Data [J].
Brook, Anna ;
Ben Dor, Eyal .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (02) :884-897
[4]   A NEW K-NEAREST NEIGHBOR DENSITY-BASED CLUSTERING METHOD AND ITS APPLICATION TO HYPERSPECTRAL IMAGES [J].
Cariou, Claude ;
Chehdi, Kacem .
2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, :6161-6164
[5]  
Dai Z, 2021, ADV NEUR IN, V34
[6]   Tree Species Classification in Boreal Forests With Hyperspectral Data [J].
Dalponte, Michele ;
Orka, Hans Ole ;
Gobakken, Terje ;
Gianelle, Damiano ;
Naesset, Erik .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2013, 51 (05) :2632-2645
[7]  
Dosovitskiy Alexey., 2021, PROC INT C LEARN REP, P2021
[8]   Multiattention Joint Convolution Feature Representation With Lightweight Transformer for Hyperspectral Image Classification [J].
Fang, Yu ;
Ye, Qiaolin ;
Sun, Le ;
Zheng, Yuhui ;
Wu, Zebin .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[9]   Divide-and-Conquer Dual-Architecture Convolutional Neural Network for Classification of Hyperspectral Images [J].
Feng, Jie ;
Wang, Lin ;
Yu, Haipeng ;
Jiao, Licheng ;
Zhang, Xiangrong .
REMOTE SENSING, 2019, 11 (05)
[10]   Learning Robust Discriminant Subspace Based on Joint L2, p- and L2,s-Norm Distance Metrics [J].
Fu, Liyong ;
Li, Zechao ;
Ye, Qiaolin ;
Yin, Hang ;
Liu, Qingwang ;
Chen, Xiaobo ;
Fan, Xijian ;
Yang, Wankou ;
Yang, Guowei .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) :130-144