Fusion of ConvLSTM and Multi-Attention Mechanism Network for Hyperspectral Image Classification

Citations: 0
Authors
Tang Ting [1]
Xin, Pan [1]
Luo Xiao-ling [1]
Gao Xiao-jing [1]
Affiliations
[1] Inner Mongolia Agr Univ, Sch Comp & Informat Engn, Hohhot 010018, Peoples R China
Keywords
Hyperspectral image classification; Deep learning; ConvLSTM; Convolutional neural network; Attention mechanism;
DOI
10.3964/j.issn.1000-0593(2023)08-2608-09
Chinese Library Classification (CLC) number
O433 [Spectroscopy];
Discipline classification code
0703 ; 070302 ;
Abstract
In recent years, deep learning-based models have achieved remarkable results in hyperspectral image (HSI) classification. To address the low classification accuracy of deep learning-based HSI classification methods under limited sample data, this paper proposes an HSI classification method that combines ConvLSTM with a multi-attention mechanism network. The method has three branches, a spectral branch, a spatial-X branch and a spatial-Y branch, which extract spectral, spatial-X and spatial-Y features respectively; the features from the three directions are then fused for hyperspectral image classification. Because convolutional long short-term memory (ConvLSTM) performs well at learning valuable features and modeling long-term dependencies in spectral data, the spectral branch uses 3 hidden layers with a 3 × 3 convolution kernel and 150, 100 and 60 channels, respectively, to extract spectral information. On the spatial-X and spatial-Y branches, Dense spatial-X blocks and Dense spatial-Y blocks based on DenseNet and 3D-CNN extract the spatial-X and spatial-Y features, respectively. To strengthen feature extraction, each of the three branches also introduces an attention mechanism along its own feature direction: spectral attention blocks are designed to emphasize information-rich spectral bands, and a spatial-X attention block and a spatial-Y attention block are designed to emphasize information-rich pixels. Experiments were conducted on three publicly available hyperspectral datasets, namely the Indian Pines (IP), Pavia University (UP) and Salinas Valley (SV) datasets, and the method was compared with five other methods: the SVM with RBF kernel (SVM), Going Deeper with Contextual CNN (CDCNN), Fast Dense Spectral-Spatial Convolution (FDSSC), Spectral-Spatial Residual Network (SSRN) and Double-Branch Dual-Attention Mechanism Network (DBDA).
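As a rough illustration of the spectral-branch configuration described above (3 ConvLSTM hidden layers, 3 × 3 kernels, 150, 100 and 60 channels), the sketch below estimates the number of trainable parameters per layer. It is not the authors' implementation. It assumes a standard ConvLSTM cell without peephole connections, in which each of the four gates is one convolution over the concatenated input and hidden state plus a bias, and it assumes a single input channel for the first layer (ASSUMED_INPUT_CHANNELS is a hypothetical value; the abstract does not state it).

```python
# Parameter-count sketch for a stacked ConvLSTM (no peephole terms).
# Layer sizes and kernel size come from the abstract; the input channel
# count is an assumption made only for this illustration.

HIDDEN_CHANNELS = [150, 100, 60]   # from the abstract
KERNEL_SIZE = 3                    # 3 x 3 kernel, from the abstract
ASSUMED_INPUT_CHANNELS = 1         # hypothetical, not given in the abstract


def convlstm_layer_params(in_channels, hidden_channels, kernel_size):
    """Weights and biases of one ConvLSTM layer with four gates."""
    k2 = kernel_size * kernel_size
    # Each gate convolves the concatenation [input, hidden state].
    weights = 4 * (in_channels + hidden_channels) * k2 * hidden_channels
    biases = 4 * hidden_channels
    return weights + biases


def spectral_branch_params(input_channels=ASSUMED_INPUT_CHANNELS,
                           hidden_channels=HIDDEN_CHANNELS,
                           kernel_size=KERNEL_SIZE):
    """Return the per-layer parameter counts of the stacked layers."""
    counts = []
    in_ch = input_channels
    for hid in hidden_channels:
        counts.append(convlstm_layer_params(in_ch, hid, kernel_size))
        in_ch = hid  # the next layer consumes this layer's hidden state
    return counts


layer_counts = spectral_branch_params()
total_params = sum(layer_counts)
print(layer_counts, total_params)
```

Under these assumptions the counts depend only on channel widths and kernel size, not on the spatial size of the input block, which is why the estimate can be made from the abstract alone.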
In the experiments, the size of the training and validation sample sets is set to 3% of the total samples on the IP dataset and 0.5% of the total samples on the UP and SV datasets. For the proposed method and all deep learning-based methods, the batch size is 16, the optimizer is Adam, and the initial learning rate is 0.0005, adjusted dynamically during training. Since SVM classifies directly from spectral information, its input sample block is 1 × 1 pixel, while the input sample blocks of all deep learning-based methods are 9 × 9 pixels. The experimental results show that the proposed method makes full use of the spectral and spatial characteristics of HSI and achieves better results on the evaluation criteria of overall accuracy (OA), average accuracy (AA) and the Kappa coefficient. In particular, the OA of the proposed method improves by 0.12% to 2.04% on average over the second-best algorithm.
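The three evaluation criteria named above can be computed from a confusion matrix. The sketch below uses the standard definitions of overall accuracy, average (per-class) accuracy and Cohen's kappa; the function name classification_scores and the 3-class matrix are hypothetical illustrations, not results or code from the paper.

```python
# OA, AA and Cohen's kappa from a confusion matrix (standard definitions).
# confusion[i][j] = number of samples of true class i predicted as class j.

def classification_scores(confusion):
    n_classes = len(confusion)
    total = sum(sum(row) for row in confusion)
    row_sums = [sum(row) for row in confusion]
    col_sums = [sum(confusion[i][j] for i in range(n_classes))
                for j in range(n_classes)]
    correct = sum(confusion[i][i] for i in range(n_classes))

    # Overall accuracy: fraction of all samples classified correctly.
    oa = correct / total

    # Average accuracy: mean of per-class recalls (empty classes skipped).
    recalls = [confusion[i][i] / row_sums[i]
               for i in range(n_classes) if row_sums[i] > 0]
    aa = sum(recalls) / len(recalls)

    # Kappa: agreement corrected for the agreement expected by chance.
    expected = sum(row_sums[i] * col_sums[i]
                   for i in range(n_classes)) / (total * total)
    kappa = (oa - expected) / (1 - expected)
    return oa, aa, kappa


# Made-up 3-class example, for illustration only.
example_confusion = [
    [50, 2, 3],
    [4, 40, 6],
    [1, 5, 39],
]
oa, aa, kappa = classification_scores(example_confusion)
print(f"OA={oa:.4f} AA={aa:.4f} Kappa={kappa:.4f}")
```

OA weights every sample equally, so large classes dominate it, while AA weights every class equally; reporting both, together with kappa, is useful on datasets with strongly unbalanced class sizes.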
Pages: 2608-2616
Number of pages: 9
Related papers
14 in total
[1]  
Chen YS, 2016, IEEE T GEOSCIENCE RE, V54, P6232, DOI 10.1109/TGRS.2016.2584107
[2]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[3]  
Huang G, Liu Z, 2017, P IEEE C COMP VIS PA, P2261
[4]  
Lecon D, 2017, IEEE GEOSCIENCE REMO, V14, P1685
[5]   Going Deeper With Contextual CNN for Hyperspectral Image Classification [J].
Lee, Hyungtae ;
Kwon, Heesung .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (10) :4843-4855
[6]   Classification of Hyperspectral Image Based on Double-Branch Dual-Attention Mechanism Network [J].
Li, Rui ;
Zheng, Shunyi ;
Duan, Chenxi ;
Yang, Yang ;
Wang, Xiqi .
REMOTE SENSING, 2020, 12 (03)
[7]  
Li H C, 2018, ELECT LETT, V54, P628
[8]   Double-Branch Multi-Attention Mechanism Network for Hyperspectral Image Classification [J].
Ma, Wenping ;
Yang, Qifan ;
Wu, Yue ;
Zhao, Wei ;
Zhang, Xiangrong .
REMOTE SENSING, 2019, 11 (11)
[9]   Classification of hyperspectral remote sensing images with support vector machines [J].
Melgani, F ;
Bruzzone, L .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2004, 42 (08) :1778-1790
[10]  
Mnih V, 2014, arXiv, DOI 10.48550/arXiv.1406.6247