A complementary integrated Transformer network for hyperspectral image classification

被引：13

作者：

Liao, Diling ^{[1
]}

Shi, Cuiping ^{[1
]}

Wang, Liguo ^{[2
]}

机构：

[1] Qiqihar Univ, Coll Commun & Elect Engn, Qiqihar 161000, Peoples R China

[2] Dalian Nationalities Univ, Coll Informat & Commun Engn, Dalian, Peoples R China

来源：

CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY | 2023年 / 8卷 / 04期

基金：

中国国家自然科学基金;

关键词：

complementary integrated Transformer module; convolutional neural network; Gaussian modulation; Transformer;

D O I：

10.1049/cit2.12150

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the past, convolutional neural network (CNN) has become one of the most popular deep learning frameworks, and has been widely used in Hyperspectral image classification tasks. Convolution (Conv) in CNN uses filter weights to extract features in local receiving domain, and the weight parameters are shared globally, which more focus on the high-frequency information of the image. Different from Conv, Transformer can obtain the long-term dependence between long-distance features through modelling, and adaptively focus on different regions. In addition, Transformer is considered as a low-pass filter, which more focuses on the low-frequency information of the image. Considering the complementary characteristics of Conv and Transformer, the two modes can be integrated for full feature extraction. In addition, the most important image features correspond to the discrimination region, while the secondary image features represent important but easily ignored regions, which are also conducive to the classification of HSIs. In this study, a complementary integrated Transformer network (CITNet) for hyperspectral image classification is proposed. Firstly, three-dimensional convolution (Conv3D) and two-dimensional convolution (Conv2D) are utilised to extract the shallow semantic information of the image. In order to enhance the secondary features, a channel Gaussian modulation attention module is proposed, which is embedded between Conv3D and Conv2D. This module can not only enhance secondary features, but suppress the most important and least important features. Then, considering the different and complementary characteristics of Conv and Transformer, a complementary integrated Transformer module is designed. Finally, through a large number of experiments, this study evaluates the classification performance of CITNet and several state-of-the-art networks on five common datasets. The experimental results show that compared with these classification networks, CITNet can provide better classification performance.

引用

页码：1288 / 1307

页数：20

共 57 条

[31] Deep learning classifiers for hyperspectral imaging: A review [J].

Paoletti, M. E. ;

Haut, J. M. ;

Plaza, J. ;

Plaza, A. .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 158 :279-317

[32] Capsule Networks for Hyperspectral Image Classification [J].

Paoletti, Mercedes E. ;

Haut, Juan Mario ;

Fernandez-Beltran, Ruben ;

Plaza, Javier ;

Plaza, Antonio ;

Li, Jun ;

Pla, Filiberto .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (04) :2145-2160

[33] Deep Pyramidal Residual Networks for Spectral-Spatial Hyperspectral Image Classification [J].

Paoletti, Mercedes E. ;

Mario Haut, Juan ;

Fernandez-Beltran, Ruben ;

Plaza, Javier ;

Plaza, Antonio J. ;

Pla, Filiberto .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (02) :740-754

[34]

Park N., 2022, PROC INT C LEARN REP

[35] Improved Transformer Net for Hyperspectral Image Classification [J].

Qing, Yuhao ;

Liu, Wenyi ;

Feng, Liuyan ;

Gao, Wanjia .

REMOTE SENSING, 2021, 13 (11)

[36] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].

Ren, Shaoqing ;

He, Kaiming ;

Girshick, Ross ;

Sun, Jian .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149

[37] HybridSN: Exploring 3-D-2-D CNN Feature Hierarchy for Hyperspectral Image Classification [J].

Roy, Swalpa Kumar ;

Krishna, Gopal ;

Dubey, Shiv Ram ;

Chaudhuri, Bidyut B. .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (02) :277-281

[38]

Sabour S, 2017, Arxiv, DOI [arXiv:1710.09829, DOI 10.48550/ARXIV.1710.09829, 10.48550/ARXIV.1710.09829]

[39] Fully Convolutional Networks for Semantic Segmentation [J].

Shelhamer, Evan ;

Long, Jonathan ;

Darrell, Trevor .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) :640-651

[40]

Simonyan K, 2015, Arxiv, DOI [arXiv:1409.1556, DOI 10.48550/ARXIV.1409.1556]

← 1 2 3 4 5 6 →