Multiple vision architectures-based hybrid network for hyperspectral image classification

Cited by: 38

Authors:

Zhao, Feng [1]

Zhang, Junjie [1]

Meng, Zhe [1]

Liu, Hanqiang [2]

Chang, Zhenhui [3]

Fan, Jiulun [1]

Affiliations:

[1] Xian Univ Posts & Telecommun, Sch Commun & Informat Engn, Xian 710121, Peoples R China

[2] Shaanxi Normal Univ, Sch Comp Sci, Xian 710119, Peoples R China

[3] Xian Univ Posts & Telecommun, Sch Cyberspace Secur, Xian 710121, Peoples R China

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2023 / Vol. 234

Funding:

National Natural Science Foundation of China;

Keywords:

Hyperspectral image classification; Convolutional neural network; Vision transformer; Graph convolutional network; GRAPH CONVOLUTIONAL NETWORKS;

DOI:

10.1016/j.eswa.2023.121032

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recently, the vision transformer (ViT) has shown performance competitive with convolutional neural networks (CNNs) on computer vision tasks, opening up more possibilities for accurate hyperspectral image (HSI) classification. However, both CNNs and ViTs generally focus on a single type of feature, resulting in insufficient information utilization. For instance, a CNN has powerful local feature extraction ability, while a ViT pays more attention to long-range dependencies and global features. To consider multiple types of feature information, we propose a multiple vision architectures-based hybrid network (MVAHN) for HSI classification, which consists of a joint CNN and transformer (JCT) structure and a graph convolutional module (GCM). First, JCT embeds convolution operations into ViT to capture local and global features through two components: 1) a spectral-spatial convolution block (SSCB) that extracts local spectral-spatial features; and 2) a local-global attention (LGA) mechanism, designed by aggregating a convolution embedding into self-attention, which integrates CNN and ViT and thereby captures combined local-global features. Second, a plug-and-play GCM is developed in parallel with the transformer encoders to further improve classification ability by mining the similarity relationships between pixels in the HSI. Overall, MVAHN integrates these seemingly distinct paradigms to capture multiple types of feature information. The overall accuracies (OAs) of MVAHN on the Pavia University, Houston 2013, Salinas Valley, Kennedy Space Center, Indian Pines, and Botswana datasets are 96.37%, 88.33%, 97.57%, 98.96%, 96.25%, and 99.26%, respectively. Compared with state-of-the-art hybrid models, MVAHN achieves competitive classification results. The source code will be available at https://github.com/ZJier/MVAHN.
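
Note on the reported metric: the abstract reports results as overall accuracy (OA). As commonly defined, OA is the fraction of labeled test pixels whose predicted class matches the reference class. The sketch below illustrates that definition only; the function name `overall_accuracy` and the toy labels are hypothetical and are not taken from the paper or its code repository.

```python
def overall_accuracy(y_true, y_pred):
    """Return the fraction of samples whose predicted label equals the reference label."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one labeled sample is required")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


# Hypothetical toy labels for eight test pixels (not data from the paper).
reference = [0, 0, 1, 1, 2, 2, 2, 1]
predicted = [0, 0, 1, 2, 2, 2, 2, 1]

oa = overall_accuracy(reference, predicted)
print(f"OA = {oa:.2%}")  # prints "OA = 87.50%" (7 of 8 pixels correct)
```

Because OA pools all classes together, a class with few pixels contributes little to it, which is one reason HSI studies often report per-class or average accuracy alongside it.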


Pages: 16

References: 52 in total
