Multiple vision architectures-based hybrid network for hyperspectral image classification

被引:27
作者
Zhao, Feng [1 ]
Zhang, Junjie [1 ]
Meng, Zhe [1 ]
Liu, Hanqiang [2 ]
Chang, Zhenhui [3 ]
Fan, Jiulun [1 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Commun & Informat Engn, Xian 710121, Peoples R China
[2] Shaanxi Normal Univ, Sch Comp Sci, Xian 710119, Peoples R China
[3] Xian Univ Posts & Telecommun, Sch Cyberspace Secur, Xian 710121, Peoples R China
基金
中国国家自然科学基金;
关键词
Hyperspectral image classification; Convolutional neural network; Vision transformer; Graph convolutional network; GRAPH CONVOLUTIONAL NETWORKS;
D O I
10.1016/j.eswa.2023.121032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
More recently, vision transformer (ViT) has shown competitive performance with convolutional neural network (CNN) on computer vision tasks, which provided more possibilities for accurate classification of hyperspectral image (HSI). However, whether CNN or ViT, they generally only focus on single type of feature, resulting in insufficient information utilization. For instance, CNN has powerful local feature extraction ability, while ViT pays more attention to long-range dependencies and global features. To consider multiple types of feature information, we propose a multiple vision architectures-based hybrid network (MVAHN) for HSI classification, which consists of joint CNN and transformer (JCT) structure and graph convolutional module (GCM). Firstly, JCT successfully embeds convolution operations into ViT to capture local and global features, which mainly include: 1) A spectral spatial convolution block (SSCB) is proposed to unearth local spectral spatial features. 2) A convolution embedding is aggregated into self-attention to design a local-global attention (LGA) mechanism, which can realize the seamless integration of CNN and ViT, thereby capturing local-global combined features. Secondly, a plug-and-play GCM is developed in parallel with transformer encoders to further improve the model classification ability by mining the similarity relationship between pixels in HSI. Overall, an elegant integration of these seemingly distinct paradigms is realized by MVAHN to capture multiple types of feature information. The overall accuracies (OAs) of MVAHN on Pavia University, Houston 2013, Salinas Valley, Kennedy Space Center, Indian Pines and Botswana datasets are 96.37%, 88.33%, 97.57%, 98.96%, 96.25% and 99.26%, respectively. Compared with the state-of-the-art hybrid models, MVAHN achieves competitive classification results. The source code will be available at https://github.com/ZJier/MVAHN.
引用
收藏
页数:16
相关论文
共 52 条
  • [1] [Anonymous], 2020, ARXIV
  • [2] Hyperspectral Image Classification Based on Superpixel Feature Subdivision and Adaptive Graph Structure
    Bai, Jing
    Shi, Wei
    Xiao, Zhu
    Regan, Amelia C.
    Ali, Talal Ahmed Ali
    Zhu, Yongdong
    Zhang, Rui
    Jiao, Licheng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] Hyperspectral Image Classification Based on Deep Attention Graph Convolutional Network
    Bai, Jing
    Ding, Bixiu
    Xiao, Zhu
    Jiao, Licheng
    Chen, Hongyang
    Regan, Amelia C.
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [4] Consolidated Convolutional Neural Network for Hyperspectral Image Classification
    Chang, Yang-Lang
    Tan, Tan-Hsu
    Lee, Wei-Hong
    Chang, Lena
    Chen, Ying-Nong
    Fan, Kuo-Chin
    Alkhaleefah, Mohammad
    [J]. REMOTE SENSING, 2022, 14 (07)
  • [5] SPECTRAL-SPATIAL CLASSIFICATION OF HYPERSPECTRAL IMAGES WITH MULTI-LEVEL CNN
    Chhapariya, Koushikey
    Buddhiraju, Krishna Mohan
    Kumar, Anil
    [J]. 2022 12TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2022,
  • [6] Hyperspectral Image Classification: Potentials, Challenges, and Future Directions
    Datta, Debaleena
    Mallick, Pradeep Kumar
    Bhoi, Akash Kumar
    Ijaz, Muhammad Fazal
    Shafi, Jana
    Choi, Jaeyoung
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [7] Defferrard M, 2016, ADV NEUR IN, V29
  • [8] Gao K., 2021, Academic J. Comput. Inf. Sci., V4, P11
  • [9] Spatial-Spectral Transformer for Hyperspectral Image Classification
    He, Xin
    Chen, Yushi
    Lin, Zhouhan
    [J]. REMOTE SENSING, 2021, 13 (03) : 1 - 22
  • [10] Hong D., 2021, IEEE T GEOSCI REMOTE, V60, p1 15, DOI [DOI 10.1109/TGRS.2021.3130716, 10.1109/TGRS.2021.3130716, 10.1109/tgrs.2021.3130716]