SVAFormer: Integrating Random and Hierarchical Spectral View Attention for Hyperspectral Image Classification

被引：0

作者：

Chen, Ning ^{[1
]}

Huang, Zhou ^{[1
]}

Yue, Xia ^{[2
]}

Liu, Anfeng ^{[3
]}

Lu, Meiyun ^{[4
]}

Yue, Jun ^{[4
]}

Fang, Leyuan ^{[5
,6
]}

机构：

[1] Peking Univ, Inst Remote Sensing & Geog Informat Syst, Beijing 100871, Peoples R China

[2] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China

[3] Cent South Univ, Sch Elect Informat, Changsha 410083, Peoples R China

[4] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China

[5] Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Peoples R China

[6] Peng Cheng Lab, Shenzhen 518000, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

基金：

中国国家自然科学基金;

关键词：

Transformers; Feature extraction; Accuracy; Radar imaging; Hyperspectral imaging; Data models; Training; Data mining; Image classification; Electronic mail; Attention; deep neural network; hyperspectral image (HSI) classification; Transformer; CNN;

D O I：

10.1109/TGRS.2024.3509478

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Recently, hyperspectral image (HSI) classification methods based on Transformers have developed rapidly. However, these methods still face challenges in handling the widely varying scales and diverse spatial distribution patterns commonly found in HSIs. To address these issues, this article proposes a simple, yet novel HSI classification framework named the spectral view attention Transformer (SVAFormer). Built on the Transformer mechanism, this framework enhances the integration of spectral and spatial features by allowing the spectral token, corresponding to the pixel to be classified, to access spatial neighborhood information from multiple perspectives and levels. Specifically, the framework employs random masking techniques to provide spectral tokens with spatial neighborhood information from different viewpoints, enabling the model to handle diverse land-cover distribution patterns. Additionally, the framework introduces a spectral token-aware pooling layer between adjacent Transformer blocks, which preserves the central role of spectral tokens while progressively expanding the spatial scale represented by each token. This reduces the Transformer's focus on spatially fragmented information and enables spectral tokens to concentrate on spatial neighborhood information at various levels and scales. The key characteristic of this framework is its ability to effectively handle land-cover features of different scales and shapes by strengthening the fusion of spectral and spatial characteristics. Experimental results on multiple public datasets demonstrate that our framework outperforms previous state-of-the-art methods. For the sake of reproducibility, the source code of SVAFormer will be publicly available at https://github.com/chenning0115/SVAFormer.

引用

页数：13

共 43 条

[1] WaveFormer: Spectral-Spatial Wavelet Transformer for Hyperspectral Image Classification [J].

Ahmad, Muhammad ;

Ghous, Usman ;

Usama, Muhammad ;

Mazzara, Manuel .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 :1-5

[2] Classification of Hyperspectral Images With Regularized Linear Discriminant Analysis [J].

Bandos, Tatyana V. ;

Bruzzone, Lorenzo ;

Camps-Valls, Gustavo .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (03) :862-873

[3] Local Similarity-Based Spatial-Spectral Fusion Hyperspectral Image Classification With Deep CNN and Gabor Filtering [J].

Bhatti, Uzair Aslam ;

Yu, Zhaoyuan ;

Chanussot, Jocelyn ;

Zeeshan, Zeeshan ;

Yuan, Linwang ;

Luo, Wen ;

Nawaz, Saqib Ali ;

Bhatti, Mughair Aslam ;

ul Ain, Qurat ;

Mehmood, Anum .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[4] Spectral Query Spatial: Revisiting the Role of Center Pixel in Transformer for Hyperspectral Image Classification [J].

Chen, Ning ;

Fang, Leyuan ;

Xia, Yang ;

Xia, Shaobo ;

Liu, Hui ;

Yue, Jun .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :1-14

[5] TempEE: Temporal-Spatial Parallel Transformer for Radar Echo Extrapolation Beyond Autoregression [J].

Chen, Shengchao ;

Shu, Ting ;

Zhao, Huan ;

Zhong, Guo ;

Chen, Xunlai .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61

[6] ATTENTION-BASED DUAL-STREAM VISION TRANSFORMER FOR RADAR GAIT RECOGNITION [J].

Chen, Shiliang ;

He, Wentao ;

Ren, Jianfeng ;

Jiang, Xudong .

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, :3668-3672

[7] Multi- and hyperspectral classification of soft-bottom intertidal vegetation using a spectral library for coastal biodiversity remote sensing [J].

Davies, Bede Ffinian Rowe ;

Gernez, Pierre ;

Geraud, Andrea ;

Oiry, Simon ;

Rosa, Philippe ;

Zoffoli, Maria Laura ;

Barille, Laurent .

REMOTE SENSING OF ENVIRONMENT, 2023, 290

[8] Improving PRISMA hyperspectral spatial resolution and geolocation by using Sentinel-2: development and test of an operational procedure in urban and rural areas [J].

De Luca, Giandomenico ;

Carotenuto, Federico ;

Genesio, Lorenzo ;

Pepe, Monica ;

Toscano, Piero ;

Boschetti, Mirco ;

Miglietta, Franco ;

Gioli, Beniamino .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2024, 215 :112-135

[9] Quantitative Estimation of Wheat Stripe Rust Disease Index Using Unmanned Aerial Vehicle Hyperspectral Imagery and Innovative Vegetation Indices [J].

Deng, Jie ;

Wang, Rui ;

Yang, Lujia ;

Lv, Xuan ;

Yang, Ziqian ;

Zhang, Kai ;

Zhou, Congying ;

Li, Pengju ;

Wang, Zhifang ;

Abdullah, Ahsan ;

Ma, Zhanhong .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61

[10] Multi-feature fusion: Graph neural network and CNN combining for hyperspectral image classification [J].

Ding, Yao ;

Zhang, Zhili ;

Zhao, Xiaofeng ;

Hong, Danfeng ;

Cai, Wei ;

Yu, Chengguo ;

Yang, Nengjun ;

Cai, Weiwei .

NEUROCOMPUTING, 2022, 501 :246-257

← 1 2 3 4 5 →