Spectral-Spatial Morphological Attention Transformer for Hyperspectral Image Classification

被引:183
作者
Roy, Swalpa Kumar [1 ]
Deria, Ankur [2 ]
Shah, Chiranjibi [3 ]
Haut, Juan M. [4 ]
Du, Qian [3 ]
Plaza, Antonio [4 ]
机构
[1] Jalpaiguri Govt Engn Coll, Dept Comp Sci & Engn, Jalpaiguri 735102, India
[2] Tech Univ Munich, Dept Informat, D-85748 Munich, Germany
[3] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
[4] Univ Extremadura, Escuela Polit, Dept Technol Comp & Commun, Hyperspectral Comp Lab, Caceres 10003, Spain
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
欧盟地平线“2020”;
关键词
Transformers; Feature extraction; Data mining; Shape; Hyperspectral imaging; Convolution; Training; Classification; hyperspectral images (HSIs); morphological transformer (morphFormer); spatial-spectral features; LAND-COVER CLASSIFICATION; CONVOLUTIONAL NEURAL-NETWORKS; REMOTE-SENSING IMAGES; PROFILES; GRAPH;
D O I
10.1109/TGRS.2023.3242346
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In recent years, convolutional neural networks (CNNs) have drawn significant attention for the classification of hyperspectral images (HSIs). Due to their self-attention mechanism, the vision transformer (ViT) provides promising classification performance compared to CNNs. Many researchers have incorporated ViT for HSI classification purposes. However, its performance can be further improved because the current version does not use spatial-spectral features. In this article, we present a new morphological transformer (morphFormer) that implements a learnable spectral and spatial morphological network, where spectral and spatial morphological convolution operations are used (in conjunction with the attention mechanism) to improve the interaction between the structural and shape information of the HSI token and the CLS token. Experiments conducted on widely used HSIs demonstrate the superiority of the proposed morphFormer over the classical CNN models and state-of-the-art transformer models. The source will be made available publicly at https://github.com/mhaut/morphFormer.
引用
收藏
页数:15
相关论文
共 71 条
[1]   Hyperspectral Image Classification-Traditional to Deep Models: A Survey for Future Prospects [J].
Ahmad, Muhammad ;
Shabbir, Sidrah ;
Roy, Swalpa Kumar ;
Hong, Danfeng ;
Wu, Xin ;
Yao, Jing ;
Khan, Adil Mehmood ;
Mazzara, Manuel ;
Distefano, Salvatore ;
Chanussot, Jocelyn .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 :968-999
[2]   GLC2000:: a new approach to global land cover mapping from Earth observation data [J].
Bartholomé, E ;
Belward, AS .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2005, 26 (09) :1959-1977
[3]   Characterisation methods for the hyperspectral sensor HySpex at DLR's calibration home base [J].
Baumgartner, Andreas ;
Gege, Peter ;
Koehler, Claas ;
Lenhard, Karim ;
Schwarzmaier, Thomas .
SENSORS, SYSTEMS, AND NEXT-GENERATION SATELLITES XVI, 2012, 8533
[4]   3-D Deep Learning Approach for Remote Sensing Image Classification [J].
Ben Hamida, Amina ;
Benoit, Alexandre ;
Lambert, Patrick ;
Ben Amar, Chokri .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (08) :4420-4434
[5]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[6]   Classification of Urban Functional Areas From Remote Sensing Images and Time-Series User Behavior Data [J].
Chen, Chen ;
Yan, Jining ;
Wang, Lizhe ;
Liang, Dong ;
Zhang, Wanfeng .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 :1207-1221
[7]  
Cho KYHY, 2014, Arxiv, DOI arXiv:1409.1259
[8]   Morphological Attribute Profiles for the Analysis of Very High Resolution Images [J].
Dalla Mura, Mauro ;
Benediktsson, Jon Atli ;
Waske, Bjoern ;
Bruzzone, Lorenzo .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2010, 48 (10) :3747-3762
[9]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[10]  
Du X., 2017, Tech. Rep.