Spectral-Spatial Transformer Network for Hyperspectral Image Classification: A Factorized Architecture Search Framework

被引:179
作者
Zhong, Zilong [1 ,2 ]
Li, Ying [3 ]
Ma, Lingfei [4 ,5 ]
Li, Jonathan [6 ,7 ]
Zheng, Wei-Shi [1 ,2 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[2] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou 510006, Peoples R China
[3] Beijing Inst Technol, Sch Mech Engn, Beijing 100081, Peoples R China
[4] Minist Educ, Engn Res Ctr State Financial Secur, Beijing 102206, Peoples R China
[5] Cent Univ Finance & Econ, Sch Stat & Math, Beijing 102206, Peoples R China
[6] Univ Waterloo, Dept Geog & Environm Management, Waterloo, ON N2L 3G1, Canada
[7] Univ Waterloo, Dept Syst Design Engn, Waterloo, ON N2L 3G1, Canada
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022年 / 60卷
基金
中国博士后科学基金;
关键词
Convolution; Transformers; Computer architecture; Task analysis; Training; Kernel; Hyperspectral imaging; Factorized architecture search (FAS); spatial attention; spectral association; spectral-spatial transformer network (SSTN); MARKOV-RANDOM-FIELDS; AUTOMATIC DESIGN; REPRESENTATION; ATTENTION;
D O I
10.1109/TGRS.2021.3115699
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Neural networks have dominated the research of hyperspectral image classification, attributing to the feature learning capacity of convolution operations. However, the fixed geometric structure of convolution kernels hinders long-range interaction between features from distant locations. In this article, we propose a novel spectral-spatial transformer network (SSTN), which consists of spatial attention and spectral association modules, to overcome the constraints of convolution kernels. Also, we design a factorized architecture search (FAS) framework that involves two independent subprocedures to determine the layer-level operation choices and block-level orders of SSTN. Unlike conventional neural architecture search (NAS) that requires a bilevel optimization of both network parameters and architecture settings, the FAS focuses only on finding out optimal architecture settings to enable a stable and fast architecture search. Extensive experiments conducted on five popular HSI benchmarks demonstrate the versatility of SSTNs over other state-of-the-art (SOTA) methods and justify the FAS strategy. On the University of Houston dataset, SSTN obtains comparable overall accuracy to SOTA methods with a small fraction (1.2%) of multiply-and-accumulate operations compared to a strong baseline spectral-spatial residual network (SSRN). Most importantly, SSTNs outperform other SOTA networks using only 1.2% or fewer MACs of SSRNs on the Indian Pines, the Kennedy Space Center, the University of Pavia, and the Pavia Center datasets.
引用
收藏
页数:15
相关论文
共 48 条
  • [41] Deep Learning for Remote Sensing Data A technical tutorial on the state of the art
    Zhang, Liangpei
    Zhang, Lefei
    Du, Bo
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2016, 4 (02) : 22 - 40
  • [42] Zhong ZL, 2020, PROC CVPR IEEE, P13062, DOI 10.1109/CVPR42600.2020.01308
  • [43] Generative Adversarial Networks and Conditional Random Fields for Hyperspectral Image Classification
    Zhong, Zilong
    Li, Jonathan
    Clausi, David A.
    Wong, Alexander
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (07) : 3318 - 3329
  • [44] Spectral-Spatial Residual Network for Hyperspectral Image Classification: A 3-D Deep Learning Framework
    Zhong, Zilong
    Li, Jonathan
    Luo, Zhiming
    Chapman, Michael
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (02): : 847 - 858
  • [45] Subspace Structure Regularized Nonnegative Matrix Factorization for Hyperspectral Unmixing
    Zhou, Lei
    Zhang, Xueni
    Wang, Jianbo
    Bai, Xiao
    Tong, Lei
    Zhang, Liang
    Zhou, Jun
    Hancock, Edwin
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 (13) : 4257 - 4270
  • [46] Residual Spectral-Spatial Attention Network for Hyperspectral Image Classification
    Zhu, Minghao
    Jiao, Licheng
    Liu, Fang
    Yang, Shuyuan
    Wang, Jianing
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (01): : 449 - 462
  • [47] Zhu XX, 2017, IEEE GEOSC REM SEN M, V5, P8, DOI 10.1109/MGRS.2017.2762307
  • [48] Zoph B, 2016, ARXIV