From center to surrounding: An interactive learning framework for hyperspectral image classification

被引:53
作者
Yang, Jiaqi [1 ]
Du, Bo [2 ,3 ,4 ,5 ]
Zhang, Liangpei [1 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
[2] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China
[3] Wuhan Univ, Inst Artificial Intelligence, Wuhan, Peoples R China
[4] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[5] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Hyperspectral image classification; Deep learning; Transformer; Center-to-surrounding interactive learning; NETWORK; ATTENTION;
D O I
10.1016/j.isprsjprs.2023.01.024
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Owing to rich spectral and spatial information, hyperspectral image (HSI) can be utilized for finely classifying different land covers. With the emergence of deep learning techniques, convolutional neural network (CNN), fully convolutional network (FCN), and recurrent neural network (RNN) have been widely applied in the field of HSI classification. Recently, transformer-based approaches represented by Vision Transformer (ViT) have yielded promising performance on numerous tasks and have been introduced to classify HSI. However, existing methods based on the above architectures still face three crucial issues that limit the classification performance: 1) geometric constraints caused by input data, 2) contribution fuzziness of central pixels with details, and 3) interaction gap between local areas and further environments. To tackle the above problems, an interactive learning framework inspired by ViT is proposed from a center to surrounding perspective, namely the center-to -surrounding interactive learning (CSIL) framework. Different from existing works, the CSIL framework enables to achieve multi-scale, detail-aware, and space-interactive classification based on a well-designed hierarchical re-gion sampling strategy, center transformer, and surrounding transformer. Specifically, a hierarchical region sampling strategy is first proposed to flexibly generate the center region, neighbor region, and surrounding re-gion, respectively. Thus, multi-scale input data breaks the geometric constraints. Second, a center transformer is presented to obtain core characteristics in detail based on the center region. In this way, central pixels are remarkably highlighted and the details are easily perceived. Third, a surrounding transformer including inter-active self-attention learning is formulated for interacting both locally fine-grained distributions in the neighbor region and further coarse-grained environments in the surrounding region. With this structure, short-and long-term dependencies can be modeled, emphasized, and exchanged to bridge the interaction gap. Finally, the features from center transformer and surrounding transformer are integrated, then fed into a multi-layer per-ceptron for the optimization of semantic representation. Extensive experiments on six HSI datasets including small-, medium-, and large-scale scenes demonstrate the superiority over state-of-the-art CNN-, FCN-, RNN-and transformer-based approaches, even with very few training samples (for example 0.19% in complex HanChuan city scene). The source code will be available soon at https://github.com/jqyang22/CSIL.
引用
收藏
页码:145 / 166
页数:22
相关论文
共 77 条
[1]   A Fast and Compact 3-D CNN for Hyperspectral Image Classification [J].
Ahmad, Muhammad ;
Khan, Adil Mehmood ;
Mazzara, Manuel ;
Distefano, Salvatore ;
Ali, Mohsin ;
Sarfraz, Muhammad Shahzad .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[2]   Object-based classification of hyperspectral data using Random Forest algorithm [J].
Amini, Saeid ;
Homayouni, Saeid ;
Safari, Abdolreza ;
Darvishsefat, Ali A. .
GEO-SPATIAL INFORMATION SCIENCE, 2018, 21 (02) :127-138
[3]   Local Similarity-Based Spatial-Spectral Fusion Hyperspectral Image Classification With Deep CNN and Gabor Filtering [J].
Bhatti, Uzair Aslam ;
Yu, Zhaoyuan ;
Chanussot, Jocelyn ;
Zeeshan, Zeeshan ;
Yuan, Linwang ;
Luo, Wen ;
Nawaz, Saqib Ali ;
Bhatti, Mughair Aslam ;
ul Ain, Qurat ;
Mehmood, Anum .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[4]   ON ENHANCED ENSEMBLE LEARNING FOR MULTIMODAL REMOTE SENSING DATA ANALYSIS BY CAPACITY OPTIMIZATION [J].
Chlaily, Saloua ;
Ienco, Dino ;
Jutten, Christian ;
Marinoni, Andrea .
2021 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2021, :151-155
[5]  
Dong S., 2022, IEEE T GEOSCI ELECT, V60, P1, DOI DOI 10.1109/TGRS.2022.3183189
[6]   A Pixel Cluster CNN and Spectral-Spatial Fusion Algorithm for Hyperspectral Image Classification With Small-Size Training Samples [J].
Dong, Shuxian ;
Quan, Yinghui ;
Feng, Wei ;
Dauphin, Gabriel ;
Gao, Lianru ;
Xing, Mengdao .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 :4101-4114
[7]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[8]   Adaptive spectral-spatial feature fusion network for hyperspectral image classification using limited training samples [J].
Gao, Hongmin ;
Chen, Zhonghao ;
Xu, Feng .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 107
[9]  
Gao J., 2022, CURR CONTENTS
[10]   Hyperspectral Image Classification Using CNN-Enhanced Multi-Level Haar Wavelet Features Fusion Network [J].
Guo, Wenhui ;
Xu, Guixun ;
Liu, Baodi ;
Wang, Yanjiang .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19