From center to surrounding: An interactive learning framework for hyperspectral image classification

被引:54
作者
Yang, Jiaqi [1 ]
Du, Bo [2 ,3 ,4 ,5 ]
Zhang, Liangpei [1 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
[2] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China
[3] Wuhan Univ, Inst Artificial Intelligence, Wuhan, Peoples R China
[4] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
[5] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Hyperspectral image classification; Deep learning; Transformer; Center-to-surrounding interactive learning; NETWORK; ATTENTION;
D O I
10.1016/j.isprsjprs.2023.01.024
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Owing to rich spectral and spatial information, hyperspectral image (HSI) can be utilized for finely classifying different land covers. With the emergence of deep learning techniques, convolutional neural network (CNN), fully convolutional network (FCN), and recurrent neural network (RNN) have been widely applied in the field of HSI classification. Recently, transformer-based approaches represented by Vision Transformer (ViT) have yielded promising performance on numerous tasks and have been introduced to classify HSI. However, existing methods based on the above architectures still face three crucial issues that limit the classification performance: 1) geometric constraints caused by input data, 2) contribution fuzziness of central pixels with details, and 3) interaction gap between local areas and further environments. To tackle the above problems, an interactive learning framework inspired by ViT is proposed from a center to surrounding perspective, namely the center-to -surrounding interactive learning (CSIL) framework. Different from existing works, the CSIL framework enables to achieve multi-scale, detail-aware, and space-interactive classification based on a well-designed hierarchical re-gion sampling strategy, center transformer, and surrounding transformer. Specifically, a hierarchical region sampling strategy is first proposed to flexibly generate the center region, neighbor region, and surrounding re-gion, respectively. Thus, multi-scale input data breaks the geometric constraints. Second, a center transformer is presented to obtain core characteristics in detail based on the center region. In this way, central pixels are remarkably highlighted and the details are easily perceived. Third, a surrounding transformer including inter-active self-attention learning is formulated for interacting both locally fine-grained distributions in the neighbor region and further coarse-grained environments in the surrounding region. With this structure, short-and long-term dependencies can be modeled, emphasized, and exchanged to bridge the interaction gap. Finally, the features from center transformer and surrounding transformer are integrated, then fed into a multi-layer per-ceptron for the optimization of semantic representation. Extensive experiments on six HSI datasets including small-, medium-, and large-scale scenes demonstrate the superiority over state-of-the-art CNN-, FCN-, RNN-and transformer-based approaches, even with very few training samples (for example 0.19% in complex HanChuan city scene). The source code will be available soon at https://github.com/jqyang22/CSIL.
引用
收藏
页码:145 / 166
页数:22
相关论文
共 77 条
[21]   Hyperspectral image classification based on octave convolution and multi-scale feature fusion [J].
Li, Zhiyong ;
Wen, Bo ;
Luo, Yunzhong ;
Li, Qiaochu ;
Song, Lulu .
PRECISION ENGINEERING-JOURNAL OF THE INTERNATIONAL SOCIETIES FOR PRECISION ENGINEERING AND NANOTECHNOLOGY, 2022, 75 :80-94
[22]   Multiscale DenseNet Meets With Bi-RNN for Hyperspectral Image Classification [J].
Liang, Lianhui ;
Zhang, Shaoquan ;
Li, Jun .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 :5401-5415
[23]  
Liu H., 2022, IEEE Trans.Neural Netw. Learn. Syst., V34, P1
[24]  
Liu Q., 2021, CURR CONTENTS, V13
[25]  
Lu X., 2019, IEEE T
[26]   Cross-domain road detection based on global-local adversarial learning framework from very high resolution satellite imagery [J].
Lu, Xiaoyan ;
Zhong, Yanfei ;
Zheng, Zhuo ;
Wang, Junjue .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 180 :296-312
[27]   GAMSNet: Globally aware road detection network with multi-scale residual learning [J].
Lu, Xiaoyan ;
Zhong, Yanfei ;
Zheng, Zhuo ;
Zhang, Liangpei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 175 :340-352
[28]  
Luo W., 2022, DEEPLY SUPERVISED PS
[29]   Hyperspectral Image Classification Using Attention-Based Bidirectional Long Short-Term Memory Network [J].
Mei, Shaohui ;
Li, Xingang ;
Liu, Xiao ;
Cai, Huimin ;
Du, Qian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[30]   DSSNet: A Simple Dilated Semantic Segmentation Network for Hyperspectral Imagery Classification [J].
Pan, Bin ;
Xu, Xia ;
Shi, Zhenwei ;
Zhang, Ning ;
Luo, Huanlin ;
Lan, Xianchao .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (11) :1968-1972