Joint Classification of Hyperspectral and LiDAR Data Using a Hierarchical CNN and Transformer

被引:77
作者
Zhao, Guangrui [1 ]
Ye, Qiaolin [2 ]
Sun, Le [3 ,4 ]
Wu, Zebin [5 ]
Pan, Chengsheng [6 ]
Jeon, Byeungwoo [7 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp & Sci, Nanjing 210044, Peoples R China
[2] Nanjing Forestry Univ, Sch Informat Sci & Technol, Nanjing 210037, Peoples R China
[3] Nanjing Univ Informat Sci & Technol NUIST, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Sch Comp & Sci, Minist Educ, Nanjing 210044, Peoples R China
[4] Nanjing Univ Informat Sci & Technol NUIST, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[5] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[6] Nanjing Univ Informat Sci & Technol, Sch Elect & Informat Engn, Nanjing 210044, Peoples R China
[7] Sungkyunkwan Univ, Sch Elect & Elect Engn, Suwon 440746, South Korea
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Feature extraction; Laser radar; Transformers; Convolutional neural networks; Data mining; Convolution; Tokenization; Convolutional neural network (CNN); hyperspectral image (HSI); joint classification; light detection and ranging (LiDAR) data; tokenization; transformer; IMAGE CLASSIFICATION; EXTINCTION PROFILES; NETWORK; FUSION;
D O I
10.1109/TGRS.2022.3232498
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The joint use of multisource remote-sensing (RS) data for Earth observation missions has drawn much attention. Although the fusion of several data sources can improve the accuracy of land-cover identification, many technical obstacles, such as disparate data structures, irrelevant physical characteristics, and a lack of training data, exist. In this article, a novel dual-branch method, consisting of a hierarchical convolutional neural network (CNN) and a transformer network, is proposed for fusing multisource heterogeneous information and improving joint classification performance. First, by combining the CNN with a transformer, the proposed dual-branch network can significantly capture and learn spectral-spatial features from hyperspectral image (HSI) data and elevation features from light detection and ranging (LiDAR) data. Then, to fuse these two sets of data features, a cross-token attention (CTA) fusion encoder is designed in a specialty. The well-designed deep hierarchical architecture takes full advantage of the powerful spatial context information extraction ability of the CNN and the strong long-range dependency modeling ability of the transformer network based on the self-attention (SA) mechanism. Four standard datasets are used in experiments to verify the effectiveness of the approach. The experimental results reveal that the proposed framework can perform noticeably better than state-of-the-art methods. The source code of the proposed method will be available publicly at https://github.com/zgr6010/Fusion_HCT.git.
引用
收藏
页数:16
相关论文
共 59 条
  • [1] DRBox-v2: An Improved Detector With Rotatable Boxes for Target Detection in SAR Images
    An, Quanzhi
    Pan, Zongxu
    Liu, Lei
    You, Hongjian
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (11): : 8333 - 8349
  • [2] Deep Learning for Classification of Hyperspectral Data
    Audebert, Nicolas
    Le Saux, Bertrand
    Lefevre, Sebastien
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2019, 7 (02) : 159 - 173
  • [3] Baumgartner A., 2012, PROC SPIE, P371
  • [4] CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
    Chen, Chun-Fu
    Fan, Quanfu
    Panda, Rameswar
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 347 - 356
  • [5] Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks
    Chen, Yushi
    Jiang, Hanlu
    Li, Chunyang
    Jia, Xiuping
    Ghamisi, Pedram
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (10): : 6232 - 6251
  • [6] Classification of Hyperspectral Images by Using Extended Morphological Attribute Profiles and Independent Component Analysis
    Dalla Mura, Mauro
    Villa, Alberto
    Benediktsson, Jon Atli
    Chanussot, Jocelyn
    Bruzzone, Lorenzo
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2011, 8 (03) : 542 - 546
  • [7] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [8] Skin Complications of Diabetes Mellitus Revealed by Polarized Hyperspectral Imaging and Machine Learning
    Dremin, Viktor
    Marcinkevics, Zbignevs
    Zherebtsov, Evgeny
    Popov, Alexey
    Grabovskis, Andris
    Kronberga, Hedviga
    Geldnere, Kristine
    Doronin, Alexander
    Meglinski, Igor
    Bykov, Alexander
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (04) : 1207 - 1216
  • [9] Du X., 2017, Tech. Tech. Rep. 20170417
  • [10] Spectral and Spatial Classification of Hyperspectral Data Using SVMs and Morphological Profiles
    Fauvel, Mathieu
    Benediktsson, Jon Atli
    Chanussot, Jocelyn
    Sveinsson, Johannes R.
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2008, 46 (11): : 3804 - 3814