DSHFNet: Dynamic Scale Hierarchical Fusion Network Based on Multiattention for Hyperspectral Image and LiDAR Data Classification

被引:29
作者
Feng, Yining [1 ]
Song, Liyang [2 ]
Wang, Lu [2 ]
Wang, Xianghai [1 ,3 ]
机构
[1] Liaoning Normal Univ, Sch Geog, Dalian 116029, Peoples R China
[2] Liaoning Normal Univ, Sch Comp Sci & Artificial Intelligence, Dalian 116029, Peoples R China
[3] Liaoning Normal Univ, Sch Geog, Dalian 116029, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Feature extraction; Laser radar; Transformers; Data mining; Task analysis; Convolutional neural networks; Classification algorithms; Attention mechanism (AM); deep learning; hierarchical fusion; hyperspectral (HS)-light detection and ranging (LiDAR) classification; multiscale feature; FEATURE-EXTRACTION;
D O I
10.1109/TGRS.2023.3311535
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
With the continuous improvement of satellite sensor performance, it is becoming easier to obtain different types of remote sensing (RS) data from multiple sensors, and the fusion of hyperspectral (HS) images and light detection and ranging (LiDAR) for land use/land cover (LULC) classification has become a research hotspot. However, the current mainstream methods still have defects in feature extraction and feature fusion. In the feature extraction stage, previous methods usually use a single-scale patch as input and a fixed convolution kernel for feature extraction, which makes it difficult to extract features in line with different land cover types at the same time and to obtain high-quality features. Although multiscale feature extraction can solve the one-sidedness problem of single-scale features, it also brings the challenge of high-dimensional multiscale features. In the feature fusion stage, the current fusion methods are relatively simple. Therefore, we propose a dynamic scale hierarchical fusion network (DSHFNet) for fusion classification of HS images and LiDAR data. By calculating the similarity in the scale space and judging the information at different scales through the threshold value, the appropriate scale features are dynamically selected, the small-scale features are integrated into the large-scale features, and the dimensionality of the features is reduced. This method solves the unreliability problem of single-scale features and the high-dimensional problem of multiscale features. In the feature fusion process, different attention modules are used for hierarchical fusion, spatial attention modules are used for shallow fusion and joint feature extraction, and modal attention modules are used for deep fusion of joint features and features from different sensors to achieve complete complementarity of features. Experimental evaluations on three real RS datasets demonstrate the superiority of the proposed method compared with existing methods. The source code can be downloaded at https://github.com/SYFYN0317/DSHFNet.
引用
收藏
页数:14
相关论文
共 58 条
[11]   Multimodal hyperspectral remote sensing: an overview and perspective [J].
Gu, Yanfeng ;
Liu, Tianzhu ;
Gao, Guoming ;
Ren, Guangbo ;
Ma, Yi ;
Chanussot, Jocelyn ;
Jia, Xiuping .
SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (02)
[12]   Classification of Hyperspectral and LiDAR Data Using Coupled CNNs [J].
Hang, Renlong ;
Li, Zhu ;
Ghamisi, Pedram ;
Hong, Danfeng ;
Xia, Guiyu ;
Liu, Qingshan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (07) :4939-4950
[13]   HSI-BERT: Hyperspectral Image Classification Using the Bidirectional Encoder Representation From Transformers [J].
He, Ji ;
Zhao, Lina ;
Yang, Hongwei ;
Zhang, Mengmeng ;
Li, Wei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (01) :165-178
[14]   Multimodal remote sensing benchmark datasets for land cover classification with a shared and specific feature learning model [J].
Hong, Danfeng ;
Hu, Jingliang ;
Yao, Jing ;
Chanussot, Jocelyn ;
Zhu, Xiao Xiang .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 178 :68-80
[15]   Deep EncoderDecoder Networks for Classification of Hyperspectral and LiDAR Data [J].
Hong, Danfeng ;
Gao, Lianru ;
Hang, Renlong ;
Zhang, Bing ;
Chanussot, Jocelyn .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[16]   More Diverse Means Better: Multimodal Deep Learning Meets Remote-Sensing Imagery Classification [J].
Hong, Danfeng ;
Gao, Lianru ;
Yokoya, Naoto ;
Yao, Jing ;
Chanussot, Jocelyn ;
Du, Qian ;
Zhang, Bing .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (05) :4340-4354
[17]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
[18]   A comprehensive review of hyperspectral data fusion with lidar and sar data [J].
Kahraman, Sevcan ;
Bacher, Raphael .
ANNUAL REVIEWS IN CONTROL, 2021, 51 :236-253
[19]   Deep learning in multimodal remote sensing data fusion: A comprehensive review [J].
Li, Jiaxin ;
Hong, Danfeng ;
Gao, Lianru ;
Yao, Jing ;
Zheng, Ke ;
Zhang, Bing ;
Chanussot, Jocelyn .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 112
[20]   COMBINING FEATURE FUSION AND DECISION FUSION FOR CLASSIFICATION OF HYPERSPECTRAL AND LIDAR DATA [J].
Liao, Wenzhi ;
Bellens, Rik ;
Pizurica, Aleksandra ;
Gautama, Sidharta ;
Philips, Wilfried .
2014 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2014, :1241-1244