Dual-Branch Dynamic Modulation Network for Hyperspectral and LiDAR Data Classification

被引:13
作者
Xu, Zhengyi [1 ]
Jiang, Wen [1 ]
Geng, Jie [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710129, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
关键词
Feature extraction; Laser radar; Optimization; Hyperspectral imaging; Modulation; Training; Data mining; Feature fusion; hyperspectral image (HSI); imbalanced multimodal learning; land cover classification; multimodal remote sensing (RS); FUSION;
D O I
10.1109/TGRS.2023.3285097
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Deep learning algorithms that can effectively extract features from different modalities have achieved significant performance in multimodal remote sensing (RS) data classification. However, we actually found that the feature representation of one modality is likely to affect other modalities through parameter back-propagation. Even if multimodal models are superior to their uni-modal counterparts, they are likely to be underutilized. To solve the above issue, a dual-branch dynamic modulation network is proposed for hyperspectral (HS) and light detection and ranging (LiDAR) data classification. Firstly, a novel dynamic multimodal gradient optimization (DMGO) strategy is proposed to control the gradient modulation of each feature extraction branch adaptively. Then, a multimodal bi-directional enhancement (MBE) module is developed to integrate features of different modalities, which aims to enhance the complementarity of HS and LiDAR data. Furthermore, a feature distribution consistency (FDC) loss function is designed to quantify similarities between integrated features and dominant features, which can improve the consistency of features across modalities. Experimental evaluations on Houston2013 and Trento datasets demonstrate that our proposed network exceeds several state-of-the-art multimodal classification methods in terms of fusion classification performance.
引用
收藏
页数:13
相关论文
共 49 条
[1]  
Ismail AA, 2020, Arxiv, DOI arXiv:2011.06102
[2]   Machine-learning classification of debris-covered glaciers using a combination of Sentinel-1/-2 (SAR/optical), Landsat 8 (thermal) and digital elevation data [J].
Alifu, Haireti ;
Vuillaume, Jean-Francois ;
Johnson, Brian Alan ;
Hirabayashi, Yukiko .
GEOMORPHOLOGY, 2020, 369
[3]   The Lovasz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks [J].
Berman, Maxim ;
Triki, Amal Rannen ;
Blaschko, Matthew B. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4413-4421
[4]   Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution [J].
Chen, Yunpeng ;
Fan, Haoqi ;
Xu, Bing ;
Yan, Zhicheng ;
Kalantidis, Yannis ;
Rohrbach, Marcus ;
Yan, Shuicheng ;
Feng, Jiashi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3434-3443
[5]   When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs [J].
Cheng, Gong ;
Yang, Ceyuan ;
Yao, Xiwen ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05) :2811-2821
[6]  
Du C., 2021, arXiv
[7]   PMR: Prototypical Modal Rebalance for Multimodal Learning [J].
Fan, Yunfeng ;
Xu, Wenchao ;
Wang, Haozhao ;
Wang, Junxiao ;
Guo, Song .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :20029-20038
[8]   S2ENet: Spatial-Spectral Cross-Modal Enhancement Network for Classification of Hyperspectral and LiDAR Data [J].
Fang, Sheng ;
Li, Kaiyu ;
Li, Zhe .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[9]  
Fu J, 2019, Arxiv, DOI [arXiv:1809.02983, DOI 10.48550/ARXIV.1809.02983]
[10]   Deep Residual Network-Based Fusion Framework for Hyperspectral and LiDAR Data [J].
Ge, Chiru ;
Du, Qian ;
Sun, Weiwei ;
Wang, Keyan ;
Li, Jiaojiao ;
Li, Yunsong .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 :2458-2472