Multimodal Remote Sensing Data Classification Based on Gaussian Mixture Variational Dynamic Fusion Network

Cited by: 5
Authors
Wang, Haoyu [1, 2]
Liu, Xiaomin [1, 2]
Qiao, Zhenzhuang [1, 2]
Wang, Guoqing [1, 2]
Chen, Haotian [1, 2]
Affiliations
[1] China Univ Min & Technol, Engn Res Ctr Intelligent Control Underground Space, Minist Educ, Xuzhou 221116, Peoples R China
[2] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024 / Vol. 62
Funding
National Natural Science Foundation of China;
Keywords
Laser radar; Noise; Remote sensing; Topology; Data mining; Data integration; Interference; Classification; feature fusion; hyperspectral image (HSI); light detection and ranging (LiDAR); variational autoencoder; HYPERSPECTRAL IMAGE CLASSIFICATION; ZERO-SHOT; GRAPH;
DOI
10.1109/TGRS.2024.3394462
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708 ; 070902 ;
Abstract
With the development of sensor technology, the rational use of multimodal data has become a research hotspot in remote sensing. Multimodal fusion methods can effectively improve the accuracy of remote sensing data classification by exploiting the complementary information of different modalities. However, existing multimodal fusion methods face several challenges, including difficulty in suppressing spectral noise, fully mining contextual information, and learning a strongly adaptive fusion pattern. To address these challenges, a Gaussian mixture variational dynamic fusion network (GM-VDFN) is proposed. First, a multimodal multiscale spatial graph is constructed, and graph convolution is used to learn multiscale features. In this process, a spatial topology constraint based on GM (STC-GM) is proposed, which suppresses spectral noise by constraining the topological consistency of the two modalities. Second, a multiscale dynamic graph aggregation module (MDGAM) is constructed, which captures the shareable class identification information from multiscale features and mines personalized fusion patterns suited to each sample. Finally, the evidence lower bound for the multimodal joint distribution is derived, and a multimodal variational autoencoder (M-VAE) is designed. Optimizing the evidence lower bound models the multimodal joint distribution and thereby learns a strongly adaptive fusion pattern between modalities. Experimental results on four fusion datasets (Houston 2013, Trento, MUUFL, and Houston 2018) show that GM-VDFN achieves state-of-the-art performance in multimodal remote sensing data classification tasks.
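For orientation, the evidence lower bound mentioned in the abstract can be illustrated with the generic two-modality form below. This is a sketch under stated assumptions, not the derivation given in the paper: the symbols $x_h$ (HSI input), $x_l$ (LiDAR input), $z$ (shared latent variable), $q_\phi$ (encoder), and $p_\theta$ (decoder) are introduced here for illustration only, and the two modalities are assumed to be conditionally independent given $z$. In a Gaussian mixture setting the prior $p(z)$ would be a mixture of Gaussians rather than a single standard normal.

```latex
% Generic two-modality ELBO (illustrative; notation is assumed, not taken from the paper)
\log p_\theta(x_h, x_l)
  \;\ge\;
  \mathbb{E}_{q_\phi(z \mid x_h, x_l)}
    \bigl[ \log p_\theta(x_h \mid z) + \log p_\theta(x_l \mid z) \bigr]
  \;-\;
  \mathrm{KL}\bigl( q_\phi(z \mid x_h, x_l) \,\|\, p(z) \bigr)
```

The first term rewards reconstructing both modalities from the shared latent code, and the KL term keeps the approximate posterior close to the prior; maximizing the right-hand side is what "optimizing the evidence lower bound" refers to in general.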
Pages: 1-14
Page count: 14