A Contrastive Learning Enhanced Adaptive Multimodal Fusion Network for Hyperspectral and LiDAR Data Classification

被引:1
作者
Xu, Kai [1 ]
Wang, Bangjun [1 ]
Zhu, Zhou [1 ]
Jia, Zhaohong [1 ]
Fan, Chengcheng [2 ,3 ]
机构
[1] Anhui Univ, Sch Internet, Hefei 230039, Peoples R China
[2] Chinese Acad Sci, Shanghai Engn Ctr Microsatellites, Shanghai 201210, Peoples R China
[3] Chinese Acad Sci, Innovat Acad Microsatellites, Shanghai 201210, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2025年 / 63卷
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Feature extraction; Laser radar; Contrastive learning; Semantics; Data mining; Adaptive systems; Accuracy; Geoscience and remote sensing; Unsupervised learning; Image reconstruction; cross-modal interaction (INTER); feature fusion; hyperspectral image (HSI); image classification; light detection and ranging (LiDAR); IMAGE CLASSIFICATION; INTELLIGENCE; ATTENTION;
D O I
10.1109/TGRS.2024.3521960
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In recent years, the fusion-based classification of hyperspectral image (HSI) and light detection and ranging (LiDAR) data has garnered increasing attention from researchers. However, significant image disparities exist between HSI and LiDAR data because of their distinct imaging mechanisms, which limits the fusion of HSI and LiDAR data. Therefore, establishing effective interaction (INTER) and fusion between HSI and LiDAR data is crucial. Moreover, the classification of multimodal data with limited labeled instances is another challenging task. To address these issues, we propose a contrastive learning enhanced adaptive multimodal fusion network (CAMFNet) for the joint classification of HSI and LiDAR data. CAMFNet introduces a novel semantic similarity contrastive (SSC) loss that fully utilizes a large amount of unlabeled data to learn potential complementary information between different modalities. In addition, a hierarchical fusion strategy is adopted to fuse multimodal features. At the shallow feature fusion stage, a cross-modal feature interaction fusion (CMFIF) module is proposed to guide the extraction of multimodal features and enhance the complementary information through cross-modal interaction. At the same time, adaptive feature fusion is also used to dynamically assign corresponding weights to different modal features, achieving finer fusion. At the deep feature fusion stage, a channel split and integration (CSI) block is constructed to further optimize the fused spectral-spatial and elevation features via a negative feedback mechanism. The effectiveness of CAMFNet is validated through experiments on three standard datasets, which demonstrates that CAMFNet outperforms state-of-the-art methods.
引用
收藏
页数:19
相关论文
共 56 条
[1]   Hyperspectral Image Classification Based on Deep Attention Graph Convolutional Network [J].
Bai, Jing ;
Ding, Bixiu ;
Xiao, Zhu ;
Jiao, Licheng ;
Chen, Hongyang ;
Regan, Amelia C. .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[2]  
Chen T., 2020, ARXIV200205709
[3]  
Dong P., 2017, Lidar Remote Sensing and Applications
[4]   S2ENet: Spatial-Spectral Cross-Modal Enhancement Network for Classification of Hyperspectral and LiDAR Data [J].
Fang, Sheng ;
Li, Kaiyu ;
Li, Zhe .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[5]   S2EFT: Spectral-Spatial-Elevation Fusion Transformer for hyperspectral image and LiDAR classification [J].
Feng, Yining ;
Zhu, Junheng ;
Song, Ruoxi ;
Wang, Xianghai .
KNOWLEDGE-BASED SYSTEMS, 2024, 283
[6]   DSHFNet: Dynamic Scale Hierarchical Fusion Network Based on Multiattention for Hyperspectral Image and LiDAR Data Classification [J].
Feng, Yining ;
Song, Liyang ;
Wang, Lu ;
Wang, Xianghai .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[7]   Cross-Modal Contrastive Learning for Remote Sensing Image Classification [J].
Feng, Zhixi ;
Song, Liangliang ;
Yang, Shuyuan ;
Zhang, Xinyu ;
Jiao, Licheng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[8]   Unsupervised Meta Learning With Multiview Constraints for Hyperspectral Image Small Sample set Classification [J].
Gao, Kuiliang ;
Liu, Bing ;
Yu, Xuchu ;
Yu, Anzhu .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :3449-3462
[9]   LiCa: Label-Indicate-Conditional-Alignment Domain Generalization for Pixel-Wise Hyperspectral Imagery Classification [J].
Gao, Zhe ;
Pan, Bin ;
Xu, Xia ;
Li, Tao ;
Shi, Zhenwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[10]   Hyperspectral and LiDAR Data Fusion Using Extinction Profiles and Deep Convolutional Neural Network [J].
Ghamisi, Pedram ;
Hoefle, Bernhard ;
Zhu, Xiao Xiang .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (06) :3011-3024