DMSCA: deep multiscale cross-modal attention network for hyperspectral and light detection and ranging data fusion and joint classification

被引:3
作者
Yu, Wenbo [1 ,2 ]
Huang, Fenghua [2 ]
机构
[1] Soochow Univ, Sch Elect & Informat Engn, Dept Elect & Informat Engn, Suzhou, Peoples R China
[2] Yango Univ, Fujian Key Lab Spatial Informat Percept & Intellig, Fuzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
hyperspectral; light detection and ranging; deep learning; attention mechanism; multimodal fusion; classification; FEATURE-EXTRACTION;
D O I
10.1117/1.JRS.18.036505
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Hyperspectral and light detection and ranging (LiDAR) imaging instruments capture on-ground object information from diverse perspectives, reflecting spectral-spatial and elevation descriptions, respectively. Their complementary capturing feasibilities contribute to enhancing accurate landcover identification in multimodal data fusion tasks. However, their heterogeneous distributions always impede fusion and joint classification performance, leading to wrong classification phenomena. To solve this challenge, we proposed a deep multiscale cross-modal attention (DMSCA) network for hyperspectral and LiDAR data fusion and joint classification. Compared with existing methods, our primary motivation is to explore the intrinsic connection between these two specific remote sensing modalities and enhance their shared attributes through the implementation of various cross-modal attention mechanisms. The extracted modality features are cross-modally integrated and exchanged, thereby enhancing the overall consistency of the simulations. Specifically, these cross-modal attention mechanisms are capable of strengthening local considerable segments considering detailed hyperspectral and LiDAR geographical descriptions. The spatial-wise attention mechanism measures the contributions of neighboring samples to classification performance. The spectral-wise attention mechanism highlights the significant hyperspectral channels in terms of channel correlation. The elevation-wise attention mechanism highly connects the hyperspectral-related attention mechanisms to detailed LiDAR elevations for information fusion. Based on these mechanisms, an adaptive fusion and joint classification framework is constructed for balancing multimodal information. Multiple experiments are conducted on three widely used datasets to prove the effectiveness of DMSCA. Experimental results prove that DMSCA outperforms state-of-the-art techniques qualitatively and quantitatively.
引用
收藏
页数:21
相关论文
共 46 条
[1]   Physics-based shading reconstruction for intrinsic image decomposition [J].
Baslamisli, Anil S. ;
Liu, Yang ;
Karaoglu, Sezer ;
Gevers, Theo .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 205
[2]   A Novel Hyperspectral Image Classification Model Using Bole Convolution With Three-Direction Attention Mechanism: Small Sample and Unbalanced Learning [J].
Cai, Weiwei ;
Ning, Xin ;
Zhou, Guoxiong ;
Bai, Xiao ;
Jiang, Yizhang ;
Li, Wei ;
Qian, Pengjiang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[3]   Multibranch Feature Fusion Network With Self- and Cross-Guided Attention for Hyperspectral and LiDAR Classification [J].
Dong, Wenqian ;
Zhang, Tian ;
Qu, Jiahui ;
Xiao, Song ;
Zhang, Tongzhen ;
Li, Yunsong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[4]   Shadow Removal of Hyperspectral Remote Sensing Images With Multiexposure Fusion [J].
Duan, Puhong ;
Hu, Shangsong ;
Kang, Xudong ;
Li, Shutao .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[5]   TOWARDS HIGH-QUALITY INTRINSIC IMAGES IN THE WILD [J].
Fu, Gang ;
Zhang, Qing ;
Xiao, Chunxia .
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, :175-180
[6]   Multitemporal Intrinsic Image Decomposition With Temporal-Spatial Energy Constraints for Remote Sensing Image Analysis [J].
Gao, Guoming ;
Liu, Baisen ;
Zhang, Xiangrong ;
Jin, Xudong ;
Gu, Yanfeng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[7]   Hyperspectral and LiDAR Data Classification Using Kernel Collaborative Representation Based Residual Fusion [J].
Ge, Chiru ;
Du, Qian ;
Li, Wei ;
Li, Yunsong ;
Sun, Weiwei .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (06) :1963-1973
[8]  
Gu Y., 2022, IEEE Trans. Geosci. Remote Sens., V60, P1
[9]   Cross-Modality Contrastive Learning for Hyperspectral Image Classification [J].
Hang, Renlong ;
Qian, Xuwei ;
Liu, Qingshan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10]   Classification of Hyperspectral and LiDAR Data Using Coupled CNNs [J].
Hang, Renlong ;
Li, Zhu ;
Ghamisi, Pedram ;
Hong, Danfeng ;
Xia, Guiyu ;
Liu, Qingshan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (07) :4939-4950