Cross Attention-Based Multi-Scale Convolutional Fusion Network for Hyperspectral and LiDAR Joint Classification

被引:0
|
作者
Ge, Haimiao [1 ,2 ]
Wang, Liguo [3 ]
Pan, Haizhu [1 ,2 ]
Liu, Yanzhong [1 ,2 ]
Li, Cheng [1 ,2 ]
Lv, Dan [1 ,2 ]
Ma, Huiyu [1 ,2 ]
机构
[1] Qiqihar Univ, Coll Comp & Control Engn, Qiqihar 161000, Peoples R China
[2] Qiqihar Univ, Heilongjiang Key Lab Big Data Network Secur Detect, Qiqihar 161000, Peoples R China
[3] Dalian Minzu Univ, Coll Informat & Commun Engn, Dalian 116600, Peoples R China
基金
中国国家自然科学基金;
关键词
HSI and LiDAR fusion classification; convolutional neural network; multi-scale feature extraction; cross attention;
D O I
10.3390/rs16214073
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In recent years, deep learning-based multi-source data fusion, e.g., hyperspectral image (HSI) and light detection and ranging (LiDAR) data fusion, has gained significant attention in the field of remote sensing. However, the traditional convolutional neural network fusion techniques always provide poor extraction of discriminative spatial-spectral features from diversified land covers and overlook the correlation and complementarity between different data sources. Furthermore, the mere act of stacking multi-source feature embeddings fails to represent the deep semantic relationships among them. In this paper, we propose a cross attention-based multi-scale convolutional fusion network for HSI-LiDAR joint classification. It contains three major modules: spatial-elevation-spectral convolutional feature extraction module (SESM), cross attention fusion module (CAFM), and classification module. In the SESM, improved multi-scale convolutional blocks are utilized to extract features from HSI and LiDAR to ensure discriminability and comprehensiveness in diversified land cover conditions. Spatial and spectral pseudo-3D convolutions, pointwise convolutions, residual aggregation, one-shot aggregation, and parameter-sharing techniques are implemented in the module. In the CAFM, a self-designed local-global cross attention block is utilized to collect and integrate relationships of the feature embeddings and generate joint semantic representations. In the classification module, average polling, dropout, and linear layers are used to map the fused semantic representations to the final classification results. The experimental evaluations on three public HSI-LiDAR datasets demonstrate the competitiveness of the proposed network in comparison with state-of-the-art methods.
引用
收藏
页数:33
相关论文
共 50 条
  • [21] Fabric Defect Classification Algorithm Based on Multi-Scale Feature Fusion of Spatial Attention
    Song Zhiyong
    Pan Haipeng
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (10)
  • [22] Impact Load Localization Based on Multi-Scale Feature Fusion Convolutional Neural Network
    Wu, Shiji
    Huang, Xiufeng
    Xu, Rongwu
    Yu, Wenjing
    Cheng, Guo
    SENSORS, 2024, 24 (18)
  • [23] Multi-scale convolutional neural network for multi-focus image fusion
    Mustafa, Hafiz Tayyab
    Yang, Jie
    Zareapoor, Masoumeh
    IMAGE AND VISION COMPUTING, 2019, 85 : 26 - 35
  • [24] Attention-Guided Fusion and Classification for Hyperspectral and LiDAR Data
    Huang, Jing
    Zhang, Yinghao
    Yang, Fang
    Chai, Li
    Tansey, Kevin
    REMOTE SENSING, 2024, 16 (01)
  • [25] QoS Prediction via Multi-scale Feature Fusion Based on Convolutional Neural Network
    Xu, Hanzhi
    Shu, Yanjun
    Zhang, Zhan
    Zuo, Decheng
    SERVICE-ORIENTED COMPUTING, ICSOC 2023, PT I, 2023, 14419 : 119 - 134
  • [26] A Cross-Attention-Based Multi-Information Fusion Transformer for Hyperspectral Image Classification
    Yang, Jinghui
    Li, Anqi
    Qian, Jinxi
    Qin, Jia
    Wang, Liguo
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 13358 - 13375
  • [27] Hyperspectral Image Classification Based on Dilated Convolutional Attention Neural Network
    Zhang Xiangdong
    Wang Tengjun
    Zhu Shaojun
    Yang Yun
    ACTA OPTICA SINICA, 2021, 41 (03)
  • [28] Infrared and visible image fusion based on multi-scale dense attention connection network
    Chen Y.
    Zhang J.
    Wang Z.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2022, 30 (18): : 2253 - 2266
  • [29] LiDAR-Guided Cross-Attention Fusion for Hyperspectral Band Selection and Image Classification
    Yang, Judy X.
    Zhou, Jun
    Wang, Jing
    Tian, Hui
    Liew, Alan Wee-Chung
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [30] MalMKNet: A Multi-Scale Convolutional Neural Network Used for Malware Classification
    Zhang D.-D.
    Song Y.-F.
    Liu S.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2023, 51 (05): : 1359 - 1369