Joint Classification of Hyperspectral and LiDAR Data Using Hierarchical Multimodal Feature Aggregation-Based Multihead Axial Attention Transformer

被引:0
|
作者
Zhu, Fei [1 ]
Shi, Cuiping [2 ]
Shi, Kaijie [1 ]
Wang, Liguo [3 ]
机构
[1] Qiqihar Univ, Dept Commun Engn, Qiqihar 161000, Peoples R China
[2] Huzhou Univ, Coll Informat Engn, Huzhou 313000, Peoples R China
[3] Dalian Nationalities Univ, Coll Informat & Commun Engn, Dalian 116000, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2025年 / 63卷
基金
中国国家自然科学基金;
关键词
Axial attention; convolutional neural networks (CNNs); feature aggregation; hyperspectral; light detection and ranging (LiDAR); multimodal; transformer; REMOTE-SENSING DATA; EXTINCTION PROFILES; FUSION;
D O I
10.1109/TGRS.2025.3533475
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The rapid development of sensor and multimodal technology has provided more possibilities for multisource remote sensing image classification. However, some existing joint classification methods are limited to single-level feature fusion and fail to fully explore the deep correlation between cross-level features, thus limiting the effective interaction and complementarity of information between different modal data. To alleviate this issue, this article proposes a hierarchical multimodal feature aggregation-based multihead axial attention transformer (HMAT) for joint classification of hyperspectral and light detection and ranging (LiDAR) data. First, a hierarchical multimodal feature aggregation module (HMFA) is proposed to more effectively fuse spatial-spectral features of hyperspectral images (HSIs) and elevation features of LiDAR data and generate more discriminative low-dimensional feature representations. Second, a pyramid-inverted pyramid convolution module (PIP) is designed. Through the complementary feature extraction structure, PIP can more fully capture the multiscale local features in the fused feature map of hyperspectral and LiDAR data. Finally, a multihead axial attention (MHAA) component is constructed to capture information at different scales in the fused feature maps, thereby accurately modeling global dependencies. The proposed HMAT has been extensively tested on three publicly available datasets. The experimental results demonstrate that the classification performance of the proposed method outperforms that of several state-of-the-art methods.
引用
收藏
页数:17
相关论文
共 33 条
  • [31] Robust wave-feature adaptive heartbeat classification based on self-attention mechanism using a transformer model
    Hu, Shuaicong
    Cai, Wenjie
    Gao, Tijie
    Zhou, Jiajun
    Wang, Mingjie
    PHYSIOLOGICAL MEASUREMENT, 2021, 42 (12)
  • [32] Recursive Feature Elimination and Random Forest Classification of Natura 2000 Grasslands in Lowland River Valleys of Poland Based on Airborne Hyperspectral and LiDAR Data Fusion
    Demarchi, Luca
    Kania, Adam
    Ciezkowski, Wojciech
    Piorkowski, Hubert
    Ogwiecimska-Piasko, Zuzanna
    Chormanski, Jaroslaw
    REMOTE SENSING, 2020, 12 (11)
  • [33] Optimal Decision Fusion for Urban Land-Use/Land-Cover Classification Based on Adaptive Differential Evolution Using Hyperspectral and LiDAR Data
    Zhong, Yanfei
    Cao, Qiong
    Zhao, Ji
    Ma, Ailong
    Zhao, Bei
    Zhang, Liangpei
    REMOTE SENSING, 2017, 9 (08)