Joint Classification of Hyperspectral and LiDAR Data Using Hierarchical Multimodal Feature Aggregation-Based Multihead Axial Attention Transformer

Cited by: 0

Authors:

Zhu, Fei [1]

Shi, Cuiping [2]

Shi, Kaijie [1]

Wang, Liguo [3]

Affiliations:

[1] Qiqihar Univ, Dept Commun Engn, Qiqihar 161000, Peoples R China

[2] Huzhou Univ, Coll Informat Engn, Huzhou 313000, Peoples R China

[3] Dalian Nationalities Univ, Coll Informat & Commun Engn, Dalian 116000, Peoples R China

Source:

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2025 / Vol. 63

Funding:

National Natural Science Foundation of China;

Keywords:

Axial attention; convolutional neural networks (CNNs); feature aggregation; hyperspectral; light detection and ranging (LiDAR); multimodal; transformer; REMOTE-SENSING DATA; EXTINCTION PROFILES; FUSION;

DOI:

10.1109/TGRS.2025.3533475

Chinese Library Classification (CLC) number:

P3 [Geophysics]; P59 [Geochemistry];

Discipline classification code:

0708 ; 070902 ;

Abstract:

The rapid development of sensor and multimodal technology has provided more possibilities for multisource remote sensing image classification. However, some existing joint classification methods are limited to single-level feature fusion and fail to fully explore the deep correlation between cross-level features, thus limiting the effective interaction and complementarity of information between different modal data. To alleviate this issue, this article proposes a hierarchical multimodal feature aggregation-based multihead axial attention transformer (HMAT) for joint classification of hyperspectral and light detection and ranging (LiDAR) data. First, a hierarchical multimodal feature aggregation module (HMFA) is proposed to more effectively fuse spatial-spectral features of hyperspectral images (HSIs) and elevation features of LiDAR data and generate more discriminative low-dimensional feature representations. Second, a pyramid-inverted pyramid convolution module (PIP) is designed. Through the complementary feature extraction structure, PIP can more fully capture the multiscale local features in the fused feature map of hyperspectral and LiDAR data. Finally, a multihead axial attention (MHAA) component is constructed to capture information at different scales in the fused feature maps, thereby accurately modeling global dependencies. The proposed HMAT has been extensively tested on three publicly available datasets. The experimental results demonstrate that the classification performance of the proposed method outperforms that of several state-of-the-art methods.
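The abstract names a multihead axial attention (MHAA) component but gives no implementation details. The following is a minimal sketch of generic multihead axial attention on a fused feature map, assuming a single (H, W, C) input, residual connections, and randomly initialized projection weights. The function names (`multihead_self_attention`, `axial_attention`, `make_params`) and all shapes are illustrative assumptions, not the authors' HMAT code. It shows the general idea only: attending along the height axis and then the width axis lets every position draw on the whole map at lower cost than full 2-D self-attention.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def multihead_self_attention(x, wq, wk, wv, wo, num_heads):
    """Scaled dot-product self-attention over x of shape (batch, length, dim)."""
    b, n, d = x.shape
    dh = d // num_heads

    def split(t):
        # (batch, length, dim) -> (batch, heads, length, dim_per_head)
        return t.reshape(b, n, num_heads, dh).transpose(0, 2, 1, 3)

    q, k, v = split(x @ wq), split(x @ wk), split(x @ wv)
    scores = q @ k.transpose(0, 1, 3, 2) / np.sqrt(dh)   # (b, heads, n, n)
    out = softmax(scores, axis=-1) @ v                   # (b, heads, n, dh)
    out = out.transpose(0, 2, 1, 3).reshape(b, n, d)     # merge heads
    return out @ wo

def axial_attention(x, params_h, params_w, num_heads):
    """Attend along height, then width, on a feature map x of shape (H, W, C)."""
    # Height axis: treat each of the W columns as a sequence of length H.
    cols = x.transpose(1, 0, 2)                          # (W, H, C)
    cols = cols + multihead_self_attention(cols, *params_h, num_heads)
    x = cols.transpose(1, 0, 2)                          # back to (H, W, C)
    # Width axis: treat each of the H rows as a sequence of length W.
    return x + multihead_self_attention(x, *params_w, num_heads)

def make_params(dim, rng):
    # Hypothetical random projections (Wq, Wk, Wv, Wo) for illustration only.
    return tuple(rng.standard_normal((dim, dim)) / np.sqrt(dim) for _ in range(4))

rng = np.random.default_rng(0)
fused = rng.standard_normal((5, 7, 8))    # stand-in for a fused HSI + LiDAR patch
out = axial_attention(fused, make_params(8, rng), make_params(8, rng), num_heads=2)
print(out.shape)
```

Each axial pass costs on the order of H·W·(H + W) score entries rather than (H·W)² for full 2-D attention, which is the usual motivation for the axial factorization.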


Pages: 17

