共 33 条
Joint Classification of Hyperspectral and LiDAR Data Using Hierarchical Multimodal Feature Aggregation-Based Multihead Axial Attention Transformer
被引:0
|作者:
Zhu, Fei
[1
]
Shi, Cuiping
[2
]
Shi, Kaijie
[1
]
Wang, Liguo
[3
]
机构:
[1] Qiqihar Univ, Dept Commun Engn, Qiqihar 161000, Peoples R China
[2] Huzhou Univ, Coll Informat Engn, Huzhou 313000, Peoples R China
[3] Dalian Nationalities Univ, Coll Informat & Commun Engn, Dalian 116000, Peoples R China
来源:
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
|
2025年
/
63卷
基金:
中国国家自然科学基金;
关键词:
Axial attention;
convolutional neural networks (CNNs);
feature aggregation;
hyperspectral;
light detection and ranging (LiDAR);
multimodal;
transformer;
REMOTE-SENSING DATA;
EXTINCTION PROFILES;
FUSION;
D O I:
10.1109/TGRS.2025.3533475
中图分类号:
P3 [地球物理学];
P59 [地球化学];
学科分类号:
0708 ;
070902 ;
摘要:
The rapid development of sensor and multimodal technology has provided more possibilities for multisource remote sensing image classification. However, some existing joint classification methods are limited to single-level feature fusion and fail to fully explore the deep correlation between cross-level features, thus limiting the effective interaction and complementarity of information between different modal data. To alleviate this issue, this article proposes a hierarchical multimodal feature aggregation-based multihead axial attention transformer (HMAT) for joint classification of hyperspectral and light detection and ranging (LiDAR) data. First, a hierarchical multimodal feature aggregation module (HMFA) is proposed to more effectively fuse spatial-spectral features of hyperspectral images (HSIs) and elevation features of LiDAR data and generate more discriminative low-dimensional feature representations. Second, a pyramid-inverted pyramid convolution module (PIP) is designed. Through the complementary feature extraction structure, PIP can more fully capture the multiscale local features in the fused feature map of hyperspectral and LiDAR data. Finally, a multihead axial attention (MHAA) component is constructed to capture information at different scales in the fused feature maps, thereby accurately modeling global dependencies. The proposed HMAT has been extensively tested on three publicly available datasets. The experimental results demonstrate that the classification performance of the proposed method outperforms that of several state-of-the-art methods.
引用
收藏
页数:17
相关论文