Multiscale Attention Feature Fusion Based on Improved Transformer for Hyperspectral Image and LiDAR Data Classification

被引:0
|
作者
Wang, Aili [1 ]
Lei, Guilong [1 ]
Dai, Shiyu [1 ]
Wu, Haibin [1 ]
Iwahori, Yuji [2 ]
机构
[1] Harbin Univ Sci & Technol, Coll Measurement & Control Technol & Commun Engn, Heilongjiang Prov Key Lab Laser Spect Technol & Ap, Harbin 150080, Peoples R China
[2] Chubu Univ, Dept Comp Sci, Kasugai 4878501, Japan
关键词
Feature extraction; Transformers; Laser radar; Data mining; Convolutional neural networks; Convolution; Hyperspectral imaging; Correlation; Computer vision; Training; Hyperspectral image (HSI); interaction transformer; light detection and ranging (LiDAR); multisource data classification; three-dimensional convolutional neural network (3D-CNN);
D O I
10.1109/JSTARS.2024.3524443
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the uninterrupted evolution of remote sensing data, the list of available data sources has expanded, effectively utilizing useful information from multiple sources for better land surface observation, which has become an intriguing and challenging problem. However, the complexity of urban areas and their surrounding structures makes it extremely difficult to capture correlations between features. This article proposes a novel multiscale attention feature fusion network, composed of hierarchical convolutional neural networks and transformer to enhance joint classification accuracy of hyperspectral image (HSI) and light detection and ranging (LiDAR) data. First, a multiscale fusion Swin transformer module is employed to eliminate information loss in feature propagation, which explores deep spatial-spectral features of HSI while extracting height information from LiDAR data. This structure combines the advantages of the Swin transformer, featuring a nonlocal receptive field fusion by progressively expanding the window's receptive field layer by layer while preserving the spatial features of the image. It also exhibits excellent robustness against spatial misalignment. For the dual branches of hyperspectral and LiDAR, a dual-source feature interactor is designed, which facilitates interaction between hyperspectral and LiDAR features by establishing a dynamic attention mechanism, which effectively captures correlated information between the two modalities and fuses it into a unified feature representation. The efficacy of the proposed approach is validated using three standard datasets (Huston2013, Trento, and MUUFL) in the experiments. The classification results indicate that the proposed framework, by fully utilizing spatial context information and effectively integrating feature information, significantly outperforms state-of-the-art classification methods.
引用
收藏
页码:4124 / 4140
页数:17
相关论文
共 50 条
  • [1] MCFT: Multimodal Contrastive Fusion Transformer for Classification of Hyperspectral Image and LiDAR Data
    Feng, Yining
    Jin, Jiarui
    Yin, Yin
    Song, Chuanming
    Wang, Xianghai
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [2] Modality Fusion Vision Transformer for Hyperspectral and LiDAR Data Collaborative Classification
    Yang, Bin
    Wang, Xuan
    Xing, Ying
    Cheng, Chen
    Jiang, Weiwei
    Feng, Quanlong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 17052 - 17065
  • [3] Multilevel Feature Gated Fusion Based Spatial and Frequency Domain Attention Network for Joint Classification of Hyperspectral and LiDAR Data
    Shi, Cuiping
    Zhong, Zhipeng
    Ding, Shihang
    Lei, Yeqi
    Wang, Liguo
    Jin, Zhan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 5960 - 5974
  • [4] Multiview Feature Learning and Multilevel Information Fusion for Joint Classification of Hyperspectral and LiDAR Data
    Feng, Jia
    Zhang, Junping
    Zhang, Ye
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [5] Attention Multihop Graph and Multiscale Convolutional Fusion Network for Hyperspectral Image Classification
    Zhou, Hao
    Luo, Fulin
    Zhuang, Huiping
    Weng, Zhenyu
    Gong, Xiuwen
    Lin, Zhiping
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [6] Hyperspectral Image Classification Based on Multibranch Attention Transformer Networks
    Bai, Jing
    Wen, Zheng
    Xiao, Zhu
    Ye, Fawang
    Zhu, Yongdong
    Alazab, Mamoun
    Jiao, Licheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [7] A Cross-Attention-Based Multi-Information Fusion Transformer for Hyperspectral Image Classification
    Yang, Jinghui
    Li, Anqi
    Qian, Jinxi
    Qin, Jia
    Wang, Liguo
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 13358 - 13375
  • [8] Multiscale and Multidirection Feature Extraction Network for Hyperspectral and LiDAR Classification
    Liu, Yi
    Ye, Zhen
    Xi, Yongqiang
    Liu, Huan
    Li, Wei
    Bai, Lin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 9961 - 9973
  • [9] Joint Classification of Hyperspectral and LiDAR Data Using a Hierarchical CNN and Transformer
    Zhao, Guangrui
    Ye, Qiaolin
    Sun, Le
    Wu, Zebin
    Pan, Chengsheng
    Jeon, Byeungwoo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [10] Spectral Feature Fusion Networks With Dual Attention for Hyperspectral Image Classification
    Li, Xian
    Ding, Mingli
    Pizurica, Aleksandra
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60