DECT: Diffusion-Enhanced CNN-Transformer for Multisource Remote Sensing Data Classification

被引:0
作者
Zhang, Guanglian [1 ]
Zhang, Lan [1 ]
Zhang, Zhanxu [1 ]
Deng, Jiangwei [1 ]
Bian, Lifeng [2 ]
Yang, Chen [1 ,3 ]
机构
[1] Guizhou Univ, Coll Big Data & Informat Engn, Power Syst Engn Res Ctr, Minist Educ, Guiyang 550025, Peoples R China
[2] Fudan Univ, Frontier Inst Chip & Syst, Shanghai 200433, Peoples R China
[3] Guizhou Univ, China State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
基金
中国国家自然科学基金;
关键词
CNN; diffusion; light detection and ranging (LiDAR); multisource remote sensing (RS) classification; synthetic aperture radar (SAR); transformer; HYPERSPECTRAL IMAGE CLASSIFICATION;
D O I
10.1109/JSTARS.2024.3479212
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Methods for joint classification of hyperspectral images (HSIs) with high dimensionality and spectral correlation and other sensor data (e.g., optical, infrared, radar, etc.) are important directions in the field of remote sensing. To better learn the feature representation of diffusion features (HSI), the unsupervised global modeling property of diffusion is utilized to mine the potential features of HSI to obtain diffusion features as input data. In addition, to fuse HSI features, HSI diffusion features, and other data features, a three-input diffusion-enhanced CNN-transformer (DECT) network based on CNN and transformer is proposed for feature extraction and fusion. First, the primary features are extracted by hierarchical CNN after premodal fusion. Second, considering the high dimensionality of HSI, spectral pooling attention interaction is designed for feature extraction and aggregation of information from different attentions. Finally, the inverted bottleneck convolutional transformer is designed to aggregate multisource information to enhance feature reuse and aggregate local and contextual information. It is shown on three publicly available datasets that DECT outperforms current state-of-the-art methods.
引用
收藏
页码:19288 / 19301
页数:14
相关论文
共 44 条
  • [1] Amit T, 2022, Arxiv, DOI arXiv:2112.00390
  • [2] Baranchuk D., 2021, arXiv
  • [3] SpectralDiff: A Generative Framework for Hyperspectral Image Classification With Diffusion Models
    Chen, Ning
    Yue, Jun
    Fang, Leyuan
    Xia, Shaobo
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [4] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [5] S2ENet: Spatial-Spectral Cross-Modal Enhancement Network for Classification of Hyperspectral and LiDAR Data
    Fang, Sheng
    Li, Kaiyu
    Li, Zhe
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [6] Gader P. G. M., 2013, Tech. Rep. REP-2013-570
  • [7] Gao L., 2021, IEEE Transactions on Geoscience and Remote Sensing, DOI [DOI 10.1109/TGRS.2021.3130716, 10.1109/TGRS.2021.3130716]
  • [8] DUAL GRAPH CONVOLUTION JOINT DENSE NETWORKS FOR HYPERSPECTRAL AND LIDAR DATA CLASSIFICATION
    Guo, Fangming
    Li, Zhongwei
    Meng, Qiao
    Wang, Leiquan
    Zhang, Jie
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1141 - 1144
  • [9] Ho J., 2020, Adv Neural Inf Process Syst, P6840
  • [10] Cross-city matters: A multimodal remote sensing benchmark dataset for cross-city semantic segmentation using high-resolution domain adaptation networks
    Hong, Danfeng
    Zhang, Bing
    Li, Hao
    Li, Yuxuan
    Yao, Jing
    Li, Chenyu
    Werner, Martin
    Chanussot, Jocelyn
    Zipf, Alexander
    Zhu, Xiao Xiang
    [J]. REMOTE SENSING OF ENVIRONMENT, 2023, 299