CSTFNet: A CNN and Dual Swin-Transformer Fusion Network for Remote Sensing Hyperspectral Data Fusion and Classification of Coastal Areas

被引:0
|
作者
Li, Dekai [1 ]
Neira-Molina, Harold [2 ]
Huang, Mengxing [1 ]
Syam, M. S. [3 ,4 ,5 ]
Yu, Zhang [1 ,6 ]
Zhang, Junfeng [1 ]
Bhatti, Uzair Aslam [1 ]
Asif, Muhammad [7 ]
Sarhan, Nadia [8 ]
Awwad, Emad Mahrous [9 ]
机构
[1] Hainan Univ, Sch Informat & Commun Engn, Haikou 570228, Peoples R China
[2] Univ Costa, Dept Comp Sci & Elect, Barranquilla 080002, Colombia
[3] Jingchu Univ Technol, Sch Artificial Intelligence, Jingmen 448000, Peoples R China
[4] Jingchu Univ Technol, Jingmen Cryptometry Applicat Technol Res Ctr, Jingmen 448000, Peoples R China
[5] Jingchu Univ Technol, Internet Intelligences Applicat Innovat Res Ctr, Jingmen 448000, Peoples R China
[6] Hainan Univ, Sch Comp Sci & Technol, Haikou 570228, Peoples R China
[7] Hunan Univ Sci & Engn, Sch Media, Yongzhou 425199, Peoples R China
[8] King Saud Univ, Coll Business Adm, Dept Quantitat Anal, Riyadh 11421, Saudi Arabia
[9] King Saud Univ, Coll Engn, Dept Elect Engn, Riyadh 11421, Saudi Arabia
基金
美国国家科学基金会;
关键词
Feature extraction; Computational modeling; Transformers; Convolutional neural networks; Accuracy; Hyperspectral imaging; Vectors; Three-dimensional displays; Sea measurements; Faces; Convolutional neural network (CNN); hyperspectral image; remote sensing; Swin transformer; IMAGE CLASSIFICATION;
D O I
10.1109/JSTARS.2025.3530935
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Hyperspectral imaging (HSI) can capture a large amount of spectral information at various wavelengths, enabling detailed material classification and identification, making it a key tool in remote sensing, particularly for coastal area monitoring. In recent years, the convolutional neural network (CNN) framework and transformer models have demonstrated strong performance in HSI classification, especially in applications requiring precise change detection and analysis. However, due to the high dimensionality of HSI data and the complexity of spectral-spatial feature extraction, achieving accurate results in coastal areas remains challenging. This article introduces a new hybrid model, CSTFNet, which combines an improved CNN module and dual-layer Swin transformer (DLST) to tackle these challenges. CSTFNet integrates spectral and spatial processing capabilities, significantly reducing computational complexity while maintaining high classification accuracy. The improved CNN module employs one-dimensional convolutions to handle high-dimensional data, while the DLST module uses window-based multihead attention to capture both local and global dependencies. Experiments conducted on four standard HSI datasets (Houston-2013, Samson, KSC, and Botswana) demonstrate that CSTFNet outperforms traditional and state-of-the-art algorithms, achieving overall classification accuracy exceeding 99% . In particular, on the Houston-2013 dataset, the results for OA and AA are 1.00 and the kappa coefficient is 0. 976. The results highlight the robustness and efficiency of the proposed model in coastal area applications, where accurate and reliable spectral-spatial classification is crucial for monitoring and environmental management.
引用
收藏
页码:5853 / 5865
页数:13
相关论文
共 50 条
  • [21] Multiscale Fusion Transformer Network for Hyperspectral Image Classification
    Yuquan Gan
    Hao Zhang
    Chen Yi
    Journal of Beijing Institute of Technology, 2024, (03) : 255 - 270
  • [22] Multiscale Fusion Transformer Network for Hyperspectral Image Classification
    Gan, Yuquan
    Zhang, Hao
    Yi, Chen
    Journal of Beijing Institute of Technology (English Edition), 2024, 33 (03): : 255 - 270
  • [23] Multimodal Fusion Transformer for Remote Sensing Image Classification
    Roy, Swalpa Kumar
    Deria, Ankur
    Hong, Danfeng
    Rasti, Behnood
    Plaza, Antonio
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [24] Microscopic Hyperspectral Image Classification Based on Fusion Transformer With Parallel CNN
    Zeng, Weijia
    Li, Wei
    Zhang, Mengmeng
    Wang, Hao
    Lv, Meng
    Yang, Yue
    Tao, Ran
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (06) : 2910 - 2921
  • [25] TGF-Net: Transformer and gist CNN fusion network for multi-modal remote sensing image classification
    Wang, Huiqing
    Wang, Huajun
    Wu, Linfen
    PLOS ONE, 2025, 20 (02):
  • [26] A Lightweight Dual-Branch Swin Transformer for Remote Sensing Scene Classification
    Zheng, Fujian
    Lin, Shuai
    Zhou, Wei
    Huang, Hong
    REMOTE SENSING, 2023, 15 (11)
  • [27] Dual-Branch Feature Fusion Network Based Cross-Modal Enhanced CNN and Transformer for Hyperspectral and LiDAR Classification
    Wang, Wuli
    Li, Chong
    Ren, Peng
    Lu, Xinchao
    Wang, Jianbu
    Ren, Guangbo
    Liu, Baodi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [28] CTFNet: CNN-Transformer Fusion Network for Remote-Sensing Image Semantic Segmentation
    Wu, Honglin
    Huang, Peng
    Zhang, Min
    Tang, Wenlong
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [29] CMTFNet: CNN and Multiscale Transformer Fusion Network for Remote-Sensing Image Semantic Segmentation
    Wu, Honglin
    Huang, Peng
    Zhang, Min
    Tang, Wenlong
    Yu, Xinyu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [30] Scale Adaptive Fusion Network for Multimodal Remote Sensing Data Classification
    Liu, Xiaomin
    Yu, Mengjun
    Qiao, Zhenzhuang
    Wang, Haoyu
    Xing, Changda
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (09): : 3693 - 3702