Unsupervised Multi-Scale Hybrid Feature Extraction Network for Semantic Segmentation of High-Resolution Remote Sensing Images

Cited by: 2
Authors
Song, Wanying [1]
Nie, Fangxin [1]
Wang, Chi [1]
Jiang, Yinyin [1]
Wu, Yan [2]
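Affiliations
[1] Xi'an University of Science and Technology, School of Communication and Information Engineering, Xi'an Key Laboratory of Network Convergence Communication, Xi'an 710054, People's Republic of China
[2] Xidian University, School of Electronic Engineering, Xi'an 710071, People's Republic of China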
Funding
China Postdoctoral Science Foundation
Keywords
high-resolution remote sensing; unsupervised; semantic segmentation; global context information; fine-grained features; feature fusion;
DOI
10.3390/rs16203774
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Generating pixel-level annotations for semantic segmentation tasks of high-resolution remote sensing images is both time-consuming and labor-intensive, which has led to increased interest in unsupervised methods. Therefore, in this paper, we propose an unsupervised multi-scale hybrid feature extraction network based on the CNN-Transformer architecture, referred to as MSHFE-Net. The MSHFE-Net consists of three main modules: a Multi-Scale Pixel-Guided CNN Encoder, a Multi-Scale Aggregation Transformer Encoder, and a Parallel Attention Fusion Module. The Multi-Scale Pixel-Guided CNN Encoder is designed for multi-scale, fine-grained feature extraction in unsupervised tasks, efficiently recovering local spatial information in images. Meanwhile, the Multi-Scale Aggregation Transformer Encoder introduces a multi-scale aggregation module, which further enhances the unsupervised acquisition of multi-scale contextual information, obtaining global features with stronger feature representation. The Parallel Attention Fusion Module employs an attention mechanism to fuse global and local features in both channel and spatial dimensions in parallel, enriching the semantic relations extracted during unsupervised training and improving the performance of unsupervised semantic segmentation. K-means clustering is then performed on the fused features to achieve high-precision unsupervised semantic segmentation. Experiments with MSHFE-Net on the Potsdam and Vaihingen datasets demonstrate its effectiveness in significantly improving the accuracy of unsupervised semantic segmentation.
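The final step described in the abstract, K-means clustering of the fused per-pixel features, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `sq_dist` and `kmeans_segment`, the nested-list feature layout, the random-pixel initialization, and the iteration limit are all assumptions made for illustration, and the fused features are taken as given rather than produced by MSHFE-Net.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_segment(features, k, iters=20, seed=0):
    """Cluster per-pixel feature vectors into k groups (hypothetical helper).

    features: H x W grid (list of rows) of feature vectors (lists of floats),
              standing in for the fused features described in the abstract.
    Returns an H x W grid of integer cluster labels.
    """
    width = len(features[0])
    pixels = [vec for row in features for vec in row]

    # Assumption: initialize centers from k randomly chosen pixels.
    rng = random.Random(seed)
    centers = [list(c) for c in rng.sample(pixels, k)]

    labels = None
    for _ in range(iters):
        # Assignment step: each pixel joins its nearest center.
        new_labels = [
            min(range(k), key=lambda j: sq_dist(p, centers[j])) for p in pixels
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centers will not move
        labels = new_labels

        # Update step: each center moves to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(pixels, labels) if lab == j]
            if members:  # keep the old center if a cluster is empty
                centers[j] = [sum(col) / len(members) for col in zip(*members)]

    # Reshape the flat label list back into the image grid.
    return [labels[i:i + width] for i in range(0, len(labels), width)]


if __name__ == "__main__":
    # Toy 2 x 4 "image" with 2-D features: left half near 0, right half near 10.
    grid = [
        [[0.0, 0.1], [0.2, 0.0], [9.8, 10.0], [10.1, 9.9]],
        [[0.1, 0.0], [0.0, 0.2], [10.0, 10.2], [9.9, 10.1]],
    ]
    for row in kmeans_segment(grid, k=2):
        print(row)
```

The cluster indices returned are arbitrary, so in an unsupervised setting they would still need to be matched to semantic classes before accuracy can be measured.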
Pages: 20