Advancing high-resolution remote sensing: a compact and powerful approach to semantic segmentation

被引:0
作者
Zhang, Hua [1 ,2 ,3 ]
Jiang, Zhengang [1 ]
Xu, Jun [3 ]
Pan, Xin [2 ,3 ]
机构
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Changchun Inst Technol, Sch Comp Technol & Engn, Changchun, Peoples R China
[3] Changchun Inst Technol, Jilin Key Lab Changbai Hist Culture & VR Technol R, Changchun, Peoples R China
关键词
Remote sensing; image analysis; neural networks; semantic; CONVOLUTIONAL NEURAL-NETWORK; EXTRACTION;
D O I
10.1080/01431161.2024.2398226
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Deep learning (DL)-based approaches are notable for their ability to establish feature associations without relying on physical constraints, unlike traditional strategies that are complex and dependent on expert experience. However, three main challenges hinder the versatility of semantic segmentation models. First, the targets in these images are dense and exist at varying spatial scales, which imposes higher demands on the model for accurate segmentation across scales. Second, the segmentation of small targets in the images is often overlooked, leading to a compromise between fine segmentation and model efficiency. Lastly, the data-intensive nature of remote sensing images and the resource-intensive operations of large-scale networks impose significant communication and computation burdens on edge devices, which may not have sufficient resources to handle them effectively. To address these challenges, this paper proposes a lightweight semantic segmentation method for remote sensing images to achieve high-precision segmentation for multi-scale targets while maintaining low computational complexity. The main components include: (1) embedding the inverted residual block structure to minimize the number of model parameters and computational costs; (2) introducing the parallel irregular space pyramid pooling module to efficiently aggregate multi-scale contextual information for fine-grained recognition of small targets; and (3) embedding transfer learning into the encoder-decoder structure to speed up the convergence rate and improve multi-scale feature fusion capability, thereby reducing semantic information loss. The proposed lightweight method has been extensively tested on real-world high-resolution remote sensing datasets. It achieved PA, MPA, MIoU, and FWIoU scores of 87.90%, 75.76%, 66.29%, and 78.81% on the Vaihingen dataset; 87.03%, 85.31%, 74.85%, and 77.54% on the Potsdam dataset; and 95.37%, 83.33%, 75.70%, and 91.31% on the Aeroscapes dataset. Compared to other popular semantic segmentation models, the proposed method achieved the highest values in all four evaluation indicators, demonstrating its effectiveness and superiority.
引用
收藏
页码:6860 / 6883
页数:24
相关论文
共 58 条
  • [1] VNet: An End-to-End Fully Convolutional Neural Network for Road Extraction From High-Resolution Remote Sensing Data
    Abdollahi, Abolfazl
    Pradhan, Biswajeet
    Alamri, Abdullah
    [J]. IEEE ACCESS, 2020, 8 : 179424 - 179436
  • [2] Remote sensing and GIS techniques for reconstructing the military fort system on the Roman boundary (Tunisian section) and identifying archaeological sites
    Bachagha, Nabil
    Wang, Xinyuan
    Luo, Lei
    Li, Li
    Khatteli, Houcine
    Lasaponara, Rosa
    [J]. REMOTE SENSING OF ENVIRONMENT, 2020, 236
  • [3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [4] Transformer-Based Masked Autoencoder With Contrastive Loss for Hyperspectral Image Classification
    Cao, Xianghai
    Lin, Haifeng
    Guo, Shuaixu
    Xiong, Tao
    Jiao, Licheng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [5] MSF-Net: A Multiscale Supervised Fusion Network for Building Change Detection in High-Resolution Remote Sensing Images
    Chen, Jiahao
    Fan, Junfu
    Zhang, Mengzhen
    Zhou, Yuke
    Shen, Chen
    [J]. IEEE ACCESS, 2022, 10 : 30925 - 30938
  • [6] Chen LC, 2016, Arxiv, DOI [arXiv:1412.7062, DOI 10.48550/ARXIV.1412.7062]
  • [7] Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
  • [8] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [9] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [10] Free lunch for federated remote sensing target fine-grained classification: A parameter-efficient framework
    Chen, Shengchao
    Shu, Ting
    Zhao, Huan
    Wang, Jiahao
    Ren, Sufen
    Yang, Lina
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 294