UNeXt: An Efficient Network for the Semantic Segmentation of High-Resolution Remote Sensing Images

被引:1
|
作者
Chang, Zhanyuan [1 ]
Xu, Mingyu [1 ]
Wei, Yuwen [1 ]
Lian, Jie [1 ]
Zhang, Chongming [1 ]
Li, Chuanjiang [1 ]
机构
[1] Shanghai Normal Univ, Coll Informat Mech & Elect Engn, Shanghai 200234, Peoples R China
基金
上海市自然科学基金;
关键词
high-resolution remote sensing images; real-time semantic segmentation; convolutional attention; global-local context; transformer;
D O I
10.3390/s24206655
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The application of deep neural networks for the semantic segmentation of remote sensing images is a significant research area within the field of the intelligent interpretation of remote sensing data. The semantic segmentation of remote sensing images holds great practical value in urban planning, disaster assessment, the estimation of carbon sinks, and other related fields. With the continuous advancement of remote sensing technology, the spatial resolution of remote sensing images is gradually increasing. This increase in resolution brings about challenges such as significant changes in the scale of ground objects, redundant information, and irregular shapes within remote sensing images. Current methods leverage Transformers to capture global long-range dependencies. However, the use of Transformers introduces higher computational complexity and is prone to losing local details. In this paper, we propose UNeXt (UNet+ConvNeXt+Transformer), a real-time semantic segmentation model tailored for high-resolution remote sensing images. To achieve efficient segmentation, UNeXt uses the lightweight ConvNeXt-T as the encoder and a lightweight decoder, Transnext, which combines a Transformer and CNN (Convolutional Neural Networks) to capture global information while avoiding the loss of local details. Furthermore, in order to more effectively utilize spatial and channel information, we propose a SCFB (SC Feature Fuse Block) to reduce computational complexity while enhancing the model's recognition of complex scenes. A series of ablation experiments and comprehensive comparative experiments demonstrate that our method not only runs faster than state-of-the-art (SOTA) lightweight models but also achieves higher accuracy. Specifically, our proposed UNeXt achieves 85.2% and 82.9% mIoUs on the Vaihingen and Gaofen5 (GID5) datasets, respectively, while maintaining 97 fps for 512 x 512 inputs on a single NVIDIA GTX 4090 GPU, outperforming other SOTA methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Cascaded CNN and global-local attention transformer network-based semantic segmentation for high-resolution remote sensing image
    Liu, Xiaohui
    Zhang, Lei
    Wang, Rui
    Li, Xiaoyu
    Xu, Jiyang
    Lu, Xiaochen
    JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (03)
  • [22] A Transformer-based multi-modal fusion network for semantic segmentation of high-resolution remote sensing imagery
    Liu, Yutong
    Gao, Kun
    Wang, Hong
    Yang, Zhijia
    Wang, Pengyu
    Ji, Shijing
    Huang, Yanjun
    Zhu, Zhenyu
    Zhao, Xiaobin
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 133
  • [23] Integrating Spatial Details With Long-Range Contexts for Semantic Segmentation of Very High-Resolution Remote-Sensing Images
    Long, Jiang
    Li, Mengmeng
    Wang, Xiaoqin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [24] Semi-Supervised Adversarial Semantic Segmentation Network Using Transformer and Multiscale Convolution for High-Resolution Remote Sensing Imagery
    Zheng, Yalan
    Yang, Mengyuan
    Wang, Min
    Qian, Xiaojun
    Yang, Rui
    Zhang, Xin
    Dong, Wen
    REMOTE SENSING, 2022, 14 (08)
  • [25] EFFICIENT SEMANTIC SEGMENTATION METHOD WITH STRIP POOLING FOR VHR REMOTE SENSING IMAGES
    Sheng, Yifan
    Yang, Junli
    Lin, Youguang
    Lei, Yu
    2021 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM IGARSS, 2021, : 2759 - 2762
  • [26] FURSformer: Semantic Segmentation Network for Remote Sensing Images with Fused Heterogeneous Features
    Zhang, Zehua
    Liu, Bailin
    Li, Yani
    ELECTRONICS, 2023, 12 (14)
  • [27] MATNet: multiattention Transformer network for cropland semantic segmentation in remote sensing images
    Zhang, Zixuan
    Huang, Liang
    Tang, Bo-Hui
    Le, Weipeng
    Wang, Meiqi
    Cheng, Jiapei
    Wu, Qiang
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [28] DEEP HIERARCHICAL REPRESENTATION AND SEGMENTATION OF HIGH RESOLUTION REMOTE SENSING IMAGES
    Wang, Jun
    Qin, Qiming
    Li, Zhoujing
    Ye, Xin
    Wang, Jianhua
    Yang, Xiucheng
    Qin, Xuebin
    2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 4320 - 4323
  • [29] A Novel Hybrid Method for Urban Green Space Segmentation from High-Resolution Remote Sensing Images
    Wang, Wei
    Cheng, Yong
    Ren, Zhoupeng
    He, Jiaxin
    Zhao, Yingfen
    Wang, Jun
    Zhang, Wenjie
    REMOTE SENSING, 2023, 15 (23)
  • [30] IRA-MRSNet: A Network Model for Change Detection in High-Resolution Remote Sensing Images
    Ling, Jie
    Hu, Lei
    Cheng, Lang
    Chen, Minghui
    Yang, Xin
    REMOTE SENSING, 2022, 14 (21)