UNeXt: An Efficient Network for the Semantic Segmentation of High-Resolution Remote Sensing Images

Cited by: 1
Authors
Chang, Zhanyuan [1]
Xu, Mingyu [1]
Wei, Yuwen [1]
Lian, Jie [1]
Zhang, Chongming [1]
Li, Chuanjiang [1]
Affiliations
[1] Shanghai Normal University, College of Information, Mechanical and Electrical Engineering, Shanghai 200234, People's Republic of China
Funding
Natural Science Foundation of Shanghai
Keywords
high-resolution remote sensing images; real-time semantic segmentation; convolutional attention; global-local context; transformer;
DOI
10.3390/s24206655
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
The application of deep neural networks to the semantic segmentation of remote sensing images is a significant research area within the intelligent interpretation of remote sensing data. The semantic segmentation of remote sensing images holds great practical value in urban planning, disaster assessment, the estimation of carbon sinks, and other related fields. With the continuous advancement of remote sensing technology, the spatial resolution of remote sensing images is gradually increasing. This increase in resolution brings challenges such as significant changes in the scale of ground objects, redundant information, and irregular shapes within remote sensing images. Current methods leverage Transformers to capture global long-range dependencies. However, Transformers introduce higher computational complexity and are prone to losing local details. In this paper, we propose UNeXt (UNet+ConvNeXt+Transformer), a real-time semantic segmentation model tailored for high-resolution remote sensing images. To achieve efficient segmentation, UNeXt uses the lightweight ConvNeXt-T as the encoder and a lightweight decoder, Transnext, which combines a Transformer and a convolutional neural network (CNN) to capture global information while avoiding the loss of local details. Furthermore, to utilize spatial and channel information more effectively, we propose an SCFB (SC Feature Fuse Block) that reduces computational complexity while enhancing the model's recognition of complex scenes. A series of ablation experiments and comprehensive comparative experiments demonstrate that our method not only runs faster than state-of-the-art (SOTA) lightweight models but also achieves higher accuracy. Specifically, our proposed UNeXt achieves mIoUs of 85.2% and 82.9% on the Vaihingen and Gaofen5 (GID5) datasets, respectively, while maintaining 97 fps for 512 × 512 inputs on a single NVIDIA RTX 4090 GPU, outperforming other SOTA methods.
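The abstract reports accuracy as mIoU (mean intersection over union). As a rough illustration of how that metric is commonly computed from a confusion matrix, here is a minimal NumPy sketch. It is not the authors' evaluation code; the function names `confusion_matrix` and `mean_iou` and the toy label arrays are assumptions made for this example.

```python
import numpy as np


def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels by (true class, predicted class) pair."""
    y_true = np.asarray(y_true).ravel()
    y_pred = np.asarray(y_pred).ravel()
    # Encode each (true, pred) pair as a single index, then histogram.
    idx = y_true * num_classes + y_pred
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)


def mean_iou(cm):
    """Average per-class IoU = TP / (TP + FP + FN), skipping absent classes."""
    tp = np.diag(cm).astype(float)
    union = cm.sum(axis=0) + cm.sum(axis=1) - tp
    valid = union > 0  # ignore classes that never occur in truth or prediction
    return float((tp[valid] / union[valid]).mean())


# Toy 2 x 3 label maps with three classes (hypothetical data).
y_true = np.array([[0, 0, 1], [1, 2, 2]])
y_pred = np.array([[0, 1, 1], [1, 2, 0]])

cm = confusion_matrix(y_true, y_pred, num_classes=3)
miou = mean_iou(cm)
print(f"mIoU = {miou:.3f}")
```

On the toy arrays the per-class IoUs are 1/3, 2/3, and 1/2, so the mean is 0.5. Real benchmarks typically accumulate the confusion matrix over all test tiles before averaging, and may exclude a background or clutter class.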
Pages: 18
Related papers
50 items in total
  • [31] CHANGE DETECTION OF BUILDINGS WITH THE UTILIZATION OF A DEEP BELIEF NETWORK AND HIGH-RESOLUTION REMOTE SENSING IMAGES
    Huang, Fenghua
    Shen, Guiping
    Hong, Huiqun
    Wei, Liying
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2022, 30 (10)
  • [32] Disparity Estimation of High-Resolution Remote Sensing Images with Dual-Scale Matching Network
    He, Sheng
    Zhou, Ruqin
    Li, Shenhong
    Jiang, San
    Jiang, Wanshou
    REMOTE SENSING, 2021, 13 (24)
  • [33] Multi-scale Feature Fusion and Transformer Network for urban green space segmentation from high-resolution remote sensing images
    Cheng, Yong
    Wang, Wei
    Ren, Zhoupeng
    Zhao, Yingfen
    Liao, Yilan
    Ge, Yong
    Wang, Jun
    He, Jiaxin
    Gu, Yakang
    Wang, Yixuan
    Zhang, Wenjie
    Zhang, Ce
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 124
  • [34] MCAFNet: A Multiscale Channel Attention Fusion Network for Semantic Segmentation of Remote Sensing Images
    Yuan, Min
    Ren, Dingbang
    Feng, Qisheng
    Wang, Zhaobin
    Dong, Yongkang
    Lu, Fuxiang
    Wu, Xiaolin
    REMOTE SENSING, 2023, 15 (02)
  • [35] Multi-scale attention fusion network for semantic segmentation of remote sensing images
    Wen, Zhiqiang
    Huang, Hongxu
    Liu, Shuai
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (24) : 7909 - 7926
  • [36] A Novel Transformer Based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing Images
    Wang, Libo
    Li, Rui
    Duan, Chenxi
    Zhang, Ce
    Meng, Xiaoliang
    Fang, Shenghui
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [37] Improved Anchor-Free Instance Segmentation for Building Extraction from High-Resolution Remote Sensing Images
    Wu, Tong
    Hu, Yuan
    Peng, Ling
    Chen, Ruonan
    REMOTE SENSING, 2020, 12 (18)
  • [38] A CNN-Transformer Network Combining CBAM for Change Detection in High-Resolution Remote Sensing Images
    Yin, Mengmeng
    Chen, Zhibo
    Zhang, Chengjian
    REMOTE SENSING, 2023, 15 (09)
  • [39] OCANet: An Overcomplete Convolutional Attention Network for Building Extraction From High-Resolution Remote Sensing Images
    Zhang, Bo
    Huang, Jiajia
    Wu, Fan
    Zhang, Wenjuan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 18427 - 18443
  • [40] Calculation of the optimal segmentation scale in object-based multiresolution segmentation based on the scene complexity of high-resolution remote sensing images
    Feng, Tianjing
    Ma, Hairong
    Cheng, Xinwen
    Zhang, Hongping
    JOURNAL OF APPLIED REMOTE SENSING, 2018, 12 (02)