UNeXt: An Efficient Network for the Semantic Segmentation of High-Resolution Remote Sensing Images

被引:1
|
作者
Chang, Zhanyuan [1 ]
Xu, Mingyu [1 ]
Wei, Yuwen [1 ]
Lian, Jie [1 ]
Zhang, Chongming [1 ]
Li, Chuanjiang [1 ]
机构
[1] Shanghai Normal Univ, Coll Informat Mech & Elect Engn, Shanghai 200234, Peoples R China
基金
上海市自然科学基金;
关键词
high-resolution remote sensing images; real-time semantic segmentation; convolutional attention; global-local context; transformer;
D O I
10.3390/s24206655
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The application of deep neural networks for the semantic segmentation of remote sensing images is a significant research area within the field of the intelligent interpretation of remote sensing data. The semantic segmentation of remote sensing images holds great practical value in urban planning, disaster assessment, the estimation of carbon sinks, and other related fields. With the continuous advancement of remote sensing technology, the spatial resolution of remote sensing images is gradually increasing. This increase in resolution brings about challenges such as significant changes in the scale of ground objects, redundant information, and irregular shapes within remote sensing images. Current methods leverage Transformers to capture global long-range dependencies. However, the use of Transformers introduces higher computational complexity and is prone to losing local details. In this paper, we propose UNeXt (UNet+ConvNeXt+Transformer), a real-time semantic segmentation model tailored for high-resolution remote sensing images. To achieve efficient segmentation, UNeXt uses the lightweight ConvNeXt-T as the encoder and a lightweight decoder, Transnext, which combines a Transformer and CNN (Convolutional Neural Networks) to capture global information while avoiding the loss of local details. Furthermore, in order to more effectively utilize spatial and channel information, we propose a SCFB (SC Feature Fuse Block) to reduce computational complexity while enhancing the model's recognition of complex scenes. A series of ablation experiments and comprehensive comparative experiments demonstrate that our method not only runs faster than state-of-the-art (SOTA) lightweight models but also achieves higher accuracy. Specifically, our proposed UNeXt achieves 85.2% and 82.9% mIoUs on the Vaihingen and Gaofen5 (GID5) datasets, respectively, while maintaining 97 fps for 512 x 512 inputs on a single NVIDIA GTX 4090 GPU, outperforming other SOTA methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Fully convolutional DenseNet with adversarial training for semantic segmentation of high-resolution remote sensing images
    Guo, Xuejun
    Chen, Zehua
    Wang, Chengyi
    JOURNAL OF APPLIED REMOTE SENSING, 2021, 15 (01)
  • [42] Global Multi-Attention UResNeXt for Semantic Segmentation of High-Resolution Remote Sensing Images
    Chen, Zhong
    Zhao, Jun
    Deng, He
    REMOTE SENSING, 2023, 15 (07)
  • [43] RAANet: A Residual ASPP with Attention Framework for Semantic Segmentation of High-Resolution Remote Sensing Images
    Liu, Runrui
    Tao, Fei
    Liu, Xintao
    Na, Jiaming
    Leng, Hongjun
    Wu, Junjie
    Zhou, Tong
    REMOTE SENSING, 2022, 14 (13)
  • [44] Spatial-specific Transformer with involution for semantic segmentation of high-resolution remote sensing images
    Wu, Xinjia
    Zhang, Jing
    Li, Wensheng
    Li, Jiafeng
    Zhuo, Li
    Zhang, Jie
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (04) : 1280 - 1307
  • [45] Enhanced Lightweight End-to-End Semantic Segmentation for High-Resolution Remote Sensing Images
    Dong, He
    Yu, Baoguo
    Wu, Wanqing
    He, Chenglong
    IEEE Access, 2022, 10 : 70947 - 70954
  • [46] Unsupervised Multi-Scale Hybrid Feature Extraction Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Song, Wanying
    Nie, Fangxin
    Wang, Chi
    Jiang, Yinyin
    Wu, Yan
    REMOTE SENSING, 2024, 16 (20)
  • [47] CIMFNet: Cross-Layer Interaction and Multiscale Fusion Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Zhou, Wujie
    Jin, Jianhui
    Lei, Jingsheng
    Yu, Lu
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (04) : 666 - 676
  • [48] Multiscale Feature Weighted-Aggregating and Boundary Enhancement Network for Semantic Segmentation of High-Resolution Remote Sensing Images
    Zhao, Yingying
    Zheng, Guizhou
    Xu, Zhangyan
    Qiu, Zhonghang
    Chen, Zhixing
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 8118 - 8130
  • [49] Scale Sensitive Neural Network for Road Segmentation in High-Resolution Remote Sensing Images
    Tan, Xiaowei
    Xiao, Zhifeng
    Wan, Qiao
    Shao, Weiping
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (03) : 533 - 537
  • [50] Semantic segmentation of high-resolution images
    Juhong WANG
    Bin LIU
    Kun XU
    Science China(Information Sciences), 2017, 60 (12) : 256 - 261