UNeXt: An Efficient Network for the Semantic Segmentation of High-Resolution Remote Sensing Images

Cited by: 1
Authors
Chang, Zhanyuan [1]
Xu, Mingyu [1]
Wei, Yuwen [1]
Lian, Jie [1]
Zhang, Chongming [1]
Li, Chuanjiang [1]
Affiliation
[1] Shanghai Normal Univ, Coll Informat Mech & Elect Engn, Shanghai 200234, Peoples R China
Funding
Natural Science Foundation of Shanghai
Keywords
high-resolution remote sensing images; real-time semantic segmentation; convolutional attention; global-local context; transformer
DOI
10.3390/s24206655
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Applying deep neural networks to the semantic segmentation of remote sensing images is a significant research area within the intelligent interpretation of remote sensing data. Semantic segmentation of remote sensing images has great practical value in urban planning, disaster assessment, the estimation of carbon sinks, and related fields. As remote sensing technology advances, the spatial resolution of remote sensing images keeps increasing. Higher resolution brings challenges such as large variations in the scale of ground objects, redundant information, and irregular object shapes. Current methods leverage Transformers to capture global long-range dependencies. However, Transformers introduce higher computational complexity and are prone to losing local details. In this paper, we propose UNeXt (UNet+ConvNeXt+Transformer), a real-time semantic segmentation model tailored for high-resolution remote sensing images. To achieve efficient segmentation, UNeXt uses the lightweight ConvNeXt-T as the encoder and a lightweight decoder, Transnext, which combines a Transformer with a convolutional neural network (CNN) to capture global information while avoiding the loss of local details. Furthermore, to use spatial and channel information more effectively, we propose an SCFB (SC Feature Fuse Block) that reduces computational complexity while improving the model's recognition of complex scenes. A series of ablation experiments and comprehensive comparative experiments demonstrate that our method not only runs faster than state-of-the-art (SOTA) lightweight models but also achieves higher accuracy. Specifically, UNeXt achieves mIoU scores of 85.2% and 82.9% on the Vaihingen and Gaofen5 (GID5) datasets, respectively, while maintaining 97 fps for 512 × 512 inputs on a single NVIDIA RTX 4090 GPU, outperforming other SOTA methods.
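The accuracy figures in the abstract are reported as mIoU (mean intersection over union). As a minimal sketch of how that metric is commonly computed from flattened per-pixel class labels, the following may help; the function name `mean_iou` and the choice to skip classes absent from both prediction and ground truth are assumptions for illustration, and the exact evaluation protocol used for UNeXt (for example, which classes are ignored) is not stated in this record.

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Mean IoU over the classes that occur in pred or target.

    Illustrative helper (hypothetical name), not the evaluation code of the paper.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    intersection = [0] * num_classes
    pred_count = [0] * num_classes
    target_count = [0] * num_classes

    # Accumulate per-class pixel counts in a single pass.
    for p, t in zip(pred, target):
        pred_count[p] += 1
        target_count[t] += 1
        if p == t:
            intersection[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_count[c] + target_count[c] - intersection[c]
        if union > 0:  # assumption: classes absent from both are skipped
            ious.append(intersection[c] / union)

    if not ious:
        raise ValueError("no class is present in pred or target")
    return sum(ious) / len(ious)


if __name__ == "__main__":
    # Six pixels, three classes; per-class IoU is 1/3, 2/3 and 1/2.
    print(mean_iou([0, 0, 1, 1, 2, 2], [0, 1, 1, 1, 2, 0], num_classes=3))
```

For a 2-D label map, the rows would be flattened into one sequence before calling the function.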
Pages: 18