UNeXt: An Efficient Network for the Semantic Segmentation of High-Resolution Remote Sensing Images

Times cited: 1

Authors:

Chang, Zhanyuan [1]

Xu, Mingyu [1]

Wei, Yuwen [1]

Lian, Jie [1]

Zhang, Chongming [1]

Li, Chuanjiang [1]

Affiliations:

[1] Shanghai Normal Univ, Coll Informat Mech & Elect Engn, Shanghai 200234, Peoples R China

Source:

SENSORS | 2024 / Vol. 24 / Issue 20

Funding:

Natural Science Foundation of Shanghai;

Keywords:

high-resolution remote sensing images; real-time semantic segmentation; convolutional attention; global-local context; transformer;

DOI:

10.3390/s24206655

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry];

Discipline classification codes:

070302 ; 081704 ;

Abstract:

The application of deep neural networks to the semantic segmentation of remote sensing images is a significant research area within the intelligent interpretation of remote sensing data. Semantic segmentation of remote sensing images has great practical value in urban planning, disaster assessment, the estimation of carbon sinks, and other related fields. As remote sensing technology advances, the spatial resolution of remote sensing images keeps increasing. Higher resolution brings challenges such as large variations in the scale of ground objects, redundant information, and irregular object shapes. Current methods leverage Transformers to capture global long-range dependencies. However, Transformers introduce higher computational complexity and are prone to losing local details. In this paper, we propose UNeXt (UNet+ConvNeXt+Transformer), a real-time semantic segmentation model tailored for high-resolution remote sensing images. To achieve efficient segmentation, UNeXt uses the lightweight ConvNeXt-T as the encoder and a lightweight decoder, Transnext, which combines a Transformer and a convolutional neural network (CNN) to capture global information while avoiding the loss of local details. Furthermore, to use spatial and channel information more effectively, we propose an SCFB (SC Feature Fuse Block) that reduces computational complexity while enhancing the model's recognition of complex scenes. A series of ablation experiments and comprehensive comparative experiments demonstrate that our method not only runs faster than state-of-the-art (SOTA) lightweight models but also achieves higher accuracy. Specifically, our proposed UNeXt achieves mIoUs of 85.2% and 82.9% on the Vaihingen and Gaofen5 (GID5) datasets, respectively, while maintaining 97 fps for 512 × 512 inputs on a single NVIDIA RTX 4090 GPU, outperforming other SOTA methods.
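
The abstract reports accuracy as mIoU (mean intersection over union). As a reading aid, the following is a minimal sketch of how that metric is commonly computed from a pixel-level confusion matrix. It is not taken from the paper: the function names `confusion_matrix` and `mean_iou`, the toy labels, and the choice to skip classes that appear in neither the ground truth nor the prediction are all assumptions made for illustration.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels: rows are ground-truth classes, columns are predicted classes."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def mean_iou(cm):
    """Average the per-class IoU = TP / (TP + FP + FN).

    Assumption: classes absent from both ground truth and prediction are skipped.
    """
    n = len(cm)
    ious = []
    for c in range(n):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                        # pixels of class c predicted as another class
        fp = sum(cm[r][c] for r in range(n)) - tp   # pixels of other classes predicted as c
        denom = tp + fp + fn
        if denom > 0:
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


# Hypothetical toy data: flattened labels of a 2 x 3 image with 3 classes.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(y_true, y_pred, 3)
miou = mean_iou(cm)
print(f"mIoU = {miou:.3f}")
```

In this toy case the per-class IoUs are 1/3, 2/3, and 1/2, so the mean is 0.5. Evaluation protocols for specific benchmarks may exclude some classes (for example a background or clutter class), which this sketch does not model.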


Pages: 18

