UNeXt: An Efficient Network for the Semantic Segmentation of High-Resolution Remote Sensing Images

Cited by: 1

Authors:

Chang, Zhanyuan ^{[1]}

Xu, Mingyu ^{[1]}

Wei, Yuwen ^{[1]}

Lian, Jie ^{[1]}

Zhang, Chongming ^{[1]}

Li, Chuanjiang ^{[1]}

Affiliations:

[1] Shanghai Normal Univ, Coll Informat Mech & Elect Engn, Shanghai 200234, Peoples R China

Source:

SENSORS | 2024 / Vol. 24 / Issue 20

Funding:

Natural Science Foundation of Shanghai;

Keywords:

high-resolution remote sensing images; real-time semantic segmentation; convolutional attention; global-local context; transformer;

DOI:

10.3390/s24206655

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry];

Discipline classification codes:

070302; 081704;

Abstract:

The application of deep neural networks to the semantic segmentation of remote sensing images is a significant research area within the intelligent interpretation of remote sensing data. Semantic segmentation of remote sensing images has great practical value in urban planning, disaster assessment, carbon sink estimation, and other related fields. As remote sensing technology advances, the spatial resolution of remote sensing images continues to increase. Higher resolution brings challenges such as large variations in the scale of ground objects, redundant information, and irregular object shapes. Current methods leverage Transformers to capture global long-range dependencies. However, Transformers introduce higher computational complexity and are prone to losing local details. In this paper, we propose UNeXt (UNet+ConvNeXt+Transformer), a real-time semantic segmentation model tailored for high-resolution remote sensing images. To achieve efficient segmentation, UNeXt uses the lightweight ConvNeXt-T as the encoder and a lightweight decoder, Transnext, which combines a Transformer and a CNN (convolutional neural network) to capture global information while avoiding the loss of local details. Furthermore, to use spatial and channel information more effectively, we propose an SCFB (SC Feature Fuse Block) that reduces computational complexity while enhancing the model's recognition of complex scenes. A series of ablation experiments and comprehensive comparative experiments demonstrate that our method not only runs faster than state-of-the-art (SOTA) lightweight models but also achieves higher accuracy. Specifically, UNeXt achieves mIoUs of 85.2% and 82.9% on the Vaihingen and Gaofen5 (GID5) datasets, respectively, while maintaining 97 fps for 512 × 512 inputs on a single NVIDIA RTX 4090 GPU, outperforming other SOTA methods.
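
The accuracy figures in the abstract are reported as mIoU (mean intersection over union). As a reading aid, the following is a minimal sketch of the standard mIoU definition on flattened per-pixel class labels. It is not the authors' evaluation code: the function name `mean_iou` and the choice to skip classes absent from both prediction and ground truth are assumptions made for illustration, and this record does not state which classes the paper includes or ignores.

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Mean IoU over the classes that occur in the prediction or the ground truth.

    `pred` and `target` are flattened per-pixel class indices in [0, num_classes).
    Hypothetical helper for illustration only.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    intersection = [0] * num_classes  # pixels where pred == target == c
    pred_count = [0] * num_classes    # pixels predicted as class c
    target_count = [0] * num_classes  # pixels labelled as class c

    for p, t in zip(pred, target):
        pred_count[p] += 1
        target_count[t] += 1
        if p == t:
            intersection[p] += 1

    ious = []
    for c in range(num_classes):
        # |A ∪ B| = |A| + |B| - |A ∩ B|
        union = pred_count[c] + target_count[c] - intersection[c]
        if union > 0:  # assumption: classes absent from both are skipped
            ious.append(intersection[c] / union)

    if not ious:
        raise ValueError("no class occurs in pred or target")
    return sum(ious) / len(ious)


if __name__ == "__main__":
    # Toy example with four pixels and two occurring classes.
    print(mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=3))
```

In the toy example, class 0 has IoU 1/2 and class 1 has IoU 2/3, so the mean is 7/12; class 2 never occurs and is left out of the average under the stated assumption.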

Pages: 18
