MU-Net: Embedding MixFormer into Unet to Extract Water Bodies from Remote Sensing Images

被引：13

作者：

Zhang, Yonghong ^{[1
]}

Lu, Huanyu ^{[1
]}

Ma, Guangyi ^{[2
]}

Zhao, Huajun ^{[1
]}

Xie, Donglin ^{[1
]}

Geng, Sutong ^{[1
]}

Tian, Wei ^{[3
]}

Sian, Kenny Thiam Choy Lim Kam ^{[4
]}

机构：

[1] Nanjing Univ Informat Sci & Technol, Sch Automat, Nanjing 210044, Peoples R China

[2] Nanjing Univ Informat Sci & Technol, Sch Elect & Informat Engn, Nanjing 210044, Peoples R China

[3] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China

[4] Wuxi Univ, Sch Atmospher Sci & Remote Sensing, Wuxi 214105, Peoples R China

来源：

REMOTE SENSING | 2023年 / 15卷 / 14期

基金：

中国国家自然科学基金;

关键词：

attention mechanism; convolutional neural network; MixFormer; remote sensing; semantic segmentation; Transformer; INDEX NDWI; SEGMENTATION; FEATURES;

D O I：

10.3390/rs15143559

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Water bodies extraction is important in water resource utilization and flood prevention and mitigation. Remote sensing images contain rich information, but due to the complex spatial background features and noise interference, problems such as inaccurate tributary extraction and inaccurate segmentation occur when extracting water bodies. Recently, using a convolutional neural network (CNN) to extract water bodies is gradually becoming popular. However, the local property of CNN limits the extraction of global information, while Transformer, using a self-attention mechanism, has great potential in modeling global information. This paper proposes the MU-Net, a hybrid MixFormer architecture, as a novel method for automatically extracting water bodies. First, the MixFormer block is embedded into Unet. The combination of CNN and MixFormer is used to model the local spatial detail information and global contextual information of the image to improve the ability of the network to capture semantic features of the water body. Then, the features generated by the encoder are refined by the attention mechanism module to suppress the interference of image background noise and non-water body features, which further improves the accuracy of water body extraction. The experiments show that our method has higher segmentation accuracy and robust performance compared with the mainstream CNN- and Transformer-based semantic segmentation networks. The proposed MU-Net achieves 90.25% and 76.52% IoU on the GID and LoveDA datasets, respectively. The experimental results also validate the potential of MixFormer in water extraction studies.

引用

页数：23

共 57 条

[1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[2] Extracting water-related features using reflectance data and principal component analysis of Landsat images
Balazs, Boglarka
Biro, Tibor
Dyke, Gareth
Singh, Sudhir Kumar
Szabo, Szilard
[J]. HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 2018, 63 (02): : 269 - 284
[3] Cao H., 2021, arXiv
[4] Chen J, 2021, arXiv
[5] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, Liang-Chieh
Zhu, Yukun
Papandreou, George
Schroff, Florian
Adam, Hartwig
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
[6] Chen Q., 2022, P 2022 IEEECVF C COM
[7] Water-Body Segmentation for Multi-Spectral Remote Sensing Images by Feature Pyramid Enhancement and Pixel Pair Matching
Chen, Suting
Liu, Yao
Zhang, Chuang
[J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2021, 42 (13) : 5029 - 5047
[8] Multiscale Feature Learning by Transformer for Building Extraction From Satellite Images
Chen, Xin
Qiu, Chunping
Guo, Wenyue
Yu, Anzhu
Tong, Xiaochong
Schmitt, Michael
[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[9] MSResNet: Multiscale Residual Network via Self-Supervised Learning for Water-Body Detection in Remote Sensing Imagery
Dang, Bo
Li, Yansheng
[J]. REMOTE SENSING, 2021, 13 (16)
[10] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929

← 1 2 3 4 5 6 →