MU-Net: Embedding MixFormer into Unet to Extract Water Bodies from Remote Sensing Images

被引:13
作者
Zhang, Yonghong [1 ]
Lu, Huanyu [1 ]
Ma, Guangyi [2 ]
Zhao, Huajun [1 ]
Xie, Donglin [1 ]
Geng, Sutong [1 ]
Tian, Wei [3 ]
Sian, Kenny Thiam Choy Lim Kam [4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Automat, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Elect & Informat Engn, Nanjing 210044, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[4] Wuxi Univ, Sch Atmospher Sci & Remote Sensing, Wuxi 214105, Peoples R China
基金
中国国家自然科学基金;
关键词
attention mechanism; convolutional neural network; MixFormer; remote sensing; semantic segmentation; Transformer; INDEX NDWI; SEGMENTATION; FEATURES;
D O I
10.3390/rs15143559
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Water bodies extraction is important in water resource utilization and flood prevention and mitigation. Remote sensing images contain rich information, but due to the complex spatial background features and noise interference, problems such as inaccurate tributary extraction and inaccurate segmentation occur when extracting water bodies. Recently, using a convolutional neural network (CNN) to extract water bodies is gradually becoming popular. However, the local property of CNN limits the extraction of global information, while Transformer, using a self-attention mechanism, has great potential in modeling global information. This paper proposes the MU-Net, a hybrid MixFormer architecture, as a novel method for automatically extracting water bodies. First, the MixFormer block is embedded into Unet. The combination of CNN and MixFormer is used to model the local spatial detail information and global contextual information of the image to improve the ability of the network to capture semantic features of the water body. Then, the features generated by the encoder are refined by the attention mechanism module to suppress the interference of image background noise and non-water body features, which further improves the accuracy of water body extraction. The experiments show that our method has higher segmentation accuracy and robust performance compared with the mainstream CNN- and Transformer-based semantic segmentation networks. The proposed MU-Net achieves 90.25% and 76.52% IoU on the GID and LoveDA datasets, respectively. The experimental results also validate the potential of MixFormer in water extraction studies.
引用
收藏
页数:23
相关论文
共 57 条
  • [1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [2] Extracting water-related features using reflectance data and principal component analysis of Landsat images
    Balazs, Boglarka
    Biro, Tibor
    Dyke, Gareth
    Singh, Sudhir Kumar
    Szabo, Szilard
    [J]. HYDROLOGICAL SCIENCES JOURNAL-JOURNAL DES SCIENCES HYDROLOGIQUES, 2018, 63 (02): : 269 - 284
  • [3] Cao H., 2021, arXiv
  • [4] Chen J, 2021, arXiv
  • [5] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [6] Chen Q., 2022, P 2022 IEEECVF C COM
  • [7] Water-Body Segmentation for Multi-Spectral Remote Sensing Images by Feature Pyramid Enhancement and Pixel Pair Matching
    Chen, Suting
    Liu, Yao
    Zhang, Chuang
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2021, 42 (13) : 5029 - 5047
  • [8] Multiscale Feature Learning by Transformer for Building Extraction From Satellite Images
    Chen, Xin
    Qiu, Chunping
    Guo, Wenyue
    Yu, Anzhu
    Tong, Xiaochong
    Schmitt, Michael
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [9] MSResNet: Multiscale Residual Network via Self-Supervised Learning for Water-Body Detection in Remote Sensing Imagery
    Dang, Bo
    Li, Yansheng
    [J]. REMOTE SENSING, 2021, 13 (16)
  • [10] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929