CT-UNet: Context-Transfer-UNet for Building Segmentation in Remote Sensing Images

被引:20
作者
Liu, Sheng [1 ]
Ye, Huanran [1 ]
Jin, Kun [1 ]
Cheng, Haohao [1 ]
机构
[1] Zhejiang Univ Technol, Inst Comp Sci & Technol, Hangzhou 310023, Zhejiang, Peoples R China
基金
国家重点研发计划;
关键词
Remote sensing images; Building segmentation; U-Net; Context information; Attention models;
D O I
10.1007/s11063-021-10592-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the proliferation of remote sensing images, how to segment buildings more accurately in remote sensing images is a critical challenge. First, most networks have poor recognition ability on high resolution images, resulting in blurred boundaries in the segmented building maps. Second, the similarity between buildings and background results in intra-class inconsistency. To address these two problems, we propose an UNet-based network named Context-Transfer-UNet (CT-UNet). Specifically, we design Dense Boundary Block. Dense Block utilizes reuse mechanism to refine features and increase recognition capabilities. Boundary Block introduces the low-level spatial information to solve the fuzzy boundary problem. Then, to handle intra-class inconsistency, we construct Spatial Channel Attention Block. It combines context space information and selects more distinguishable features from space and channel. Finally, we propose an improved loss function to enhance the purpose of loss by adding evaluation indicator. Based on our proposed CT-UNet, we achieve 85.33% mean IoU on the Inria dataset, 91.00% mean IoU on the WHU dataset and 83.92% F1-score on the Massachusetts dataset. The results outperform our baseline (U-Net ResNet-34) by 3.76%, exceed Web-Net by 2.24% and surpass HFSA-Unet by 2.17%.
引用
收藏
页码:4257 / 4277
页数:21
相关论文
共 40 条
  • [1] Adiba A, 2019, P NEW CHALL DAT SCI, pP14
  • [2] [Anonymous], 2018, ARXIV180304953
  • [3] Remote Sensing Image Retrieval With Global Morphological Texture Descriptors
    Aptoula, Erchan
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (05): : 3023 - 3034
  • [4] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [5] Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
  • [6] Bischke B, 2019, IEEE IMAGE PROC, P1480, DOI [10.1109/ICIP.2019.8803050, 10.1109/icip.2019.8803050]
  • [7] Fourure D, 2017, RESIDUAL CONV DECONV
  • [8] Glorot X, 2011, JMLR WORKSHOP C P, P315
  • [9] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [10] Hybrid first and second order attention Unet for building segmentation in remote sensing images
    He, Nanjun
    Fang, Leyuan
    Plaza, Antonio
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (04)