CT-UNet: Context-Transfer-UNet for Building Segmentation in Remote Sensing Images

被引：20

作者：

Liu, Sheng ^{[1
]}

Ye, Huanran ^{[1
]}

Jin, Kun ^{[1
]}

Cheng, Haohao ^{[1
]}

机构：

[1] Zhejiang Univ Technol, Inst Comp Sci & Technol, Hangzhou 310023, Zhejiang, Peoples R China

来源：

NEURAL PROCESSING LETTERS | 2021年 / 53卷 / 06期

基金：

国家重点研发计划;

关键词：

Remote sensing images; Building segmentation; U-Net; Context information; Attention models;

D O I：

10.1007/s11063-021-10592-w

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the proliferation of remote sensing images, how to segment buildings more accurately in remote sensing images is a critical challenge. First, most networks have poor recognition ability on high resolution images, resulting in blurred boundaries in the segmented building maps. Second, the similarity between buildings and background results in intra-class inconsistency. To address these two problems, we propose an UNet-based network named Context-Transfer-UNet (CT-UNet). Specifically, we design Dense Boundary Block. Dense Block utilizes reuse mechanism to refine features and increase recognition capabilities. Boundary Block introduces the low-level spatial information to solve the fuzzy boundary problem. Then, to handle intra-class inconsistency, we construct Spatial Channel Attention Block. It combines context space information and selects more distinguishable features from space and channel. Finally, we propose an improved loss function to enhance the purpose of loss by adding evaluation indicator. Based on our proposed CT-UNet, we achieve 85.33% mean IoU on the Inria dataset, 91.00% mean IoU on the WHU dataset and 83.92% F1-score on the Massachusetts dataset. The results outperform our baseline (U-Net ResNet-34) by 3.76%, exceed Web-Net by 2.24% and surpass HFSA-Unet by 2.17%.

引用

页码：4257 / 4277

页数：21

共 40 条

[1] Adiba A, 2019, P NEW CHALL DAT SCI, pP14
[2] [Anonymous], 2018, ARXIV180304953
[3] Remote Sensing Image Retrieval With Global Morphological Texture Descriptors
Aptoula, Erchan
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (05): : 3023 - 3034
[4] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[5] Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
[6] Bischke B, 2019, IEEE IMAGE PROC, P1480, DOI [10.1109/ICIP.2019.8803050, 10.1109/icip.2019.8803050]
[7] Fourure D, 2017, RESIDUAL CONV DECONV
[8] Glorot X, 2011, JMLR WORKSHOP C P, P315
[9] Deep Residual Learning for Image Recognition
He, Kaiming
Zhang, Xiangyu
Ren, Shaoqing
Sun, Jian
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
[10] Hybrid first and second order attention Unet for building segmentation in remote sensing images
He, Nanjun
Fang, Leyuan
Plaza, Antonio
[J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (04)

← 1 2 3 4 →