Building Footprint Extraction from High-Resolution Images via Spatial Residual Inception Convolutional Neural Network

被引：183

作者：

Liu, Penghua ^{[1
,2
]}

Liu, Xiaoping ^{[1
,2
]}

Liu, Mengxi ^{[1
,2
]}

Shi, Qian ^{[1
,2
]}

Yang, Jinxing ^{[3
]}

Xu, Xiaocong ^{[1
,2
]}

Zhang, Yuanying ^{[1
,2
]}

机构：

[1] Sun Yat Sen Univ, Sch Geog & Planning, West Xingang Rd, Guangzhou 510275, Guangdong, Peoples R China

[2] Sun Yat Sen Univ, Guangdong Key Lab Urbanizat & Geosimulat, West Xingang Rd, Guangzhou 510275, Guangdong, Peoples R China

[3] Guangzhou Univ, Sch Geog Sci, West Waihuan St Rd, Guangzhou 510006, Guangdong, Peoples R China

来源：

REMOTE SENSING | 2019年 / 11卷 / 07期

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

semantic segmentation; high-resolution image; building footprints extraction; fully convolutional network; multi-scale contexts; REMOTE-SENSING IMAGES; CLASSIFICATION; SEGMENTATION;

D O I：

10.3390/rs11070830

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

The rapid development in deep learning and computer vision has introduced new opportunities and paradigms for building extraction from remote sensing images. In this paper, we propose a novel fully convolutional network (FCN), in which a spatial residual inception (SRI) module is proposed to capture and aggregate multi-scale contexts for semantic understanding by successively fusing multi-level features. The proposed SRI-Net is capable of accurately detecting large buildings that might be easily omitted while retaining global morphological characteristics and local details. On the other hand, to improve computational efficiency, depthwise separable convolutions and convolution factorization are introduced to significantly decrease the number of model parameters. The proposed model is evaluated on the Inria Aerial Image Labeling Dataset and the Wuhan University (WHU) Aerial Building Dataset. The experimental results show that the proposed methods exhibit significant improvements compared with several state-of-the-art FCNs, including SegNet, U-Net, RefineNet, and DeepLab v3+. The proposed model shows promising potential for building detection from remote sensing images on a large scale.

引用

页数：19

共 61 条

[1]

Abdulkadir O. A., 2016, MEDICAL IMAGE COMPUT, V9901, P424, DOI [10.1007/978-3-319-46723-8_49, DOI 10.1007/978-3-319-46723-8_49]

[2] Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks [J].

Alshehhi, Rasha ;

Marpu, Prashanth Reddy ;

Woon, Wei Lee ;

Dalla Mura, Mauro .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 130 :139-149

[3]

[Anonymous], 2016, 160806993 ARXIV

[4]

[Anonymous], 160305027 ARXIV

[5]

[Anonymous], 170905932 ARXIV

[6]

[Anonymous], 2018, 180202611 ARXIV

[7] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[8] Comparing supervised and unsupervised multiresolution segmentation approaches for extracting buildings from very high resolution imagery [J].

Belgiu, Mariana ;

Dragut, Lucian .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 96 :67-75

[9]

Chen L.C., 2014, SEMANTIC IMAGE SEGME

[10] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

← 1 2 3 4 5 6 7 →