Scene-Driven Multitask Parallel Attention Network for Building Extraction in High-Resolution Remote Sensing Images

被引：147

作者：

Guo, Haonan ^{[1
]}

Shi, Qian ^{[1
]}

Du, Bo ^{[2
]}

Zhang, Liangpei ^{[3
]}

Wang, Dongzhi ^{[4
]}

Ding, Huaxiang ^{[4
]}

机构：

[1] Sun Yet Sen Univ, Sch Geog & Planning, Guangzhou 510275, Peoples R China

[2] Wuhan Univ, Sch Comp Sci, Wuhan 430079, Peoples R China

[3] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China

[4] Dept Nat Resources Guangdong Prov, Surveying & Mapping Inst Lands & Resource Dept Gu, Guangzhou 510500, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2021年 / 59卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Building footprint extraction; deep learning; remote sensing image; scene driven; LIDAR DATA; VEHICLE DETECTION; AERIAL IMAGES; URBAN AREAS; CLASSIFICATION; FUSION; MODEL; INDEX;

D O I：

10.1109/TGRS.2020.3014312

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

The application of convolutional neural networks has been shown to significantly improve the accuracy of building extraction from very high-resolution (VHR) remote sensing images. However, there exist so-called semantic gaps among different kinds of buildings due to the large intraclass variance of buildings, and most of the present-day methods are ineffective in extracting various buildings in large areas that cover different scenes, for example, urban villages and high-rise buildings, because existing building extraction strategies are the same for various scenes. With the improvement of the resolution of remote sensing images, it is feasible to improve the image interpretation based on the scene prior. However, this idea has not been fully utilized in building extraction from VHR remote sensing imagery. This study proposes a scene-driven multitask parallel attention convolutional network (MTPA-Net) to resolve these limitations. The proposed approach classifies the input image into multilabel scenes and further separately maps the buildings in pixel level under different scenes. In addition, a simple postprocessing method is applied to integrate the building extraction results and scene prior. Our proposed method does not require multimodel training and the network can learn in an end-to-end manner. The performance of our proposed method is evaluated on a data set that includes various urban and rural scenes with diverse landscapes. The experimental results show that the proposed MTPA-Net outperforms state-of-the-art algorithms by reducing misclassification areas and maintaining improved robustness.

引用

页码：4287 / 4306

页数：20

共 63 条

[1] Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks [J].

Alshehhi, Rasha ;

Marpu, Prashanth Reddy ;

Woon, Wei Lee ;

Dalla Mura, Mauro .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 130 :139-149

[2]

[Anonymous], 2018, REMOTE SENS BASEL, DOI DOI 10.3390/RS10030407

[3]

[Anonymous], 2016, J APPL REMOTE SENSIN

[4]

Awrangjeb M, 2011, INT ARCH PHOTOGRAMM, V38-3, P143

[5] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[6]

Berman M., 2017, ARXIV170508790

[7]

Chen L.C., 2014, Semantic image segmentation with deep convolutional nets and fully connected CRFs, DOI DOI 10.48550/ARXIV.1412.7062

[8] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[9] Automatic Rooftop Extraction in Nadir Aerial Imagery of Suburban Regions Using Corners and Variational Level Set Evolution [J].

Cote, Melissa ;

Saeedi, Parvaneh .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2013, 51 (01) :313-328

[10] Automatic building extraction from LiDAR data fusion of point and grid-based features [J].

Du, Shouji ;

Zhang, Yunsheng ;

Zou, Zhengrong ;

Xu, Shenghua ;

He, Xue ;

Chen, Siyang .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 130 :294-307

← 1 2 3 4 5 6 7 →