Scene-Driven Multitask Parallel Attention Network for Building Extraction in High-Resolution Remote Sensing Images

被引：147

作者：

Guo, Haonan ^{[1
]}

Shi, Qian ^{[1
]}

Du, Bo ^{[2
]}

Zhang, Liangpei ^{[3
]}

Wang, Dongzhi ^{[4
]}

Ding, Huaxiang ^{[4
]}

机构：

[1] Sun Yet Sen Univ, Sch Geog & Planning, Guangzhou 510275, Peoples R China

[2] Wuhan Univ, Sch Comp Sci, Wuhan 430079, Peoples R China

[3] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China

[4] Dept Nat Resources Guangdong Prov, Surveying & Mapping Inst Lands & Resource Dept Gu, Guangzhou 510500, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2021年 / 59卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Building footprint extraction; deep learning; remote sensing image; scene driven; LIDAR DATA; VEHICLE DETECTION; AERIAL IMAGES; URBAN AREAS; CLASSIFICATION; FUSION; MODEL; INDEX;

D O I：

10.1109/TGRS.2020.3014312

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

The application of convolutional neural networks has been shown to significantly improve the accuracy of building extraction from very high-resolution (VHR) remote sensing images. However, there exist so-called semantic gaps among different kinds of buildings due to the large intraclass variance of buildings, and most of the present-day methods are ineffective in extracting various buildings in large areas that cover different scenes, for example, urban villages and high-rise buildings, because existing building extraction strategies are the same for various scenes. With the improvement of the resolution of remote sensing images, it is feasible to improve the image interpretation based on the scene prior. However, this idea has not been fully utilized in building extraction from VHR remote sensing imagery. This study proposes a scene-driven multitask parallel attention convolutional network (MTPA-Net) to resolve these limitations. The proposed approach classifies the input image into multilabel scenes and further separately maps the buildings in pixel level under different scenes. In addition, a simple postprocessing method is applied to integrate the building extraction results and scene prior. Our proposed method does not require multimodel training and the network can learn in an end-to-end manner. The performance of our proposed method is evaluated on a data set that includes various urban and rural scenes with diverse landscapes. The experimental results show that the proposed MTPA-Net outperforms state-of-the-art algorithms by reducing misclassification areas and maintaining improved robustness.

引用

页码：4287 / 4306

页数：20

共 63 条

[31] Deep learning in remote sensing applications: A meta-analysis and review [J].

Ma, Lei ;

Liu, Yu ;

Zhang, Xueliang ;

Ye, Yuanxin ;

Yin, Gaofei ;

Johnson, Brian Alan .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 152 :166-177

[32]

Maggiori E, 2017, INT GEOSCI REMOTE SE, P3226, DOI 10.1109/IGARSS.2017.8127684

[33] Detect Residential Buildings from Lidar and Aerial Photographs through Object-Oriented Land-Use Classification [J].

Meng, Xuelian ;

Currit, Nate ;

Wang, Le ;

Yang, Xiaojun .

PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2012, 78 (01) :35-44

[34] Contextual classification of lidar data and building object detection in urban areas [J].

Niemeyer, Joachim ;

Rottensteiner, Franz ;

Soergel, Uwe .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 87 :152-165

[35]

Paszke A., 2019, arXiv, P8026, DOI DOI 10.48550/ARXIV.1912.01703

[36] Urban planning and building smart cities based on the Internet of Things using Big Data analytics [J].

Rathore, M. Mazhar ;

Ahmad, Awais ;

Paul, Anand ;

Rho, Seungmin .

COMPUTER NETWORKS, 2016, 101 :63-80

[37]

[任自珍 REN Zizhen], 2009, [西南交通大学学报, Journal of Southwest Jiaotong University], V44, P83

[38] U-Net: Convolutional Networks for Biomedical Image Segmentation [J].

Ronneberger, Olaf ;

Fischer, Philipp ;

Brox, Thomas .

MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241

[39] Aulomatic generation of high-quality building models from Lidar data [J].

Rottensteiner, F .

IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2003, 23 (06) :42-50

[40] BRRNet: A Fully Convolutional Neural Network for Automatic Building Extraction From High-Resolution Remote Sensing Images [J].

Shao, Zhenfeng ;

Tang, Penghao ;

Wang, Zhongyuan ;

Saleem, Nayyer ;

Yam, Sarath ;

Sommai, Chatpong .

REMOTE SENSING, 2020, 12 (06)

← 1 2 3 4 5 6 7 →