Scene-Driven Multitask Parallel Attention Network for Building Extraction in High-Resolution Remote Sensing Images

被引:147
作者
Guo, Haonan [1 ]
Shi, Qian [1 ]
Du, Bo [2 ]
Zhang, Liangpei [3 ]
Wang, Dongzhi [4 ]
Ding, Huaxiang [4 ]
机构
[1] Sun Yet Sen Univ, Sch Geog & Planning, Guangzhou 510275, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, Wuhan 430079, Peoples R China
[3] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China
[4] Dept Nat Resources Guangdong Prov, Surveying & Mapping Inst Lands & Resource Dept Gu, Guangzhou 510500, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2021年 / 59卷 / 05期
基金
中国国家自然科学基金;
关键词
Building footprint extraction; deep learning; remote sensing image; scene driven; LIDAR DATA; VEHICLE DETECTION; AERIAL IMAGES; URBAN AREAS; CLASSIFICATION; FUSION; MODEL; INDEX;
D O I
10.1109/TGRS.2020.3014312
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The application of convolutional neural networks has been shown to significantly improve the accuracy of building extraction from very high-resolution (VHR) remote sensing images. However, there exist so-called semantic gaps among different kinds of buildings due to the large intraclass variance of buildings, and most of the present-day methods are ineffective in extracting various buildings in large areas that cover different scenes, for example, urban villages and high-rise buildings, because existing building extraction strategies are the same for various scenes. With the improvement of the resolution of remote sensing images, it is feasible to improve the image interpretation based on the scene prior. However, this idea has not been fully utilized in building extraction from VHR remote sensing imagery. This study proposes a scene-driven multitask parallel attention convolutional network (MTPA-Net) to resolve these limitations. The proposed approach classifies the input image into multilabel scenes and further separately maps the buildings in pixel level under different scenes. In addition, a simple postprocessing method is applied to integrate the building extraction results and scene prior. Our proposed method does not require multimodel training and the network can learn in an end-to-end manner. The performance of our proposed method is evaluated on a data set that includes various urban and rural scenes with diverse landscapes. The experimental results show that the proposed MTPA-Net outperforms state-of-the-art algorithms by reducing misclassification areas and maintaining improved robustness.
引用
收藏
页码:4287 / 4306
页数:20
相关论文
共 63 条
[51]   GeoSay: A geometric saliency for extracting buildings in remote sensing images [J].
Xia, Gui-Song ;
Huang, Jin ;
Xue, Nan ;
Lu, Qikai ;
Zhu, Xiaoxiang .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2019, 186 :37-47
[52]   Automatic Building Rooftop Extraction From Aerial Images via Hierarchical RGB-D Priors [J].
Xu, Shibiao ;
Pan, Xingjia ;
Li, Er ;
Wu, Baoyuan ;
Bu, Shuhui ;
Dong, Weiming ;
Xiang, Shiming ;
Zhang, Xiaopeng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (12) :7369-7387
[53]   Building Extraction in Very High Resolution Remote Sensing Imagery Using Deep Learning and Guided Filters [J].
Xu, Yongyang ;
Wu, Liang ;
Xie, Zhong ;
Chen, Zhanlong .
REMOTE SENSING, 2018, 10 (01)
[54]   A Novel Building and Tree Detection Method From LiDAR Data and Aerial Images [J].
Zarea, Asghar ;
Mohammadzadeh, Ali .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (05) :1864-1875
[55]   Use of normalized difference built-up index in automatically mapping urban areas from TM imagery [J].
Zha, Y ;
Gao, J ;
Ni, S .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2003, 24 (03) :583-594
[56]   CAD-Net: A Context-Aware Detection Network for Objects in Remote Sensing Imagery [J].
Zhang, Gongjie ;
Lu, Shijian ;
Zhang, Wei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (12) :10015-10024
[57]   Automatic construction of building footprint's from airborne LIDAR data [J].
Zhang, Keqi ;
Yan, Jianhua ;
Chen, Shu-Ching .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2006, 44 (09) :2523-2533
[58]  
Zhang MR, 2019, ARXIV190708610
[59]   Urban Building Density Estimation From High-Resolution Imagery Using Multiple Features and Support Vector Regression [J].
Zhang, Tao ;
Huang, Xin ;
Wen, Dawei ;
Li, Jiayi .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (07) :3265-3280
[60]   The Fisher Kernel Coding Framework for High Spatial Resolution Scene Classification [J].
Zhao, Bei ;
Zhong, Yanfei ;
Zhang, Liangpei ;
Huang, Bo .
REMOTE SENSING, 2016, 8 (02)