MAP-Net: Multiple Attending Path Neural Network for Building Footprint Extraction From Remote Sensed Imagery

Cited by: 194
Authors
Zhu, Qing [1]
Liao, Cheng [1]
Hu, Han [1]
Mei, Xiaoming [2]
Li, Haifeng [2]
Affiliations
[1] Southwest Jiaotong Univ, Fac Geosci & Environm Engn, Chengdu 611756, Peoples R China
[2] Cent South Univ, Sch Geosci & Infophys, Changsha 410083, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2021 / Vol. 59 / No. 07
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Buildings; Semantics; Data mining; Spatial resolution; Remote sensing; Convolution; Attention mechanism; building footprint extraction; deep learning; remote sensing imagery; semantic segmentation; SEMANTIC SEGMENTATION; AERIAL IMAGERY; SENSING IMAGERY; DATA FUSION; LIDAR DATA; POINT;
DOI
10.1109/TGRS.2020.3026051
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification codes
0708; 070902;
Abstract
Building footprint extraction is a basic task in fields such as mapping, image understanding, and computer vision. Accurately and efficiently extracting building footprints from a wide range of remotely sensed imagery remains a challenge due to the complex structures, variety of scales, and diverse appearances of buildings. Existing convolutional neural network (CNN)-based building extraction methods are criticized for their inability to detect tiny buildings because the spatial information of the feature maps is lost during repeated pooling operations. In addition, large buildings still have inaccurate segmentation edges. Moreover, features extracted by a CNN are always partially restricted by the size of the receptive field, and large-scale buildings with low texture are always discontinuous and full of holes when extracted. To alleviate these problems, recent work has introduced multiscale strategies to extract buildings at different scales. However, the higher-resolution features are generally extracted from shallow layers, which capture insufficient semantic information for tiny buildings. This article proposes a novel multiple attending path neural network (MAP-Net) for accurately extracting multiscale building footprints and precise boundaries. Unlike existing multiscale feature extraction strategies, MAP-Net learns spatial localization-preserved multiscale features through a multiparallel path in which each stage is gradually generated to extract high-level semantic features with fixed resolution. Then, an attention module adaptively squeezes the channel-wise features extracted from each path for optimized multiscale fusion, and a pyramid spatial pooling module captures global dependencies for refining discontinuous building footprints.
Experimental results show that, compared with the latest HRNetv2, our method improves the F1-score by 0.88%, 0.93%, and 0.45% and the intersection over union (IoU) score by 1.53%, 1.50%, and 0.82% on the Urban 3-D, Deep Globe, and WHU data sets, respectively, without increasing computational complexity. Specifically, on the WHU data set, MAP-Net, without pretraining or postprocessing, outperforms the multiscale aggregation fully convolutional network (MA-FCN), the state-of-the-art (SOTA) algorithm that relies on postprocessing and model voting strategies. The TensorFlow implementation is available at https://github.com/lehaifeng/MAPNet.
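The abstract reports its gains as F1-score and IoU on building masks. As a minimal sketch of how these two metrics are commonly computed for binary segmentation, the following pure-Python example counts pixel-level true positives, false positives, and false negatives. The function names and the toy masks are hypothetical and are not taken from the MAP-Net repository; the paper's exact evaluation protocol may differ.

```python
def confusion_counts(pred, truth):
    """Count pixel-level TP, FP, FN for two binary masks given as lists of rows."""
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1      # building predicted, building present
            elif p and not t:
                fp += 1      # building predicted, background present
            elif t and not p:
                fn += 1      # building missed
    return tp, fp, fn


def f1_score(tp, fp, fn):
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is zero."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def iou_score(tp, fp, fn):
    """IoU = TP / (TP + FP + FN); defined as 0.0 when the denominator is zero."""
    denom = tp + fp + fn
    return tp / denom if denom else 0.0


if __name__ == "__main__":
    # Hypothetical 2x3 masks, 1 = building pixel, 0 = background.
    pred = [[1, 1, 0],
            [0, 1, 0]]
    truth = [[1, 0, 0],
             [0, 1, 1]]
    tp, fp, fn = confusion_counts(pred, truth)
    print(tp, fp, fn)                          # 2 1 1
    print(round(f1_score(tp, fp, fn), 4))      # 0.6667
    print(iou_score(tp, fp, fn))               # 0.5
```

For a single pair of masks the two metrics are linked by IoU = F1 / (2 - F1), so IoU is never larger than F1 and moves by a different amount for the same change in the prediction.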
Pages: 6169-6181
Number of pages: 13