Building Extraction from Very High Resolution Aerial Imagery Using Joint Attention Deep Neural Network

被引:63
作者
Ye, Ziran [1 ]
Fu, Yongyong [1 ]
Gan, Muye [1 ]
Deng, Jinsong [1 ]
Comber, Alexis [2 ,3 ]
Wang, Ke [1 ]
机构
[1] Zhejiang Univ, Coll Environm & Resource Sci, Inst Appl Remote Sensing & Informat Technol, Hangzhou 310058, Peoples R China
[2] Univ Leeds, Sch Geog, Leeds LS2 9JT, W Yorkshire, England
[3] Univ Leeds, Leeds Inst Data Analyt, Leeds LS2 9JT, W Yorkshire, England
基金
中国国家自然科学基金;
关键词
building extraction; fully convolutional neural network (FCN); attention mechanism; high resolution aerial images; CONVOLUTIONAL NETWORKS; SEMANTIC SEGMENTATION; CLASSIFICATION;
D O I
10.3390/rs11242970
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Automated methods to extract buildings from very high resolution (VHR) remote sensing data have many applications in a wide range of fields. Many convolutional neural network (CNN) based methods have been proposed and have achieved significant advances in the building extraction task. In order to refine predictions, a lot of recent approaches fuse features from earlier layers of CNNs to introduce abundant spatial information, which is known as skip connection. However, this strategy of reusing earlier features directly without processing could reduce the performance of the network. To address this problem, we propose a novel fully convolutional network (FCN) that adopts attention based re-weighting to extract buildings from aerial imagery. Specifically, we consider the semantic gap between features from different stages and leverage the attention mechanism to bridge the gap prior to the fusion of features. The inferred attention weights along spatial and channel-wise dimensions make the low level feature maps adaptive to high level feature maps in a target-oriented manner. Experimental results on three publicly available aerial imagery datasets show that the proposed model (RFA-UNet) achieves comparable and improved performance compared to other state-of-the-art models for building extraction.
引用
收藏
页数:21
相关论文
共 67 条
  • [1] Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks
    Alshehhi, Rasha
    Marpu, Prashanth Reddy
    Woon, Wei Lee
    Dalla Mura, Mauro
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 130 : 139 - 149
  • [2] [Anonymous], 2018, REMOTE SENS-BASEL, DOI DOI 10.3390/RS10050779
  • [3] [Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.492
  • [4] [Anonymous], 2015, PROC CVPR IEEE
  • [5] [Anonymous], P IEEE C COMP VIS PA
  • [6] [Anonymous], P IEEE C COMP VIS PA
  • [7] [Anonymous], 2013, Ph.D. dissertation
  • [8] [Anonymous], 2017, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2016.2644615
  • [9] [Anonymous], P 6 INT C LEARN REPR
  • [10] Baby M, 2017, 2017 INTERNATIONAL CONFERENCE OF ELECTRONICS, COMMUNICATION AND AEROSPACE TECHNOLOGY (ICECA), VOL 1, P107, DOI 10.1109/ICECA.2017.8203654