HCRB-MSAN: Horizontally Connected Residual Blocks-Based Multiscale Attention Network for Semantic Segmentation of Buildings in HSR Remote Sensing Images

被引:14
作者
Li, Zhen [1 ,2 ]
Zhang, Zhenxin [1 ,2 ]
Chen, Dong [3 ]
Zhang, Liqiang [4 ,5 ]
Zhu, Lin [1 ,2 ]
Wang, Qiang [6 ]
Chen, Siyun [1 ,2 ]
Peng, Xueli [1 ,2 ]
机构
[1] Capital Normal Univ, Key Lab 3D Informat Acquisit & Applicat, MOE, Beijing 100048, Peoples R China
[2] Capital Normal Univ, Coll Resource Environm & Tourism, Beijing 100048, Peoples R China
[3] Nanjing Forestry Univ, Coll Civil Engn, Nanjing 210037, Peoples R China
[4] Beijing Normal Univ, Geog Sci, Beijing 100875, Peoples R China
[5] Beijing Normal Univ, State Key Lab Remote Sensing Sci, Beijing 100875, Peoples R China
[6] Tianjin Normal Univ, Sch Geog & Environm Sci, Tianjin 300387, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Feature extraction; Buildings; Remote sensing; Image segmentation; Semantics; Data mining; Deep learning; Building semantic segmentation; deep learning; horizontally connected residual block; high spatial resolution (HSR) remote sensing image; multiscale attention; RESOLUTION SATELLITE IMAGERY; EXTRACTION; CLASSIFICATION; FUSION; SCENES;
D O I
10.1109/JSTARS.2022.3188515
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accurate and efficient semantic segmentation of buildings in high spatial resolution (HSR) remote sensing images is the basis for applications such as fine urban management, high-precision mapping, land resource utilization investigation, and human settlement suitability evaluation. The current building extraction methods based on deep learning can obtain high-level abstract features of images. However, due to the limitation of convolution kernel size and the vanishing gradient, the extraction of some buildings is inaccurate, and some small-volume buildings are missing as the network deepens. In this regard, we design a horizontally connected residual blocks-based multiscale attention network to achieve high-quality extraction of buildings in HSR remote sensing image. In this network, we subdivide each residual block by channel grouping and feature horizontal connection to consider the difference and saliency of feature information between channels, and then combine the output features with multiscale attention module to consider the contextual semantic relationship of different regions and integrate multilevel local and global information of buildings. A stepwise up-sampling mechanism is designed in the decoding process to finally achieve precise semantic segmentation of buildings. We conduct experiments on two public datasets and compare the proposed method with state-of-the-art semantic segmentation methods. The experiments show that our method could achieve better building extraction results in HSR remote sensing image, which proves the effectiveness of our proposed method.
引用
收藏
页码:5534 / 5544
页数:11
相关论文
共 64 条
[61]   MAP-Net: Multiple Attending Path Neural Network for Building Footprint Extraction From Remote Sensed Imagery [J].
Zhu, Qing ;
Liao, Cheng ;
Hu, Han ;
Mei, Xiaoming ;
Li, Haifeng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (07) :6169-6181
[62]   End-to-end Flow Correlation Tracking with Spatial-temporal Attention [J].
Zhu, Zheng ;
Wu, Wei ;
Zou, Wei ;
Yan, Junjie .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :548-557
[63]   Distractor-Aware Siamese Networks for Visual Object Tracking [J].
Zhu, Zheng ;
Wang, Qiang ;
Li, Bo ;
Wu, Wei ;
Yan, Junjie ;
Hu, Weiming .
COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 :103-119
[64]  
Zui H. U., 2011, TROP GEOGRAPHY, V31, P211