EFFICIENT SEMANTIC SEGMENTATION METHOD WITH STRIP POOLING FOR VHR REMOTE SENSING IMAGES

被引:1
作者
Sheng, Yifan [1 ]
Yang, Junli [1 ]
Lin, Youguang [1 ]
Lei, Yu [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
来源
2021 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM IGARSS | 2021年
关键词
Semantic segmentation; high resolution remote sensing images; strip pooling;
D O I
10.1109/IGARSS47720.2021.9553336
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
In this paper, we address the problem of multi-class semantic segmentation of high resolution remote sensing images with a deep convolutional neutral network based model named Strip Pooling Network(SPNet). The objects in high resolution remote sensing images vary greatly in size. Moreover, these objects have different extensibility and directions, such as long narrow roads and wide grasslands. These bring big challenge for traditional square pooling kernels. If we want to capture the long-rang dependencies and enough contextual information, we need to use large pooling kernel which definitely increases computation greatly and incorporates interference information from irrelevant regions. SPNet introduces a new pooling strategy, called strip pooling which uses a long but narrow kernel, i.e., 1 x N or N x 1. It can solve the above problem while preventing information from irrelevant regions. Meanwhile, two paths in the strip pooling module focus on the horizontal and vertical spatial dimensions respectively, which compensate for the lack of one path captured in the narrow dimension of each other. Experimental results on public available Potsdam dataset demonstrate that SPNet obtains an overall accuracy of 89.1%, which outperforms other state-of-the-art methods.
引用
收藏
页码:2759 / 2762
页数:4
相关论文
共 9 条
[1]   Attention to Scale: Scale-aware Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Yang, Yi ;
Wang, Jiang ;
Xu, Wei ;
Yuille, Alan L. .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3640-3649
[2]   Strip Pooling: Rethinking Spatial Pooling for Scene Parsing [J].
Hou, Qibin ;
Zhang, Li ;
Cheng, Ming-Ming ;
Feng, Jiashi .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4002-4011
[3]  
Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965
[4]   U-Net: Convolutional Networks for Biomedical Image Segmentation [J].
Ronneberger, Olaf ;
Fischer, Philipp ;
Brox, Thomas .
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241
[5]   Results of the ISPRS benchmark on urban object detection and 3D building reconstruction [J].
Rottensteiner, Franz ;
Sohn, Gunho ;
Gerke, Markus ;
Wegner, Jan Dirk ;
Breitkopf, Uwe ;
Jung, Jaewook .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 93 :256-271
[6]  
Wu H, 2019, ARXIV
[7]  
Yu F., 2016, ARXIV, V4, DOI 10.48550/arXiv.1511.07122
[8]   Pyramid Scene Parsing Network [J].
Zhao, Hengshuang ;
Shi, Jianping ;
Qi, Xiaojuan ;
Wang, Xiaogang ;
Jia, Jiaya .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6230-6239
[9]   D-LinkNet: LinkNet with Pretrained Encoder and Dilated Convolution for High Resolution Satellite Imagery Road Extraction [J].
Zhou, Lichen ;
Zhang, Chuang ;
Wu, Ming .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :192-196