Semantic Segmentation for High Spatial Resolution Remote Sensing Images Based on Convolution Neural Network and Pyramid Pooling Module

被引:156
作者
Yu, Bo [1 ]
Yang, Lu [2 ]
Chen, Fang [1 ,3 ,4 ]
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[2] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
[3] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Hainan Key Lab Earth Observat, Sanya 572029, Peoples R China
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Remotely sensed images; semantic segmentation; CLASSIFICATION; EXTRACTION;
D O I
10.1109/JSTARS.2018.2860989
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic segmentation provides a practical way to segment remotely sensed images into multiple ground objects simultaneously, which can be potentially applied to multiple remote sensed related aspects. Current classification algorithms in remotely sensed images are mostly limited by different imaging conditions, the multiple ground objects are difficult to be separated from each other due to high intraclass spectral variances and interclass spectral similarities. In this study, we propose an end-to-end framework to semantically segment high-resolution aerial images without postprocessing to refine the segmentation results. The framework provides a pixel-wise segmentation result, comprising convolutional neural network structure and pyramid pooling module, which aims to extract feature maps at multiple scales. The proposed model is applied to the ISPRS Vaihingen benchmark dataset from the ISPRS 2D Semantic Labeling Challenge. Its segmentation results are compared with previous state-of-the-art method UZ _1, UPB and three other methods that segment images into objects of all the classes (including clutter/background) based on true orthophoto tiles, and achieve the highest overall accuracy of 87.8% over the published performances, to the best of our knowledge. The results validate the efficiency of the proposed model in segmenting multiple ground objects from remotely sensed images simultaneously.
引用
收藏
页码:3252 / 3261
页数:10
相关论文
共 61 条
[1]   Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks [J].
Alshehhi, Rasha ;
Marpu, Prashanth Reddy ;
Woon, Wei Lee ;
Dalla Mura, Mauro .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 130 :139-149
[2]  
[Anonymous], ISPRS SEMANTIC LABEL
[3]  
[Anonymous], 2015, P INT C MACH LEAM IC
[4]  
[Anonymous], 2005, P 4 SIGHAN WORKSH CH
[5]  
[Anonymous], PAMI
[6]  
[Anonymous], 2D SEM LAB CONT
[7]  
[Anonymous], 2010, EPFLREPORT149300
[8]  
[Anonymous], PROC CVPR IEEE
[9]  
[Anonymous], 2011, ADV NEURAL INF PROCE
[10]  
[Anonymous], P AS C COMP VIS