On the Effectiveness of Weakly Supervised Semantic Segmentation for Building Extraction From High-Resolution Remote Sensing Imagery

Cited by: 58
Authors
Li, Zhenshi [1]
Zhang, Xueliang [1]
Xiao, Pengfeng [1]
Zheng, Zixian [1]
Affiliations
[1] Nanjing Univ, Sch Geog & Ocean Sci, Minist Nat Resources,Jiangsu Prov Key Lab Geog In, Key Lab Land Satellite Remote Sensing Applicat, Nanjing 210023, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Image segmentation; Training; Buildings; Remote sensing; Semantics; Feature extraction; Data mining; Building extraction; fully convolutional network; high-resolution remote sensing imagery; weakly supervised semantic segmentation (WSSS); CONVOLUTIONAL NEURAL-NETWORK; DEEP; CLASSIFICATION;
DOI
10.1109/JSTARS.2021.3063788
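Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract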
A critical obstacle to semantic segmentation of remote sensing images with deep convolutional neural networks is the requirement for a huge number of pixel-level labels. Taking building extraction as an example, this study focuses on how to effectively apply weakly supervised semantic segmentation (WSSS) with image-level labels to high-resolution (HR) remote sensing images, a prominent solution to the labeling challenge. The widely used two-step WSSS framework is adopted, in which pseudo-masks are first produced from image-level labels and a segmentation network is then trained on the pseudo-masks. In addition, the fully connected conditional random field (CRF) is used to exploit spatial context in both the training and prediction stages. Detailed analyses are carried out on applying WSSS to HR images in terms of producing pseudo-masks, training the segmentation network, and optimizing predictions. We show that the tradeoff between precision and recall of the pseudo-masks, as well as the boundary accuracy and the background, needs to be carefully considered. The benefits of the segmentation network in the two-step framework are demonstrated in comparison with using a classification network only for WSSS, and the CRF-loss is found to be powerful for improving the segmentation network, although it is not appropriate for dense buildings. An overlapping strategy and CRF postprocessing are further shown to be effective for optimizing the segmentation results during inference. Through deliberate settings, we can generate results comparable to those of fully supervised methods on the ISPRS Potsdam and Vaihingen datasets, which is meaningful for promoting WSSS applications for extracting geographic information from HR images.
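The precision/recall tradeoff of pseudo-masks mentioned in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that a pseudo-mask is obtained by thresholding a per-pixel building activation map; the abstract does not state how the pseudo-masks are actually produced. The function names `threshold_mask` and `precision_recall`, the toy activation values, and the toy reference mask are all hypothetical.

```python
def threshold_mask(activation, threshold):
    """Binarize a 2-D activation map: 1 = building, 0 = background."""
    return [[1 if value >= threshold else 0 for value in row] for row in activation]


def precision_recall(pseudo_mask, reference_mask):
    """Pixel-wise precision and recall of a binary pseudo-mask."""
    tp = fp = fn = 0
    for pseudo_row, reference_row in zip(pseudo_mask, reference_mask):
        for predicted, actual in zip(pseudo_row, reference_row):
            if predicted and actual:
                tp += 1  # building pixel correctly marked
            elif predicted and not actual:
                fp += 1  # background pixel wrongly marked as building
            elif actual and not predicted:
                fn += 1  # building pixel missed by the pseudo-mask
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Toy 4x4 activation map (hypothetical values, not taken from the paper).
activation = [
    [0.9, 0.8, 0.4, 0.1],
    [0.7, 0.6, 0.3, 0.1],
    [0.5, 0.3, 0.2, 0.0],
    [0.1, 0.1, 0.0, 0.0],
]
# Toy reference mask with six building pixels (also hypothetical).
reference = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
]

for threshold in (0.3, 0.5, 0.8):
    p, r = precision_recall(threshold_mask(activation, threshold), reference)
    print(f"threshold={threshold:.1f} precision={p:.2f} recall={r:.2f}")
```

On this toy data, raising the threshold removes false positives first and then starts to drop true building pixels, so recall falls while precision improves or stays at its maximum. Which point on that curve suits the downstream segmentation network is exactly the kind of choice the abstract says needs careful consideration.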
Pages: 3266-3281
Page count: 16