Multi-Class Strategies for Joint Building Footprint and Road Detection in Remote Sensing

被引:4
|
作者
Ayala, Christian [1 ]
Aranda, Carlos [1 ]
Galar, Mikel [2 ]
机构
[1] Tracasa Instrumental, Calle Cabarceno 6, Sarriguren 31621, Spain
[2] Publ Univ Navarre UPNA, Inst Smart Cities ISC, Arrosadia Campus, Pamplona 31006, Spain
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 18期
关键词
Sentinel-1; Sentinel-2; remote sensing; building detection; road detection; deep learning; convolutional neural networks; multi-class semantic segmentation; binary semantic segmentation; multi-task semantic segmentation; CLASSIFICATION;
D O I
10.3390/app11188340
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Building footprints and road networks are important inputs for a great deal of services. For instance, building maps are useful for urban planning, whereas road maps are essential for disaster response services. Traditionally, building and road maps are manually generated by remote sensing experts or land surveying, occasionally assisted by semi-automatic tools. In the last decade, deep learning-based approaches have demonstrated their capabilities to extract these elements automatically and accurately from remote sensing imagery. The building footprint and road network detection problem can be considered a multi-class semantic segmentation task, that is, a single model performs a pixel-wise classification on multiple classes, optimizing the overall performance. However, depending on the spatial resolution of the imagery used, both classes may coexist within the same pixel, drastically reducing their separability. In this regard, binary decomposition techniques, which have been widely studied in the machine learning literature, are proved useful for addressing multi-class problems. Accordingly, the multi-class problem can be split into multiple binary semantic segmentation sub-problems, specializing different models for each class. Nevertheless, in these cases, an aggregation step is required to obtain the final output labels. Additionally, other novel approaches, such as multi-task learning, may come in handy to further increase the performance of the binary semantic segmentation models. Since there is no certainty as to which strategy should be carried out to accurately tackle a multi-class remote sensing semantic segmentation problem, this paper performs an in-depth study to shed light on the issue. For this purpose, open-access Sentinel-1 and Sentinel-2 imagery (at 10 m) are considered for extracting buildings and roads, making use of the well-known U-Net convolutional neural network. It is worth stressing that building and road classes may coexist within the same pixel when working at such a low spatial resolution, setting a challenging problem scheme. Accordingly, a robust experimental study is developed to assess the benefits of the decomposition strategies and their combination with a multi-task learning scheme. The obtained results demonstrate that decomposing the considered multi-class remote sensing semantic segmentation problem into multiple binary ones using a One-vs.-All binary decomposition technique leads to better results than the standard direct multi-class approach. Additionally, the benefits of using a multi-task learning scheme for pushing the performance of binary segmentation models are also shown.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] MultiDefectNet: Multi-Class Defect Detection of Building Facade Based on Deep Convolutional Neural Network
    Lee, Kisu
    Hong, Goopyo
    Sael, Lee
    Lee, Sanghyo
    Kim, Ha Young
    SUSTAINABILITY, 2020, 12 (22) : 1 - 14
  • [32] Tilt Correction Toward Building Detection of Remote Sensing Images
    Liu, Kang
    Jiang, Zhiyu
    Xu, Mingliang
    Perc, Matjaz
    Li, Xuelong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 5854 - 5866
  • [33] Multi-scale Cross Dual Attention Network for Building Change Detection in Remote Sensing Images
    Zhang J.
    Yan Z.
    Ma S.
    Journal of Geo-Information Science, 2023, 25 (12) : 2487 - 2500
  • [34] Remote Sensing Building Detection Based on Binarized Semantic Segmentation
    Zhu Tianyou
    Dong Feng
    Gong Huixing
    ACTA OPTICA SINICA, 2019, 39 (12)
  • [35] Automatic Building Footprint Extraction from Multi-Resolution Remote Sensing Images Using a Hybrid FCN
    Schuegraf, Philipp
    Bittner, Ksenia
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (04)
  • [36] A review of building detection from very high resolution optical remote sensing images
    Li, Jiayi
    Huang, Xin
    Tu, Lilin
    Zhang, Tao
    Wang, Leiguang
    GISCIENCE & REMOTE SENSING, 2022, 59 (01) : 1199 - 1225
  • [37] Deep Multi-Scale Fusion Neural Network for Multi-Class Arrhythmia Detection
    Wang, Ruxin
    Fan, Jianping
    Li, Ye
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (09) : 2461 - 2472
  • [38] Road Detection of Remote Sensing Image Based on Convolutional Neural Network
    Zhu, Yuting
    Yan, Jingwen
    Wang, Cong
    Zhou, Yiqing
    IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 106 - 118
  • [39] Richer U-Net: Learning More Details for Road Detection in Remote Sensing Images
    Zao, Yifan
    Shi, Zhenwei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [40] Multi-class SVM based remote sensing image classification and its semi-supervised improvement scheme
    Qi, HN
    Yang, JG
    Zhong, YW
    Deng, C
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 3146 - 3151