Multi-Task cGAN for Simultaneous Spaceborne DSM Refinement and Roof-Type Classification

被引:17
作者
Bittner, Ksenia [1 ]
Koerner, Marco [2 ]
Fraundorfer, Friedrich [3 ]
Reinartz, Peter [1 ]
机构
[1] German Aerosp Ctr DLR, Remote Sensing Technol Inst, Munchner Str 20, D-82234 Wessling, Germany
[2] TUM, Chair Remote Sensing Technol, Dept Civil Geo & Environm Engn, Arcisstr 21, D-80333 Munich, Germany
[3] Graz Univ Technol, Inst Comp Graph & Vis, A-8010 Graz, Austria
关键词
multi-task learning; conditional generative adversarial networks; digital surface model; 3D scene refinement; semantic segmentation; roof type classification; urban region; satellite imagery; DIGITAL SURFACE MODELS; ACCURACY ASSESSMENT; RECOGNITION; GENERATION; RESOLUTION; LIDAR;
D O I
10.3390/rs11111262
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Various deep learning applications benefit from multi-task learning with multiple regression and classification objectives by taking advantage of the similarities between individual tasks. This can result in improved learning efficiency and prediction accuracy for the task-specific models compared to separately trained models. In this paper, we make an observation of such influences for important remote sensing applications like elevation model generation and semantic segmentation tasks from the stereo half-meter resolution satellite digital surface models (DSMs). Mainly, we aim to generate good-quality DSMs with complete, as well as accurate level of detail (LoD)2-like building forms and to assign an object class label to each pixel in the DSMs. For the label assignment task, we select the roof type classification problem to distinguish between flat, non-flat, and background pixels. To realize those tasks, we train a conditional generative adversarial network (cGAN) with an objective function based on least squares residuals and an auxiliary term based on normal vectors for further roof surface refinement. Besides, we investigate recently published deep learning architectures for both tasks and develop the final end-to-end network, which combines different models, as using them first separately, they provide the best results for their individual tasks.
引用
收藏
页数:22
相关论文
共 67 条
[1]   KNOWLEDGE BASED 3D BUILDING MODEL RECOGNITION USING CONVOLUTIONAL NEURAL NETWORKS FROM LIDAR AND AERIAL IMAGERIES [J].
Alidoost, F. ;
Arefi, H. .
XXIII ISPRS CONGRESS, COMMISSION III, 2016, 41 (B3) :833-840
[2]   LIDAR density and linear interpolator effects on elevation estimates [J].
Anderson, ES ;
Thompson, JA ;
Austin, RE .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2005, 26 (18) :3889-3900
[3]  
[Anonymous], HIGH RESOLUT EARTH I
[4]  
[Anonymous], 2018, P EUROPEAN C COMPUTE
[5]  
[Anonymous], 2010, P ISPRS COMM 7 S VIE
[6]  
[Anonymous], 2016, ARXIV160106759
[7]  
[Anonymous], P 3 INT C LEARNING R
[8]  
[Anonymous], PROC CVPR IEEE
[9]  
[Anonymous], 2008, ISPRS J
[10]  
[Anonymous], P GIS OSTR 2013 GEOI