SceneAdapt: Scene-based domain adaptation for semantic segmentation using adversarial learning

被引:13
作者
Di Mauro, Daniele [1 ,2 ]
Furnari, Antonino [1 ]
Patane, Giuseppe [2 ]
Battiato, Sebastiano [1 ]
Farinella, Giovanni Maria [1 ]
机构
[1] Univ Catania, Dept Math & Comp Sci, Catania, Italy
[2] Pk Smart Srl, Catania, Italy
关键词
Semantic segmentation; Domain adaptation; Scene adaptation; Adversarial learning;
D O I
10.1016/j.patrec.2020.06.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation methods have achieved outstanding performance thanks to deep learning. Nevertheless, when such algorithms are deployed to new contexts not seen during training, it is necessary to collect and label scene-specific data in order to adapt them to the new domain using fine-tuning. This process is required whenever an already installed camera is moved or a new camera is introduced in a camera network due to the different scene layouts induced by the different viewpoints. To limit the amount of additional training data to be collected, it would be ideal to train a semantic segmentation method using labeled data already available and only unlabeled data coming from the new camera. We formalize this problem as a domain adaptation task and introduce a novel dataset of urban scenes with the related semantic labels. As a first approach to address this challenging task, we propose SceneAdapt, a method for scene adaptation of semantic segmentation algorithms based on adversarial learning. Experiments and comparisons with state-of-the-art approaches to domain adaptation highlight that promising performance can be achieved using adversarial learning both when the two scenes have different but points of view, and when they comprise images of completely different scenes. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:175 / 182
页数:8
相关论文
共 26 条
[1]  
Badrinarayanan V., 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence, DOI DOI 10.1109/TPAMI.2016.2644615
[2]   On-board monitoring system for road traffic safety analysis [J].
Battiato, Sebastiano ;
Farinella, Giovanni Maria ;
Gallo, Giovanni ;
Giudice, Oliver .
COMPUTERS IN INDUSTRY, 2018, 98 :208-217
[3]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[4]  
Di Mauro D., 2018, 15 IEEE INT C ADV VI
[5]  
Dosovitskiy Alexey, 2017, P 1 ANN C ROB LEARN, DOI DOI 10.48550/ARXIV.1711.03938
[6]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[7]  
He K, 2016, PROC CVPR IEEE, P770, DOI [10.1109/CVPR.2016.90, DOI 10.1109/CVPR.2016.90]
[8]  
Hoffman J., 2018, P 35 INT C MACH LEAR, P1994, DOI DOI 10.48550/ARXIV.1711.03213
[9]  
Hoffmann Johannes, 2016, 2016 Conference on Precision Electromagnetic Measurements (CPEM), P1, DOI 10.1109/CPEM.2016.7540615
[10]  
Isola P., 2017, P IEEE C COMP VIS PA, P1125, DOI [10.1109/CVPR.2017.632, DOI 10.1109/CVPR.2017.632]