SceneAdapt: Scene-based domain adaptation for semantic segmentation using adversarial learning

Cited by: 12
Authors
Di Mauro, Daniele [1, 2]
Furnari, Antonino [1 ]
Patane, Giuseppe [2 ]
Battiato, Sebastiano [1 ]
Farinella, Giovanni Maria [1 ]
Affiliations
[1] Univ Catania, Dept Math & Comp Sci, Catania, Italy
[2] Pk Smart Srl, Catania, Italy
Keywords
Semantic segmentation; Domain adaptation; Scene adaptation; Adversarial learning
DOI
10.1016/j.patrec.2020.06.002
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Semantic segmentation methods have achieved outstanding performance thanks to deep learning. Nevertheless, when such algorithms are deployed to new contexts not seen during training, it is necessary to collect and label scene-specific data in order to adapt them to the new domain using fine-tuning. This process is required whenever an already installed camera is moved or a new camera is introduced in a camera network, because the different viewpoints induce different scene layouts. To limit the amount of additional training data to be collected, it would be ideal to train a semantic segmentation method using the labeled data already available and only unlabeled data coming from the new camera. We formalize this problem as a domain adaptation task and introduce a novel dataset of urban scenes with the related semantic labels. As a first approach to address this challenging task, we propose SceneAdapt, a method for scene adaptation of semantic segmentation algorithms based on adversarial learning. Experiments and comparisons with state-of-the-art approaches to domain adaptation highlight that promising performance can be achieved using adversarial learning both when the two scenes have different points of view, and when they comprise images of completely different scenes. (C) 2020 Elsevier B.V. All rights reserved.
Pages: 175-182
Page count: 8