Self-Ensembling with GAN-based Data Augmentation for Domain Adaptation in Semantic Segmentation

被引:148
作者
Choi, Jaehoon [1 ]
Kim, Taekyung [1 ]
Kim, Changick [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/ICCV.2019.00693
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning-based semantic segmentation methods have an intrinsic limitation that training a model requires a large amount of data with pixel-level annotations. To address this challenging issue, many researchers give attention to unsupervised domain adaptation for semantic segmentation. Unsupervised domain adaptation seeks to adapt the model trained on the source domain to the target domain. In this paper, we introduce a self-ensembling technique, one of the successful methods for domain adaptation in classification. However, applying self-ensembling to semantic segmentation is very difficult because heavily-tuned manual data augmentation used in self-ensembling is not useful to reduce the large domain gap in the semantic segmentation. To overcome this limitation, we propose a novel framework consisting of two components, which are complementary to each other. First, we present a data augmentation method based on Generative Adversarial Networks (GANs), which is computationally efficient and effective to facilitate domain alignment. Given those augmented images, we apply self-ensembling to enhance the performance of the segmentation network on the target domain. The proposed method outperforms state-of-the-art semantic segmentation methods on unsupervised domain adaptation benchmarks.
引用
收藏
页码:6829 / 6839
页数:11
相关论文
共 57 条
  • [1] Almahairi Amjad, 2018, INT C MACHINE LEARNI, P195, DOI DOI 10.48550/ARXIV.1802.10151
  • [2] [Anonymous], ARXIV181100174
  • [3] [Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.265
  • [4] Athiwaratkun B., 2018, There are many consistent explanations of unlabeled data: Why you should average
  • [5] A theory of learning from different domains
    Ben-David, Shai
    Blitzer, John
    Crammer, Koby
    Kulesza, Alex
    Pereira, Fernando
    Vaughan, Jennifer Wortman
    [J]. MACHINE LEARNING, 2010, 79 (1-2) : 151 - 175
  • [6] Chen H, 2016, P AMER CONTR CONF, P2223, DOI 10.1109/ACC.2016.7525248
  • [7] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [8] ROAD: Reality Oriented Adaptation for Semantic Segmentation of Urban Scenes
    Chen, Yuhua
    Li, Wen
    Van Gool, Luc
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7892 - 7901
  • [9] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [10] Inference of gene regulations between multiple activators/inhibitors
    Deng, Xiaozheng
    Chen, Yu
    [J]. 2018 NINTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME 2018), 2018, : 192 - 198