Self-Supervised CycleGAN for Object-Preserving Image-to-Image Domain Adaptation

被引:21
作者
Xie, Xinpeng [1 ]
Chen, Jiawei [2 ]
Li, Yuexiang [2 ]
Shen, Linlin [1 ]
Ma, Kai [2 ]
Zheng, Yefeng [2 ]
机构
[1] Shenzhen Univ, Comp Vis Inst, Shenzhen, Peoples R China
[2] Tencent Jarvis Lab, Shenzhen, Peoples R China
来源
COMPUTER VISION - ECCV 2020, PT XX | 2020年 / 12365卷
关键词
Image-to-image translation; Domain adaptation; Semantic segmentation;
D O I
10.1007/978-3-030-58565-5_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent generative adversarial network (GAN) based methods (e.g., CycleGAN) are prone to fail at preserving image-objects in image-to-image translation, which reduces their practicality on tasks such as domain adaptation. Some frameworks have been proposed to adopt a segmentation network as the auxiliary regularization to prevent the content distortion. However, all of them require extra pixel-wise annotations, which is difficult to fulfill in practical applications. In this paper, we propose a novel GAN (namely OP-GAN) to address the problem, which involves a self-supervised module to enforce the image content consistency during image-to-image translations without any extra annotations. We evaluate the proposed OP-GAN on three publicly available datasets. The experimental results demonstrate that our OP-GAN can yield visually plausible translated images and significantly improve the semantic segmentation accuracy in different domain adaptation scenarios with off-the-shelf deep learning networks such as PSPNet and U-Net.
引用
收藏
页码:498 / 513
页数:16
相关论文
共 38 条
[1]   Segmentation and Recognition Using Structure from Motion Point Clouds [J].
Brostow, Gabriel J. ;
Shotton, Jamie ;
Fauqueur, Julien ;
Cipolla, Roberto .
COMPUTER VISION - ECCV 2008, PT I, PROCEEDINGS, 2008, 5302 :44-+
[2]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[3]   Knowledge-Embedded Routing Network for Scene Graph Generation [J].
Chen, Tianshui ;
Yu, Weihao ;
Chen, Riquan ;
Lin, Liang .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6156-6164
[4]   CartoonGAN: Generative Adversarial Networks for Photo Cartoonization [J].
Chen, Yang ;
Lai, Yu-Kun ;
Liu, Yong-Jin .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9465-9474
[5]   Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification [J].
Deng, Weijian ;
Zheng, Liang ;
Ye, Qixiang ;
Kang, Guoliang ;
Yang, Yi ;
Jiao, Jianbin .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :994-1003
[6]  
Fu H, 2019, PROC CVPR IEEE, P2422, DOI [10.1109/CVPR.2019.00253, 10.1109/cvpr.2019.00253]
[7]   Boosting Few-Shot Visual Learning with Self-Supervision [J].
Gidaris, Spyros ;
Bursuc, Andrei ;
Komodakis, Nikos ;
Perez, Patrick ;
Cord, Matthieu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8058-8067
[8]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[9]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
[10]   AugGAN: Cross Domain Adaptation with GAN-Based Data Augmentation [J].
Huang, Sheng-Wei ;
Lin, Che-Tsung ;
Chen, Shu-Ping ;
Wu, Yen-Yi ;
Hsu, Po-Hao ;
Lai, Shang-Hong .
COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 :731-744