Self-Supervised CycleGAN for Object-Preserving Image-to-Image Domain Adaptation

Cited by: 21

Authors:

Xie, Xinpeng [1]

Chen, Jiawei [2]

Li, Yuexiang [2]

Shen, Linlin [1]

Ma, Kai [2]

Zheng, Yefeng [2]

Affiliations:

[1] Shenzhen Univ, Comp Vis Inst, Shenzhen, Peoples R China

[2] Tencent Jarvis Lab, Shenzhen, Peoples R China

Source:

COMPUTER VISION - ECCV 2020, PT XX | 2020 / Vol. 12365

Keywords:

Image-to-image translation; Domain adaptation; Semantic segmentation

DOI:

10.1007/978-3-030-58565-5_30

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Recent generative adversarial network (GAN) based methods (e.g., CycleGAN) are prone to fail at preserving image objects in image-to-image translation, which reduces their practicality on tasks such as domain adaptation. Some frameworks have been proposed that adopt a segmentation network as auxiliary regularization to prevent content distortion. However, all of them require extra pixel-wise annotations, which are difficult to obtain in practical applications. In this paper, we propose a novel GAN (namely OP-GAN) to address the problem, which involves a self-supervised module to enforce image content consistency during image-to-image translation without any extra annotations. We evaluate the proposed OP-GAN on three publicly available datasets. The experimental results demonstrate that our OP-GAN can yield visually plausible translated images and significantly improve the semantic segmentation accuracy in different domain adaptation scenarios with off-the-shelf deep learning networks such as PSPNet and U-Net.


Pages: 498-513

Page count: 16

