Disentangled Cycle Consistency for Highly-realistic Virtual Try-On

被引：70

作者：

Ge, Chongjian ^{[1
]}

Song, Yibing ^{[2
]}

Ge, Yuying ^{[1
]}

Yang, Han ^{[3
]}

Liu, Wei ^{[4
]}

Luo, Ping ^{[1
]}

机构：

[1] Univ Hong Kong, Hong Kong, Peoples R China

[2] Tencent AI Lab, Bellevue, WA 98004 USA

[3] Swiss Fed Inst Technol, Zurich, Switzerland

[4] Tencent Data Platform, Bellevue, WA USA

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

关键词：

D O I：

10.1109/CVPR46437.2021.01665

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Image virtual try-on replaces the clothes on a person image with a desired in-shop clothes image. It is challenging because the person and the in-shop clothes are unpaired. Existing methods formulate virtual try-on as either in-painting or cycle consistency. Both of these two formulations encourage the generation networks to reconstruct the input image in a self-supervised manner. However, existing methods do not differentiate clothing and non-clothing regions. A straightforward generation impedes the virtual try-on quality because of the heavily coupled image contents. In this paper, we propose a Disentangled Cycle-consistency Try-On Network (DCTON). The DCTON is able to produce highly-realistic try-on images by disentangling important components of virtual try-on including clothes warping, skin synthesis, and image composition. Moreover, DCTON can be naturally trained in a self-supervised manner following cycle consistency learning. Extensive experiments on challenging benchmarks show that DCTON outperforms state-of-the-art approaches favorably.

引用

页码：16923 / 16932

页数：10

共 48 条

[1]

[Anonymous], 1977, CONSTRUCTIVE THEORY

[2] ComboGAN: Unrestrained Scalability for Image Domain Translation [J].

Anoosheh, Asha ;

Agustsson, Eirikur ;

Timofte, Radu ;

Van Gool, Luc .

PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :896-903

[3] Everybody Dance Now [J].

Chan, Caroline ;

Ginosar, Shiry ;

Zhou, Tinghui ;

Efros, Alexei A. .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5932-5941

[4] PairedCycleGAN: Asymmetric Style Transfer for Applying and Removing Makeup [J].

Chang, Huiwen ;

Lu, Jingwan ;

Yu, Fisher ;

Finkelstein, Adam .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :40-48

[5] Deep Photo Enhancer: Unpaired Learning for Image Enhancement from Photographs with GANs [J].

Chen, Yu-Sheng ;

Wang, Yu-Ching ;

Kao, Man-Hsin ;

Chuang, Yung-Yu .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6306-6314

[6] ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data [J].

Diakogiannis, Foivos, I ;

Waldner, Francois ;

Caccetta, Peter ;

Wu, Chen .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 162 :94-114

[7] Towards Multi-pose Guided Virtual Try-on Network [J].

Dong, Haoye ;

Liang, Xiaodan ;

Shen, Xiaohui ;

Wang, Bochao ;

Lai, Hanjiang ;

Zhu, Jia ;

Hu, Zhiting ;

Yin, Jian .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9025-9034

[8] THE FRECHET DISTANCE BETWEEN MULTIVARIATE NORMAL-DISTRIBUTIONS [J].

DOWSON, DC ;

LANDAU, BV .

JOURNAL OF MULTIVARIATE ANALYSIS, 1982, 12 (03) :450-455

[9]

Ehara Jun, 2006, 2006 IEEE/ACM International Symposium on Mixed and Augmented Reality, P139, DOI 10.1109/ISMAR.2006.297805

[10] SINGULAR VALUE DECOMPOSITION AND LEAST SQUARES SOLUTIONS [J].

GOLUB, GH ;

REINSCH, C .

NUMERISCHE MATHEMATIK, 1970, 14 (05) :403-&

← 1 2 3 4 5 →