Disentangled Cycle Consistency for Highly-realistic Virtual Try-On

被引:70
作者
Ge, Chongjian [1 ]
Song, Yibing [2 ]
Ge, Yuying [1 ]
Yang, Han [3 ]
Liu, Wei [4 ]
Luo, Ping [1 ]
机构
[1] Univ Hong Kong, Hong Kong, Peoples R China
[2] Tencent AI Lab, Bellevue, WA 98004 USA
[3] Swiss Fed Inst Technol, Zurich, Switzerland
[4] Tencent Data Platform, Bellevue, WA USA
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
D O I
10.1109/CVPR46437.2021.01665
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image virtual try-on replaces the clothes on a person image with a desired in-shop clothes image. It is challenging because the person and the in-shop clothes are unpaired. Existing methods formulate virtual try-on as either in-painting or cycle consistency. Both of these two formulations encourage the generation networks to reconstruct the input image in a self-supervised manner. However, existing methods do not differentiate clothing and non-clothing regions. A straightforward generation impedes the virtual try-on quality because of the heavily coupled image contents. In this paper, we propose a Disentangled Cycle-consistency Try-On Network (DCTON). The DCTON is able to produce highly-realistic try-on images by disentangling important components of virtual try-on including clothes warping, skin synthesis, and image composition. Moreover, DCTON can be naturally trained in a self-supervised manner following cycle consistency learning. Extensive experiments on challenging benchmarks show that DCTON outperforms state-of-the-art approaches favorably.
引用
收藏
页码:16923 / 16932
页数:10
相关论文
共 48 条
[1]  
[Anonymous], 1977, CONSTRUCTIVE THEORY
[2]   ComboGAN: Unrestrained Scalability for Image Domain Translation [J].
Anoosheh, Asha ;
Agustsson, Eirikur ;
Timofte, Radu ;
Van Gool, Luc .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :896-903
[3]   Everybody Dance Now [J].
Chan, Caroline ;
Ginosar, Shiry ;
Zhou, Tinghui ;
Efros, Alexei A. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5932-5941
[4]   PairedCycleGAN: Asymmetric Style Transfer for Applying and Removing Makeup [J].
Chang, Huiwen ;
Lu, Jingwan ;
Yu, Fisher ;
Finkelstein, Adam .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :40-48
[5]   Deep Photo Enhancer: Unpaired Learning for Image Enhancement from Photographs with GANs [J].
Chen, Yu-Sheng ;
Wang, Yu-Ching ;
Kao, Man-Hsin ;
Chuang, Yung-Yu .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6306-6314
[6]   ResUNet-a: A deep learning framework for semantic segmentation of remotely sensed data [J].
Diakogiannis, Foivos, I ;
Waldner, Francois ;
Caccetta, Peter ;
Wu, Chen .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 162 :94-114
[7]   Towards Multi-pose Guided Virtual Try-on Network [J].
Dong, Haoye ;
Liang, Xiaodan ;
Shen, Xiaohui ;
Wang, Bochao ;
Lai, Hanjiang ;
Zhu, Jia ;
Hu, Zhiting ;
Yin, Jian .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9025-9034
[8]   THE FRECHET DISTANCE BETWEEN MULTIVARIATE NORMAL-DISTRIBUTIONS [J].
DOWSON, DC ;
LANDAU, BV .
JOURNAL OF MULTIVARIATE ANALYSIS, 1982, 12 (03) :450-455
[9]  
Ehara Jun, 2006, 2006 IEEE/ACM International Symposium on Mixed and Augmented Reality, P139, DOI 10.1109/ISMAR.2006.297805
[10]   SINGULAR VALUE DECOMPOSITION AND LEAST SQUARES SOLUTIONS [J].
GOLUB, GH ;
REINSCH, C .
NUMERISCHE MATHEMATIK, 1970, 14 (05) :403-&