VTNFP: An Image-based Virtual Try-on Network with Body and Clothing Feature Preservation

被引:117
作者
Yu, Ruiyun [1 ]
Wang, Xiaoqi [1 ]
Xie, Xiaohui [2 ]
机构
[1] Northeastern Univ, Software Coll, Shenyang, Peoples R China
[2] Univ Calif Irvine, Dept Comp Sci, Irvine, CA 92617 USA
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV.2019.01061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
dImage-based virtual try-on systems with the goal of transferring a desired clothing item onto the corresponding region of a person have made great strides recently, but challenges remain in generating realistic looking images that preserve both body and clothing details. Here we present a new virtual try-on network, called VTNFP, to synthesize photo-realistic images given the images of a clothed person and a target clothing item. In order to better preserve clothing and body features, VTNFP follows a three-stage design strategy. First, it transforms the target clothing into a warped form compatible with the pose of the given person. Next, it predicts a body segmentation map of the person wearing the target clothing, delineating body parts as well as clothing regions. Finally, the warped clothing, body segmentation map and given person image are fused together for fine-scale image synthesis. A key innovation of VTNFP is the body segmentation map prediction module, which provides critical information to guide image synthesis in regions where body parts and clothing intersects, and is very beneficial for preventing blurry pictures and preserving clothing and body part details. Experiments on a fashion dataset demonstrate that VTNFP generates substantially better results than state-of-the-art methods.
引用
收藏
页码:10510 / 10519
页数:10
相关论文
共 50 条
[1]  
[Anonymous], 2017, arXiv preprint arXiv:1705.01908
[2]  
[Anonymous], 2017, CVPR 2017 IEEE C COM
[3]   Abnormal behavior recognition for intelligent video surveillance systems: A review [J].
Ben Mabrouk, Amira ;
Zagrouba, Ezzeddine .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 91 :480-491
[4]  
Berthelot D., 2017, ARXIV170310717
[5]  
Borghi G., 2017, P 28 IEEE INT VEH S
[6]   Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields [J].
Cao, Zhe ;
Simon, Tomas ;
Wei, Shih-En ;
Sheikh, Yaser .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1302-1310
[7]   Fast Keyframe Selection and Switching for ICP-based Camera Pose Estimation [J].
Chen, Chun-Wei ;
Hsiao, Wen-Yuan ;
Lin, Ting-Yu ;
Wang, Jonas ;
Shieh, Ming-Der .
2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
[8]   Synthesizing Training Images for Boosting Human 3D Pose Estimation [J].
Chen, Wenzheng ;
Wang, Huan ;
Li, Yangyan ;
Su, Hao ;
Wang, Zhenhua ;
Tu, Changhe ;
Lischinski, Dani ;
Cohen-Or, Daniel ;
Chen, Baoquan .
PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, :479-488
[9]  
CHOLLET F, 2017, PROC CVPR IEEE, P1800, DOI DOI 10.1109/CVPR.2017.195
[10]  
Denton E. L., 2015, Deep Generative Image Models using a obj Laplacian Pyramid of Adversarial Networks, P1486