Shape Controllable Virtual Try-on for Underwear Models

被引:6
作者
Gao, Xin [1 ]
Liu, Zhenjiang [1 ]
Feng, Zunlei [2 ]
Shen, Chengji [2 ]
Ou, Kairi [1 ]
Tang, Haihong [1 ]
Song, Mingli [2 ]
机构
[1] Alibaba Grp, Hangzhou, Peoples R China
[2] Zhejiang Univ, Hangzhou, Peoples R China
来源
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年
基金
中国国家自然科学基金;
关键词
Virtual try-on; Graph attention networks; Image warping;
D O I
10.1145/3474085.3475210
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image virtual try-on task has abundant applications and has become a hot research topic recently. Existing 2D image-based virtual try-on methods aim to transfer a target clothing image onto a reference person, which has two main disadvantages: cannot control the size and length precisely; unable to accurately estimate the user's figure in the case of users wearing thick clothing, resulting in inaccurate dressing effect. In this paper, we put forward an akin task that aims to dress clothing for underwear models. To solve the above drawbacks, we propose a Shape Controllable Virtual Try-On Network (SC-VTON), where a graph attention network integrates the information of model and clothing to generate the warped clothing image. In addition, the control points are incorporated into SC-VTON for the desired clothing shape. Furthermore, by adding a Splitting Network and a Synthesis Network, we can use in-shop clothing/model pair data to help optimize the deformation module and generalize the task to the typical virtual try-on task. Extensive experiments show that the proposed method can achieve accurate shape control. Meanwhile, compared with other methods, our method can generate high-resolution results with detailed textures, which can be applied in real applications.
引用
收藏
页码:563 / 572
页数:10
相关论文
共 54 条
[1]   Learning to Reconstruct People in Clothing from a Single RGB Camera [J].
Alldieck, Thiemo ;
Magnor, Marcus ;
Bhatnagar, Bharat Lal ;
Theobalt, Christian ;
Pons-Moll, Gerard .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1175-1186
[2]   Multi-Garment Net: Learning to Dress 3D People from Images [J].
Bhatnagar, Bharat Lal ;
Tiwari, Garvita ;
Theobalt, Christian ;
Pons-Moll, Gerard .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5419-5429
[3]  
Chaudhuri B, 2021, Arxiv, DOI arXiv:2103.17266
[4]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[5]  
Cheng HQ, 2020, PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), P987, DOI [10.1109/ITNEC48623.2020.9085163, 10.1109/itnec48623.2020.9085163]
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]   Towards Multi-pose Guided Virtual Try-on Network [J].
Dong, Haoye ;
Liang, Xiaodan ;
Shen, Xiaohui ;
Wang, Bochao ;
Lai, Hanjiang ;
Zhu, Jia ;
Hu, Zhiting ;
Yin, Jian .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9025-9034
[8]  
Ge YY, 2021, Arxiv, DOI arXiv:2103.04559
[9]   HoloPose: Holistic 3D Human Reconstruction In-The-Wild [J].
Guler, Riza Alp ;
Kokkinos, Iasonas .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10876-10886
[10]   DensePose: Dense Human Pose Estimation In The Wild [J].
Guler, Riza Alp ;
Neverova, Natalia ;
Kokkinos, Lasonas .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7297-7306