Toward Detail-Oriented Image-Based Virtual Try-On with Arbitrary Poses

Cited by: 2
Authors
Chang, Yuan [1]
Peng, Tao [1]
He, Ruhan [1]
Hu, Xinrong [1]
Liu, Junping [1]
Zhang, Zili [1]
Jiang, Minghua [1]
Affiliations
[1] Wuhan Text Univ, Engn Res Ctr Hubei Prov Clothing Informat, Wuhan 430200, Peoples R China
Source
MULTIMEDIA MODELING (MMM 2022), PT I | 2022 / Vol. 13141
Keywords
Virtual Try-On; Arbitrary poses; Spatial alignment; Dilated convolution
DOI
10.1007/978-3-030-98358-1_7
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Image-based virtual try-on with arbitrary poses has attracted considerable attention recently. The goal of this study is to synthesize an image of a reference person wearing target clothes in a target pose. However, existing methods still struggle to preserve clothing details and person identity while generating fine-grained try-on images. To resolve these issues, we present a new detail-oriented virtual try-on network with arbitrary poses (DO-VTON). Specifically, DO-VTON consists of three major modules. First, a semantic prediction module adopts a two-stage strategy to gradually predict a semantic map of the reference person. Second, a spatial alignment module warps the target clothes and non-target details to align with the target pose. Third, a try-on synthesis module generates the final try-on images. Moreover, to generate high-quality images, we introduce a new multi-scale dilated convolution U-Net to enlarge the receptive field and capture context information. Extensive experiments on two widely used benchmark datasets demonstrate that our system achieves state-of-the-art virtual try-on performance both qualitatively and quantitatively.
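The abstract's claim that dilated convolution enlarges the receptive field can be illustrated with a short calculation. The sketch below is not taken from the paper: the kernel size and dilation rates are hypothetical values chosen for illustration, and the function names are made up here. It relies only on the standard relation that a kernel of size k with dilation d spans k + (k - 1)(d - 1) input positions, and it assumes stride-1 layers.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Span (in input positions) covered by one dilated kernel."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def stacked_receptive_field(kernel_size: int, dilations: list[int]) -> int:
    """Receptive field of a stack of stride-1 convolutions.

    Each stride-1 layer widens the receptive field by
    (effective kernel size - 1) positions.
    """
    receptive_field = 1
    for dilation in dilations:
        receptive_field += effective_kernel_size(kernel_size, dilation) - 1
    return receptive_field


# Hypothetical configurations (not from the paper): three 3x3 layers.
plain = stacked_receptive_field(3, [1, 1, 1])    # ordinary convolutions
dilated = stacked_receptive_field(3, [1, 2, 4])  # increasing dilation rates
print(f"plain: {plain}, dilated: {dilated}")
```

With the same number of layers and parameters, the dilated stack covers roughly twice as many input positions per side, which is the sense in which dilation helps capture wider context.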
Pages: 82-94
Page count: 13