Single Stage Virtual Try-On Via Deformable Attention Flows

被引:47
作者
Bai, Shuai [1 ]
Zhou, Huiling [1 ]
Li, Zhikang [1 ]
Zhou, Chang [1 ]
Yang, Hongxia [1 ]
机构
[1] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China
来源
COMPUTER VISION - ECCV 2022, PT XV | 2022年 / 13675卷
关键词
Virtual try-on; Single stage; Deformable attention flows;
D O I
10.1007/978-3-031-19784-0_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Virtual try-on aims to generate a photo-realistic fitting result given an in-shop garment and a reference person image. Existing methods usually build up multi-stage frameworks to deal with clothes warping and body blending respectively, or rely heavily on intermediate parser-based labels which may be noisy or even inaccurate. To solve the above challenges, we propose a single-stage try-on framework by developing a novel Deformable Attention Flow (DAFlow), which applies the deformable attention scheme to multi-flow estimation. With pose keypoints as the guidance only, the self- and cross-deformable attention flows are estimated for the reference person and the garment images, respectively. By sampling multiple flow fields, the feature-level and pixel-level information from different semantic areas is simultaneously extracted and merged through the attention mechanism. It enables clothes warping and body synthesizing at the same time which leads to photo-realistic results in an end-to-end manner. Extensive experiments on two try-on datasets demonstrate that our proposed method achieves state-of-the-art performance both qualitatively and quantitatively. Furthermore, additional experiments on the other two image editing tasks illustrate the versatility of our method for multi-view synthesis and image animation. Code will be made available at https://github.com/OFA-Sys/DAFlow.
引用
收藏
页码:409 / 425
页数:17
相关论文
共 48 条
[1]  
[Anonymous], 1977, Constructive Theory of Functions of Several Variables
[2]   CLOTH3D: Clothed 3D Humans [J].
Bertiche, Hugo ;
Madadi, Meysam ;
Escalera, Sergio .
COMPUTER VISION - ECCV 2020, PT XX, 2020, 12365 :344-359
[3]   Multi-Garment Net: Learning to Dress 3D People from Images [J].
Bhatnagar, Bharat Lal ;
Tiwari, Garvita ;
Theobalt, Christian ;
Pons-Moll, Gerard .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5419-5429
[4]   VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization [J].
Choi, Seunghwan ;
Park, Sunghyun ;
Lee, Minsoo ;
Choo, Jaegul .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14126-14135
[5]   ZFlow: Gated Appearance Flow-based Virtual Try-on with 3D Priors [J].
Chopra, Ayush ;
Jain, Rishabh ;
Hemani, Mayur ;
Krishnamurthy, Balaji .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :5413-5422
[6]   Towards Multi-pose Guided Virtual Try-on Network [J].
Dong, Haoye ;
Liang, Xiaodan ;
Shen, Xiaohui ;
Wang, Bochao ;
Lai, Hanjiang ;
Zhu, Jia ;
Hu, Zhiting ;
Yin, Jian .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9025-9034
[7]   Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network [J].
Feng, Yao ;
Wu, Fan ;
Shao, Xiaohu ;
Wang, Yanfeng ;
Zhou, Xi .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :557-574
[8]   Disentangled Cycle Consistency for Highly-realistic Virtual Try-On [J].
Ge, Chongjian ;
Song, Yibing ;
Ge, Yuying ;
Yang, Han ;
Liu, Wei ;
Luo, Ping .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16923-16932
[9]   Parser-Free Virtual Try-on via Distilling Appearance Flows [J].
Ge, Yuying ;
Song, Yibing ;
Zhang, Ruimao ;
Ge, Chongjian ;
Liu, Wei ;
Luo, Ping .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :8481-8489
[10]   Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing [J].
Gong, Ke ;
Liang, Xiaodan ;
Zhang, Dongyu ;
Shen, Xiaohui ;
Lin, Liang .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6757-6765