Single Stage Virtual Try-On Via Deformable Attention Flows

被引：47

作者：

Bai, Shuai ^{[1
]}

Zhou, Huiling ^{[1
]}

Li, Zhikang ^{[1
]}

Zhou, Chang ^{[1
]}

Yang, Hongxia ^{[1
]}

机构：

[1] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China

来源：

COMPUTER VISION - ECCV 2022, PT XV | 2022年 / 13675卷

关键词：

Virtual try-on; Single stage; Deformable attention flows;

D O I：

10.1007/978-3-031-19784-0_24

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Virtual try-on aims to generate a photo-realistic fitting result given an in-shop garment and a reference person image. Existing methods usually build up multi-stage frameworks to deal with clothes warping and body blending respectively, or rely heavily on intermediate parser-based labels which may be noisy or even inaccurate. To solve the above challenges, we propose a single-stage try-on framework by developing a novel Deformable Attention Flow (DAFlow), which applies the deformable attention scheme to multi-flow estimation. With pose keypoints as the guidance only, the self- and cross-deformable attention flows are estimated for the reference person and the garment images, respectively. By sampling multiple flow fields, the feature-level and pixel-level information from different semantic areas is simultaneously extracted and merged through the attention mechanism. It enables clothes warping and body synthesizing at the same time which leads to photo-realistic results in an end-to-end manner. Extensive experiments on two try-on datasets demonstrate that our proposed method achieves state-of-the-art performance both qualitatively and quantitatively. Furthermore, additional experiments on the other two image editing tasks illustrate the versatility of our method for multi-view synthesis and image animation. Code will be made available at https://github.com/OFA-Sys/DAFlow.

引用

页码：409 / 425

页数：17

共 48 条

[1]

[Anonymous], 1977, Constructive Theory of Functions of Several Variables

[2] CLOTH3D: Clothed 3D Humans [J].

Bertiche, Hugo ;

Madadi, Meysam ;

Escalera, Sergio .

COMPUTER VISION - ECCV 2020, PT XX, 2020, 12365 :344-359

[3] Multi-Garment Net: Learning to Dress 3D People from Images [J].

Bhatnagar, Bharat Lal ;

Tiwari, Garvita ;

Theobalt, Christian ;

Pons-Moll, Gerard .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :5419-5429

[4] VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization [J].

Choi, Seunghwan ;

Park, Sunghyun ;

Lee, Minsoo ;

Choo, Jaegul .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14126-14135

[5] ZFlow: Gated Appearance Flow-based Virtual Try-on with 3D Priors [J].

Chopra, Ayush ;

Jain, Rishabh ;

Hemani, Mayur ;

Krishnamurthy, Balaji .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :5413-5422

[6] Towards Multi-pose Guided Virtual Try-on Network [J].

Dong, Haoye ;

Liang, Xiaodan ;

Shen, Xiaohui ;

Wang, Bochao ;

Lai, Hanjiang ;

Zhu, Jia ;

Hu, Zhiting ;

Yin, Jian .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9025-9034

[7] Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network [J].

Feng, Yao ;

Wu, Fan ;

Shao, Xiaohu ;

Wang, Yanfeng ;

Zhou, Xi .

COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :557-574

[8] Disentangled Cycle Consistency for Highly-realistic Virtual Try-On [J].

Ge, Chongjian ;

Song, Yibing ;

Ge, Yuying ;

Yang, Han ;

Liu, Wei ;

Luo, Ping .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16923-16932

[9] Parser-Free Virtual Try-on via Distilling Appearance Flows [J].

Ge, Yuying ;

Song, Yibing ;

Zhang, Ruimao ;

Ge, Chongjian ;

Liu, Wei ;

Luo, Ping .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :8481-8489

[10] Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing [J].

Gong, Ke ;

Liang, Xiaodan ;

Zhang, Dongyu ;

Shen, Xiaohui ;

Lin, Liang .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6757-6765

← 1 2 3 4 5 →