A Two-Stage Personalized Virtual Try-On Framework With Shape Control and Texture Guidance

Cited by: 1

Authors:

Zhang, Shufang [1]

Ni, Minxue [1]

Chen, Shuai [2]

Wang, Lei [1]

Ding, Wenxin [1]

Liu, Yuhong [3]

Affiliations:

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Ocean Univ China, Coll Ocean & Atmospher Sci, Qingdao 260000, Peoples R China

[3] Santa Clara Univ, Dept Comp Sci & Engn, Santa Clara, CA 95053 USA

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2024 / Vol. 26

Keywords:

Clothing; Shape; Noise; Semantics; Shape control; Electronic mail; Context modeling; Human generation; image manipulation; virtual try-on;

DOI:

10.1109/TMM.2024.3405718

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Diffusion models have a strong ability to generate images in the wild. However, guided by text alone, such models tend to generate inaccurate images, which makes it very challenging to apply a text-guided generative model directly to virtual try-on scenarios. Taking images as the guiding conditions of the diffusion model, this paper proposes a new personalized virtual try-on model (PE-VITON), which uses two stages (shape control and texture guidance) to decouple clothing attributes. Specifically, the Shape Control Module (SCM) adaptively matches the clothing to human body parts to mitigate misalignment between the clothing and the body. The Texture Guided Module (TGM) parses the semantic information of the input clothing and generates the corresponding texture through directional guidance. The model thereby addresses several problems of traditional try-on methods: poor restoration of clothing folds, poor generation under complex human poses, blurred clothing edges, and unclear texture styles. The model can also automatically enhance the generated clothing folds and textures according to the human pose, improving the realism of the virtual try-on. Qualitative and quantitative experiments on high-resolution paired and unpaired datasets show that the proposed model outperforms state-of-the-art models.


Pages: 10225-10236

Page count: 12
