Limb-Aware Virtual Try-On Network With Progressive Clothing Warping

Cited by: 3

Authors:

Zhang, Shengping [1]

Han, Xiaoyu [1]

Zhang, Weigang [1]

Lan, Xiangyuan [2]

Yao, Hongxun [3]

Huang, Qingming [4]

Affiliations:

[1] Harbin Inst Technol, Weihai 264209, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China

[3] Harbin Inst Technol, Harbin 150001, Peoples R China

[4] Univ Chinese Acad Sci, Beijing 100190, Peoples R China

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2024 / Vol. 26

Funding:

National Natural Science Foundation of China;

Keywords:

Virtual try-on; image synthesis; appearance flow; STYLE; DRESS;

DOI:

10.1109/TMM.2023.3286278

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Image-based virtual try-on aims to transfer an in-shop clothing image onto a person image. Most existing methods warp the clothing directly with a single global deformation, which lacks fine-grained modeling of the in-shop clothing and leads to a distorted clothing appearance. In addition, existing methods usually fail to generate limb details well because they rely on a clothing-agnostic person representation that does not refer to the limb textures of the person image. To address these problems, we propose a Limb-aware Virtual Try-on Network, named PL-VTON, which performs fine-grained clothing warping progressively and generates high-quality try-on results with realistic limb details. Specifically, we present Progressive Clothing Warping (PCW), which explicitly models the location and size of the in-shop clothing and uses a two-stage alignment strategy to progressively align the in-shop clothing with the human body. Moreover, a novel gravity-aware loss that considers the fit of the clothing on the person is adopted to better handle the clothing edges. Then, we design a Person Parsing Estimator (PPE) with a non-limb target parsing map to semantically divide the person into various regions, which provides structural constraints on the human body and therefore alleviates texture bleeding between clothing and body regions. Finally, we introduce Limb-aware Texture Fusion (LTF), which focuses on generating realistic details in limb regions: a coarse try-on result is first generated by fusing the warped clothing image with the person image, and limb textures are then fused with the coarse result under limb-aware guidance to refine limb details. Extensive experiments demonstrate that PL-VTON outperforms state-of-the-art methods both qualitatively and quantitatively.
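The coarse-then-refine ordering described for LTF can be illustrated with a toy sketch. This is not the authors' implementation: the paper uses learned networks, whereas the sketch below assumes fixed, hand-written masks and plain per-pixel blending. The names `blend` and `two_step_fusion`, and all mask and image values, are hypothetical.

```python
def blend(mask, foreground, background):
    """Per-pixel convex combination: mask * foreground + (1 - mask) * background."""
    return [
        [m * f + (1.0 - m) * b for m, f, b in zip(mask_row, fg_row, bg_row)]
        for mask_row, fg_row, bg_row in zip(mask, foreground, background)
    ]


def two_step_fusion(person, warped_clothing, clothing_mask, limb_texture, limb_mask):
    """Fuse clothing into the person image first, then refine limb pixels.

    Assumption: masks are given as values in [0, 1]; in the paper the
    guidance is produced by the model, not supplied by hand.
    """
    # Step 1: coarse result from the warped clothing and the person image.
    coarse = blend(clothing_mask, warped_clothing, person)
    # Step 2: limb textures are fused on top of the coarse result.
    refined = blend(limb_mask, limb_texture, coarse)
    return coarse, refined


# Toy 1x3 grayscale "images" (hypothetical values).
person = [[0.2, 0.2, 0.2]]
warped_clothing = [[0.9, 0.9, 0.9]]
clothing_mask = [[1.0, 1.0, 0.0]]   # clothing covers the first two pixels
limb_texture = [[0.5, 0.5, 0.5]]
limb_mask = [[0.0, 1.0, 0.0]]       # an arm occludes the middle pixel

coarse, refined = two_step_fusion(
    person, warped_clothing, clothing_mask, limb_texture, limb_mask
)
print(coarse)   # [[0.9, 0.9, 0.2]]
print(refined)  # [[0.9, 0.5, 0.2]]
```

In this toy case the middle pixel is overwritten by clothing in the coarse result and restored to the limb texture in the refined result, which is the kind of limb detail the second fusion step is meant to recover.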

Pages: 1731-1746

Page count: 16

