Template-Free Try-On Image Synthesis via Semantic-Guided Optimization

被引：14

作者：

Chou, Chien-Lung ^{[1
]}

Chen, Chieh-Yun ^{[2
]}

Hsieh, Chia-Wei ^{[3
]}

Shuai, Hong-Han ^{[4
]}

Liu, Jiaying ^{[5
]}

Cheng, Wen-Huang ^{[2
,6
]}

机构：

[1] Univ Michigan, Dept Elect & Comp Engn, Ann Arbor, MI 48109 USA

[2] Natl Chiao Tung Univ, Inst Elect, Hsinchu 30010, Taiwan

[3] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA

[4] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 30010, Taiwan

[5] Peking Univ, Wangxuan Inst Comp Technol, Beijing 100871, Peoples R China

[6] Natl Chung Hsing Univ, Artificial Intelligence & Data Sci Program, Taichung 402, Taiwan

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2022年 / 33卷 / 09期

基金：

北京市自然科学基金; 中国国家自然科学基金;

关键词：

Clothing; Semantics; Image segmentation; Feature extraction; Faces; Task analysis; Image synthesis; Cross-modal learning; image synthesis; pose transfer; semantic-guided learning; virtual try-on; RECOGNITION;

D O I：

10.1109/TNNLS.2021.3058379

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The virtual try-on task is so attractive that it has drawn considerable attention in the field of computer vision. However, presenting the 3-D physical characteristic (e.g., pleat and shadow) based on a 2-D image is very challenging. Although there have been several previous studies on 2-D-based virtual try-on work, most: 1) required user-specified target poses that are not user-friendly and may not be the best for the target clothing and 2) failed to address some problematic cases, including facial details, clothing wrinkles, and body occlusions. To address these two challenges, in this article, we propose an innovative template-free try-on image synthesis (TF-TIS) network. The TF-TIS first synthesizes the target pose according to the user-specified in-shop clothing. Afterward, given an in-shop clothing image, a user image, and a synthesized pose, we propose a novel model for synthesizing a human try-on image with the target clothing in the best fitting pose. The qualitative and quantitative experiments both indicate that the proposed TF-TIS outperforms the state-of-the-art methods, especially for difficult cases.

引用

页码：4584 / 4597

页数：14

共 58 条

[1] 2D Human Pose Estimation: New Benchmark and State of the Art Analysis
Andriluka, Mykhaylo
Pishchulin, Leonid
Gehler, Peter
Schiele, Bernt
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3686 - 3693
[2] [Anonymous], 2012, P ACM INT C MULT
[3] Shape matching and object recognition using shape contexts
Belongie, S
Malik, J
Puzicha, J
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) : 509 - 522
[4] OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields
Cao, Zhe
Hidalgo, Gines
Simon, Tomas
Wei, Shih-En
Sheikh, Yaser
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 172 - 186
[5] Learning Aligned Cross-Modal Representations from Weakly Aligned Data
Castrejon, Lluis
Aytar, Yusuf
Vondrick, Carl
Pirsiavash, Hamed
Torralba, Antonio
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2940 - 2949
[6] Improved Bootstrapping for Approximate Homomorphic Encryption
Chen, Hao
Chillotti, Ilaria
Song, Yongsoo
[J]. ADVANCES IN CRYPTOLOGY - EUROCRYPT 2019, PT II, 2019, 11477 : 34 - 54
[7] Instance-Level Human Parsing via Part Grouping Network
Gong, Ke
Liang, Xiaodan
Li, Yicheng
Chen, Yimin
Yang, Ming
Lin, Liang
[J]. COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 : 805 - 822
[8] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
[9] GarNet: A Two-Stream Network for Fast and Accurate 3D Cloth Draping
Gundogdu, Erhan
Constantin, Victor
Seifoddini, Amrollah
Dang, Minh
Salzmann, Mathieu
Fua, Pascal
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8738 - 8747
[10] Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification
Guo, Jianyuan
Yuan, Yuhui
Huang, Lang
Zhang, Chao
Yao, Jin-Ge
Han, Kai
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3641 - 3650

← 1 2 3 4 5 6 →