Template-Free Try-On Image Synthesis via Semantic-Guided Optimization

被引:14
作者
Chou, Chien-Lung [1 ]
Chen, Chieh-Yun [2 ]
Hsieh, Chia-Wei [3 ]
Shuai, Hong-Han [4 ]
Liu, Jiaying [5 ]
Cheng, Wen-Huang [2 ,6 ]
机构
[1] Univ Michigan, Dept Elect & Comp Engn, Ann Arbor, MI 48109 USA
[2] Natl Chiao Tung Univ, Inst Elect, Hsinchu 30010, Taiwan
[3] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
[4] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 30010, Taiwan
[5] Peking Univ, Wangxuan Inst Comp Technol, Beijing 100871, Peoples R China
[6] Natl Chung Hsing Univ, Artificial Intelligence & Data Sci Program, Taichung 402, Taiwan
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Clothing; Semantics; Image segmentation; Feature extraction; Faces; Task analysis; Image synthesis; Cross-modal learning; image synthesis; pose transfer; semantic-guided learning; virtual try-on; RECOGNITION;
D O I
10.1109/TNNLS.2021.3058379
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The virtual try-on task is so attractive that it has drawn considerable attention in the field of computer vision. However, presenting the 3-D physical characteristic (e.g., pleat and shadow) based on a 2-D image is very challenging. Although there have been several previous studies on 2-D-based virtual try-on work, most: 1) required user-specified target poses that are not user-friendly and may not be the best for the target clothing and 2) failed to address some problematic cases, including facial details, clothing wrinkles, and body occlusions. To address these two challenges, in this article, we propose an innovative template-free try-on image synthesis (TF-TIS) network. The TF-TIS first synthesizes the target pose according to the user-specified in-shop clothing. Afterward, given an in-shop clothing image, a user image, and a synthesized pose, we propose a novel model for synthesizing a human try-on image with the target clothing in the best fitting pose. The qualitative and quantitative experiments both indicate that the proposed TF-TIS outperforms the state-of-the-art methods, especially for difficult cases.
引用
收藏
页码:4584 / 4597
页数:14
相关论文
共 58 条
  • [1] 2D Human Pose Estimation: New Benchmark and State of the Art Analysis
    Andriluka, Mykhaylo
    Pishchulin, Leonid
    Gehler, Peter
    Schiele, Bernt
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3686 - 3693
  • [2] [Anonymous], 2012, P ACM INT C MULT
  • [3] Shape matching and object recognition using shape contexts
    Belongie, S
    Malik, J
    Puzicha, J
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) : 509 - 522
  • [4] OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields
    Cao, Zhe
    Hidalgo, Gines
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 172 - 186
  • [5] Learning Aligned Cross-Modal Representations from Weakly Aligned Data
    Castrejon, Lluis
    Aytar, Yusuf
    Vondrick, Carl
    Pirsiavash, Hamed
    Torralba, Antonio
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2940 - 2949
  • [6] Improved Bootstrapping for Approximate Homomorphic Encryption
    Chen, Hao
    Chillotti, Ilaria
    Song, Yongsoo
    [J]. ADVANCES IN CRYPTOLOGY - EUROCRYPT 2019, PT II, 2019, 11477 : 34 - 54
  • [7] Instance-Level Human Parsing via Part Grouping Network
    Gong, Ke
    Liang, Xiaodan
    Li, Yicheng
    Chen, Yimin
    Yang, Ming
    Lin, Liang
    [J]. COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 : 805 - 822
  • [8] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
  • [9] GarNet: A Two-Stream Network for Fast and Accurate 3D Cloth Draping
    Gundogdu, Erhan
    Constantin, Victor
    Seifoddini, Amrollah
    Dang, Minh
    Salzmann, Mathieu
    Fua, Pascal
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8738 - 8747
  • [10] Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification
    Guo, Jianyuan
    Yuan, Yuhui
    Huang, Lang
    Zhang, Chao
    Yao, Jin-Ge
    Han, Kai
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3641 - 3650