A progressive distillation network for practical image-based virtual try-on

被引:3
|
作者
Luo, Weihao [1 ]
Zeng, Zezhen [2 ]
Zhong, Yueqi [1 ]
机构
[1] Donghua Univ, Coll Text, Key Lab Text Sci & Technol, Minist Educ, Shanghai 201620, Peoples R China
[2] Zhejiang Lab, Res Ctr Appl Math & Machine Intelligence, Hangzhou 311121, Peoples R China
基金
上海市自然科学基金;
关键词
Virtual try -on; Knowledge distillation; Progressive distillation; Cross attention; Transformer;
D O I
10.1016/j.eswa.2024.123213
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The image-based virtual try-on technology aims to match in-store clothing to person's image wearing clothes. Previous methods require a large amount of input information for each try-on result, which is not practical, and slight deviations in input information can cause a large number of artifacts in the try-on results. In recent years, researchers have paid attention to parsing-free virtual try-on methods, and a groundbreaking work used knowledge distillation techniques to eliminate the redundant input. This method uses the parsing-based virtual try-on model as the supervision information to train a student network model, which can generate try-on results without extra input. However, due to the huge knowledge gap between the teacher-student networks, direct distillation makes it difficult for the student network to fully simulate the teacher network. To solve this problem, we propose a progressive distillation scheme for image-based virtual try-on called PD-VTON, using an assistant network to alleviate this huge knowledge gap. Our method can generate more realistic and reasonable try-on results without requiring extra body parsing information or body pose information. Specifically, unlike existing distillation-based parsing-free virtual try-on methods, we adopt an assistant-supported progressive distillation network to alleviate the insufficient learning caused by the large knowledge gap and design an Adaptive Choose Teacher (ACT) module to optimize the distillation. Moreover, we introduce a novel cross attention-based stitching structure when generating try-on images, aiming to better constrain the alignment between the highlevel semantic features of person image and warped clothing image, and use a transformer-assisted generator to generate the results. Finally, extensive experimental evaluations demonstrate the unique advantages of our method.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] VITON: An Image-based Virtual Try-on Network
    Han, Xintong
    Wu, Zuxuan
    Wu, Zhe
    Yu, Ruichi
    Davis, Larry S.
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7543 - 7552
  • [2] IMAGE-BASED VIRTUAL TRY-ON NETWORK WITH STRUCTURAL COHERENCE
    Sun, Feng
    Guo, Jiaming
    Su, Zhuo
    Gao, Chengying
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 519 - 523
  • [3] Image-Based Virtual Try-On: A Survey
    Song, Dan
    Zhang, Xuanpu
    Zhou, Juan
    Nie, Weizhi
    Tong, Ruofeng
    Kankanhalli, Mohan
    Liu, An-An
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, : 2692 - 2720
  • [4] TOAC: Try-On Aligning Conformer for Image-Based Virtual Try-On Alignment
    Wang, Yifei
    Xiang, Wang
    Zhang, Shengjie
    Xue, Dizhan
    Qian, Shengsheng
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 29 - 40
  • [5] Image-based virtual try-on: Fidelity and simplification
    Islam, Tasin
    Miron, Alina
    Liu, Xiaohui
    Li, Yongmin
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2024, 129
  • [6] Virtual Try-On through Image-Based Rendering
    Hauswiesner, Stefan
    Straka, Matthias
    Reitmayr, Gerhard
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2013, 19 (09) : 1552 - 1565
  • [7] Toward Characteristic-Preserving Image-Based Virtual Try-On Network
    Wang, Bochao
    Zheng, Huabin
    Liang, Xiaodan
    Chen, Yimin
    Lin, Liang
    Yang, Meng
    COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 : 607 - 623
  • [8] VTNCT: an image-based virtual try-on network by combining feature with pixel transformation
    Chang, Yuan
    Peng, Tao
    Yu, Feng
    He, Ruhan
    Hu, Xinrong
    Liu, Junping
    Zhang, Zili
    Jiang, Minghua
    VISUAL COMPUTER, 2023, 39 (07): : 2583 - 2596
  • [9] VTNCT: an image-based virtual try-on network by combining feature with pixel transformation
    Yuan Chang
    Tao Peng
    Feng Yu
    Ruhan He
    Xinrong Hu
    Junping Liu
    Zili Zhang
    Minghua Jiang
    The Visual Computer, 2023, 39 : 2583 - 2596
  • [10] VTNFP: An Image-based Virtual Try-on Network with Body and Clothing Feature Preservation
    Yu, Ruiyun
    Wang, Xiaoqi
    Xie, Xiaohui
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10510 - 10519