FA-VTON: A Feature Alignment-Based Model for Virtual Try-On

被引:0
|
作者
Wan, Yan [1 ]
Ding, Ning [1 ]
Yao, Li [1 ]
机构
[1] Donghua Univ, Sch Comp Sci & Technol, 2999 North Renmin Rd, Shanghai 201620, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 12期
关键词
deep learning; virtual try-on; image generation; knowledge distillation;
D O I
10.3390/app14125255
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The virtual try-on technology based on 2D images aims to seamlessly transfer provided garments onto target person images. Prior methods mainly concentrated on warping garments and generating images, overlooking the influence of feature alignment on the try-on results. In this study, we initially analyze the distortions present by existing methods and elucidate the critical role of feature alignment in the extraction stage. Building on this, we propose a novel feature alignment-based model (FA-VTON). Specifically, FA-VTON aligns the upsampled higher-level features from both person and garment images to acquire precise boundary information, which serves as guidance for subsequent garment warping. Concurrently, the Efficient Channel Attention mechanism (ECA) is introduced to generate the final result in the try-on generation module. This mechanism enables adaptive adjustment of channel feature weights to extract important features and reduce artifact generation. Furthermore, to make the student network focus on salient regions of each channel, we utilize channel-wise distillation (CWD) to minimize the Kullback-Leibler (KL) divergence between the channel probability maps of the two networks. The experiments show that our model achieves better results in both qualitative and quantitative analyses compared to current methods on the popular virtual try-on datasets.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] VTON-SCFA: A Virtual Try-On Network Based on the Semantic Constraints and Flow Alignment
    Du, Chenghu
    Yu, Feng
    Jiang, Minghua
    Hua, Ailing
    Wei, Xiong
    Peng, Tao
    Hu, Xinrong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 777 - 791
  • [2] WAS-VTON: Warping Architecture Search for Virtual Try-on Network
    Xie, Zhenyu
    Zhang, Xujie
    Zhao, Fuwei
    Dong, Haoye
    Kampffmeyer, Michael C.
    Yan, Haonan
    Liang, Xiaodan
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3350 - 3359
  • [3] LC-VTON: Length Controllable Virtual Try-On Network
    Yao, Jinliang
    Zheng, Haonan
    IEEE ACCESS, 2023, 11 : 88451 - 88461
  • [4] UF-VTON: Toward User-Friendly Virtual Try-On Network
    Chang, Yuan
    Peng, Tao
    He, Ruhan
    Hu, Xinrong
    Liu, Junping
    Zhang, Zili
    Jiang, Minghua
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 313 - 321
  • [5] KF-VTON: Keypoints-Driven Flow Based Virtual Try-On Network
    Wu, Zizhao
    Liu, Siyu
    Lu, Peioyan
    Yang, Ping
    Wong, Yongkang
    Gu, Xiaoling
    Kankanhalli, Mohan s.
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (09)
  • [6] MT-VTON: Multilevel Transformation-Based Virtual Try-On for Enhancing Realism of Clothing
    Lee, Jaeyoung
    Lee, Moonhyun
    Kim, Younghoon
    APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [7] Self-supervised feature matched virtual try-on
    Jiang, Shiyi
    Xu, Yang
    Li, Danyang
    Fan, Runze
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2023, 10 (05) : 1958 - 1969
  • [8] DP-VTON: TOWARD DETAIL-PRESERVING IMAGE-BASED VIRTUAL TRY-ON NETWORK
    Chang, Yuan
    Peng, Tao
    He, Ruhan
    Hu, Xinrong
    Liu, Junping
    Zhang, Zili
    Jiang, Minghua
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2295 - 2299
  • [9] CS-VITON: a realistic virtual try-on network based on clothing region alignment and SPM
    Chen, Jinguang
    Zhang, Xin
    Ma, Lili
    Yang, Bo
    Zhang, Kaibing
    VISUAL COMPUTER, 2025, 41 (01): : 563 - 577
  • [10] Slot-VTON: subject-driven diffusion-based virtual try-on with slot attention
    Ye, Jianglei
    Wang, Yigang
    Xie, Fengmao
    Wang, Qin
    Gu, Xiaoling
    Wu, Zizhao
    VISUAL COMPUTER, 2024, : 3297 - 3308