Anime-to-real clothing: Cosplay costume generation via image-to-image translation

被引:2
作者
Tango, Koya [1 ]
Katsurai, Marie [1 ]
Maki, Hayato [2 ]
Goto, Ryosuke [2 ]
机构
[1] Doshisha Univ, 1-3 Tatara Miyakodani, Kyotanabe, Kyoto 6100394, Japan
[2] ZOZO Technol, Shibuya Ku, Aoyama Oval Bldg 3F,5-52-2 Jingumae, Tokyo 1500001, Japan
关键词
Clothing images; Image-to-image translation; Generative adversarial networks; Dataset construction; Image synthesis;
D O I
10.1007/s11042-022-12576-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cosplay has grown from its origins at fan conventions into a billion-dollar global dress phenomenon. To facilitate the imagination and reinterpretation of animated images as real garments, this paper presents an automatic costume-image generation method based on image-to-image translation. Cosplay items can be significantly diverse in their styles and shapes, and conventional methods cannot be directly applied to the wide variety of clothing images that are the focus of this study. To solve this problem, our method starts by collecting and preprocessing web images to prepare a cleaned, paired dataset of the anime and real domains. Then, we present a novel architecture for generative adversarial networks (GANs) to facilitate high-quality cosplay image generation. Our GAN consists of several effective techniques to bridge the two domains and improve both the global and local consistency of generated images. Experiments demonstrated that, with quantitative evaluation metrics, the proposed GAN performs better and produces more realistic images than conventional methods. Our codes and pretrained model are available on the web.
引用
收藏
页码:29505 / 29523
页数:19
相关论文
共 45 条
  • [1] Azathoth, 2018, MYANIMELIST DATASET
  • [2] Cascaded Pyramid Network for Multi-Person Pose Estimation
    Chen, Yilun
    Wang, Zhicheng
    Peng, Yuxiang
    Zhang, Zhiqiang
    Yu, Gang
    Sun, Jian
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112
  • [3] Cheng W.H., 2020, Fashion meets computer vision: A survey
  • [4] User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks
    Ci, Yuanzheng
    Ma, Xinzhu
    Wang, Zhihui
    Li, Haojie
    Luo, Zhongxuan
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1536 - 1544
  • [5] CooperUnion, 2016, AN REC DAT REC DAT
  • [6] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [7] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
  • [8] Hamada K., 2018, P EUR C COMP VIS ECC, P67
  • [9] VITON: An Image-based Virtual Try-on Network
    Han, Xintong
    Wu, Zuxuan
    Wu, Zhe
    Yu, Ruichi
    Davis, Larry S.
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7543 - 7552
  • [10] Hensel M, 2017, ADV NEUR IN, V30