Anime-to-real clothing: Cosplay costume generation via image-to-image translation

被引：2

作者：

Tango, Koya ^{[1
]}

Katsurai, Marie ^{[1
]}

Maki, Hayato ^{[2
]}

Goto, Ryosuke ^{[2
]}

机构：

[1] Doshisha Univ, 1-3 Tatara Miyakodani, Kyotanabe, Kyoto 6100394, Japan

[2] ZOZO Technol, Shibuya Ku, Aoyama Oval Bldg 3F,5-52-2 Jingumae, Tokyo 1500001, Japan

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2022年 / 81卷 / 20期

关键词：

Clothing images; Image-to-image translation; Generative adversarial networks; Dataset construction; Image synthesis;

D O I：

10.1007/s11042-022-12576-x

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Cosplay has grown from its origins at fan conventions into a billion-dollar global dress phenomenon. To facilitate the imagination and reinterpretation of animated images as real garments, this paper presents an automatic costume-image generation method based on image-to-image translation. Cosplay items can be significantly diverse in their styles and shapes, and conventional methods cannot be directly applied to the wide variety of clothing images that are the focus of this study. To solve this problem, our method starts by collecting and preprocessing web images to prepare a cleaned, paired dataset of the anime and real domains. Then, we present a novel architecture for generative adversarial networks (GANs) to facilitate high-quality cosplay image generation. Our GAN consists of several effective techniques to bridge the two domains and improve both the global and local consistency of generated images. Experiments demonstrated that, with quantitative evaluation metrics, the proposed GAN performs better and produces more realistic images than conventional methods. Our codes and pretrained model are available on the web.

引用

页码：29505 / 29523

页数：19

共 45 条

[1] Azathoth, 2018, MYANIMELIST DATASET
[2] Cascaded Pyramid Network for Multi-Person Pose Estimation
Chen, Yilun
Wang, Zhicheng
Peng, Yuxiang
Zhang, Zhiqiang
Yu, Gang
Sun, Jian
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112
[3] Cheng W.H., 2020, Fashion meets computer vision: A survey
[4] User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks
Ci, Yuanzheng
Ma, Xinzhu
Wang, Zhihui
Li, Haojie
Luo, Zhongxuan
[J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1536 - 1544
[5] CooperUnion, 2016, AN REC DAT REC DAT
[6] The Cityscapes Dataset for Semantic Urban Scene Understanding
Cordts, Marius
Omran, Mohamed
Ramos, Sebastian
Rehfeld, Timo
Enzweiler, Markus
Benenson, Rodrigo
Franke, Uwe
Roth, Stefan
Schiele, Bernt
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
[7] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[8] Hamada K., 2018, P EUR C COMP VIS ECC, P67
[9] VITON: An Image-based Virtual Try-on Network
Han, Xintong
Wu, Zuxuan
Wu, Zhe
Yu, Ruichi
Davis, Larry S.
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7543 - 7552
[10] Hensel M, 2017, ADV NEUR IN, V30

← 1 2 3 4 5 →