MISS GAN: A Multi-IlluStrator style generative adversarial network for image to illustration translation

被引:10
作者
Barzilay, Noa [1 ]
Shalev, Tal Berkovitz [1 ]
Giryes, Raja [1 ]
机构
[1] Tel Aviv Univ, Sch Elect Engn, IL-69978 Tel Aviv, Israel
关键词
Generative adversarial networks; Image to image translation; Illustration; Multi style transfer; TEXT;
D O I
10.1016/j.patrec.2021.08.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised style transfer that supports diverse input styles using only one trained generator is a challenging and interesting task in computer vision. This paper proposes a Multi-IlluStrator Style Generative Adversarial Network (MISS GAN) that is a multi-style framework for unsupervised image-to-illustration translation, which can generate styled yet content preserving images. The illustrations dataset is a challenging one since it is comprised of illustrations of seven different illustrators, hence contains diverse styles. Existing methods require to train several generators (as the number of illustrators) to handle the different illustrators' styles, which limits their practical usage, or require to train an image specific network, which ignores the style information provided in other images of the illustrator. MISS GAN is both input image specific and uses the information of other images using only one trained model. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:140 / 147
页数:8
相关论文
共 33 条
[1]   Semantically consistent text to fashion image synthesis with an enhanced attentional generative adversarial network ? [J].
Ak, Kenan E. ;
Lim, Joo Hwee ;
Tham, Jo Yew ;
Kassim, Ashraf A. .
PATTERN RECOGNITION LETTERS, 2020, 135 :22-29
[2]  
[Anonymous], 2015, A neural algorithm of artistic style, DOI DOI 10.1167/16.12.326
[3]  
Bynagari N.B., 2019, ADV NEURAL INFORM PR, V8, P25, DOI 10.18034/ajase.v8i1.9
[4]   CartoonGAN: Generative Adversarial Networks for Photo Cartoonization [J].
Chen, Yang ;
Lai, Yu-Kun ;
Liu, Yong-Jin .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9465-9474
[5]   StarGAN v2: Diverse Image Synthesis for Multiple Domains [J].
Choi, Yunjey ;
Uh, Youngjung ;
Yoo, Jaejun ;
Ha, Jung-Woo .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8185-8194
[6]   Generative Adversarial Networks [J].
Goodfellow, Ian ;
Pouget-Abadie, Jean ;
Mirza, Mehdi ;
Xu, Bing ;
Warde-Farley, David ;
Ozair, Sherjil ;
Courville, Aaron ;
Bengio, Yoshua .
COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144
[7]   GANILLA: Generative adversarial networks for image to illustration translation [J].
Hicsonmez, Samet ;
Samet, Nermin ;
Akbas, Emre ;
Duygulu, Pinar .
IMAGE AND VISION COMPUTING, 2020, 95
[8]   DRAW: Deep Networks for Recognizing Styles of Artists Who Illustrate Children's Books [J].
Hicsonmez, Samet ;
Samet, Nermin ;
Sener, Fadime ;
Duygulu, Pinar .
PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, :343-351
[9]   Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis [J].
Huang, Rui ;
Zhang, Shu ;
Li, Tianyu ;
He, Ran .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2458-2467
[10]   Multimodal Unsupervised Image-to-Image Translation [J].
Huang, Xun ;
Liu, Ming-Yu ;
Belongie, Serge ;
Kautz, Jan .
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 :179-196