CartoonGAN: Generative Adversarial Networks for Photo Cartoonization

被引:265
作者
Chen, Yang [1 ]
Lai, Yu-Kun [2 ]
Liu, Yong-Jin [1 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Cardiff Univ, Cardiff, S Glam, Wales
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
D O I
10.1109/CVPR.2018.00986
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a solution to transforming photos of real-world scenes into cartoon style images, which is valuable and challenging in computer vision and computer graphics. Our solution belongs to learning based methods, which have recently become popular to stylize images in artistic forms such as painting. However, existing methods do not produce satisfactory results for cartoonization, due to the fact that (1) cartoon styles have unique characteristics with high level simplification and abstraction, and (2) cartoon images tend to have clear edges, smooth color shading and relatively simple textures, which exhibit significant challenges for texture-descriptor-based loss functions used in existing methods. In this paper, we propose Cartoon-GAN, a generative adversarial network (GAN) framework for cartoon stylization. Our method takes unpaired photos and cartoon images for training, which is easy to use. Two novel losses suitable for cartoonization are proposed: (1) a semantic content loss, which is formulated as a sparse regularization in the high-level feature maps of the VGG network to cope with substantial style variation between photos and cartoons, and (2) an edge-promoting adversarial loss for preserving clear edges. We further introduce an initialization phase, to improve the convergence of the network to the target manifold. Our method is also much more efficient to train than existing methods. Experimental results show that our method is able to generate high-quality cartoon images from real-world photos (i.e., following specific artists' styles and with clear edges and smooth shading) and outperforms state-of-the-art methods.
引用
收藏
页码:9465 / 9474
页数:10
相关论文
共 38 条
[1]  
[Anonymous], 2013, COMPUTATIONAL IMAGIN
[2]  
[Anonymous], 2016, P 4 INT C LEARN REPR
[3]  
[Anonymous], INT C IM PROC
[4]  
[Anonymous], 2014, ADV NEUR IN, DOI DOI 10.1145/3422622
[6]  
Chen Y., 2017, INT C IM PROC
[7]  
Collobert R, 2011, BIGLEARN NIPS WORKSH, P1
[8]  
Dumoulin V., 2017, INT C LEARN REPR, P1
[9]  
Gatys L., 2015, Texture Synthesis Using Convolutional Neural NetworksOpen Source Implementation on GitHub, P262
[10]  
Gatys L.A., 2015, A neural algorithm of artistic style, DOI [DOI 10.1167/16.12.326, 10.1167/16.12.326]