CartoonGAN: Generative Adversarial Networks for Photo Cartoonization

被引：265

作者：

Chen, Yang ^{[1
]}

Lai, Yu-Kun ^{[2
]}

Liu, Yong-Jin ^{[1
]}

机构：

[1] Tsinghua Univ, Beijing, Peoples R China

[2] Cardiff Univ, Cardiff, S Glam, Wales

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

关键词：

D O I：

10.1109/CVPR.2018.00986

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a solution to transforming photos of real-world scenes into cartoon style images, which is valuable and challenging in computer vision and computer graphics. Our solution belongs to learning based methods, which have recently become popular to stylize images in artistic forms such as painting. However, existing methods do not produce satisfactory results for cartoonization, due to the fact that (1) cartoon styles have unique characteristics with high level simplification and abstraction, and (2) cartoon images tend to have clear edges, smooth color shading and relatively simple textures, which exhibit significant challenges for texture-descriptor-based loss functions used in existing methods. In this paper, we propose Cartoon-GAN, a generative adversarial network (GAN) framework for cartoon stylization. Our method takes unpaired photos and cartoon images for training, which is easy to use. Two novel losses suitable for cartoonization are proposed: (1) a semantic content loss, which is formulated as a sparse regularization in the high-level feature maps of the VGG network to cope with substantial style variation between photos and cartoons, and (2) an edge-promoting adversarial loss for preserving clear edges. We further introduce an initialization phase, to improve the convergence of the network to the target manifold. Our method is also much more efficient to train than existing methods. Experimental results show that our method is able to generate high-quality cartoon images from real-world photos (i.e., following specific artists' styles and with clear edges and smooth shading) and outperforms state-of-the-art methods.

引用

页码：9465 / 9474

页数：10

共 38 条

[1]

[Anonymous], 2013, COMPUTATIONAL IMAGIN

[2]

[Anonymous], 2016, P 4 INT C LEARN REPR

[3]

[Anonymous], INT C IM PROC

[4]

[Anonymous], 2014, ADV NEUR IN, DOI DOI 10.1145/3422622

[5] A COMPUTATIONAL APPROACH TO EDGE-DETECTION [J].

CANNY, J .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1986, 8 (06) :679-698

[6]

Chen Y., 2017, INT C IM PROC

[7]

Collobert R, 2011, BIGLEARN NIPS WORKSH, P1

[8]

Dumoulin V., 2017, INT C LEARN REPR, P1

[9]

Gatys L., 2015, Texture Synthesis Using Convolutional Neural NetworksOpen Source Implementation on GitHub, P262

[10]

Gatys L.A., 2015, A neural algorithm of artistic style, DOI [DOI 10.1167/16.12.326, 10.1167/16.12.326]

← 1 2 3 4 →