User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks

Cited by: 95
Authors
Ci, Yuanzheng [1]
Ma, Xinzhu [1]
Wang, Zhihui [2]
Li, Haojie [2]
Luo, Zhongxuan [2]
Affiliations
[1] Dalian Univ Technol, DUT RU Int Sch Informat Sci & Engn, Dalian, Peoples R China
[2] Dalian Univ Technol, Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian, Peoples R China
Source
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18) | 2018
Funding
National Natural Science Foundation of China;
Keywords
Interactive Colorization; GANs; Edit Propagation; IMAGE;
DOI
10.1145/3240508.3240661
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Scribble-color-based line art colorization is a challenging computer vision problem, since line art contains neither greyscale values nor semantic information, and the lack of authentic illustration-line art training pairs makes model generalization even harder. Recently, several methods based on Generative Adversarial Nets (GANs) have achieved great success. They can generate colorized illustrations conditioned on a given line art and color hints. However, these methods fail to capture the authentic illustration distribution and are hence perceptually unsatisfying, in the sense that they often lack accurate shading. To address these challenges, we propose a novel deep conditional adversarial architecture for scribble-based anime line art colorization. Specifically, we integrate the conditional framework with WGAN-GP criteria as well as the perceptual loss, which enables us to robustly train a deep network that makes the synthesized images more natural and realistic. We also introduce a local features network that is independent of synthetic data. With GANs conditioned on features from this network, we notably increase the generalization capability on "in the wild" line art. Furthermore, we collect two datasets that provide high-quality colorful illustrations and authentic line art for training and benchmarking. With the proposed model trained on our illustration dataset, we demonstrate that images synthesized by the presented approach are considerably more realistic and precise than those of alternative approaches.
Pages: 1536-1544
Page count: 9