Sketch-guided Deep Portrait Generation

Cited by: 17
Authors
Ho, Trang-Thi [1]
Virtusio, John Jethro [1]
Chen, Yung-Yao [2]
Hsu, Chih-Ming [3]
Hua, Kai-Lung [4, 5]
Affiliations
[1] Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[2] Natl Taiwan Univ Sci & Technol, Dept Elect & Comp Engn, Taipei, Taiwan
[3] Natl Taipei Univ Technol, Dept Mech Engn, Taipei, Taiwan
[4] Natl Taiwan Univ Sci & Technol, Dept CSIE, Taipei, Taiwan
[5] Natl Taiwan Univ Sci & Technol, Ctr Cyber Phys Syst Innovat, Taipei, Taiwan
Keywords
Image synthesis; generative adversarial networks; semantic keypoints; perceptual loss; convolutional autoencoder
DOI
10.1145/3396237
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Generating a realistic human class image from a sketch is a unique and challenging problem, considering that the human body has a complex structure that must be preserved. Additionally, input sketches often lack important details that are crucial in the generation process, which makes the problem more complicated. In this article, we present an effective method for synthesizing realistic images from human sketches. Our framework incorporates human poses corresponding to the locations of key semantic components (e.g., arms, eyes, nose), since they are a strong prior for generating human class images. Our sketch-image synthesis framework consists of three stages: semantic keypoint extraction, coarse image generation, and image refinement. First, we extract the semantic keypoints using Part Affinity Fields (PAFs) and a convolutional autoencoder. Then, we integrate the sketch with the semantic keypoints to generate a coarse image of a human. Finally, in the image refinement stage, the coarse image is enhanced by a Generative Adversarial Network (GAN) that adopts an architecture carefully designed to avoid checkerboard artifacts and to generate photo-realistic results. We evaluate our method on 6,300 sketch-image pairs and show that our proposed method generates realistic images and compares favorably against state-of-the-art image synthesis methods.
Pages: 18