Transforming and Projecting Images into Class-Conditional Generative Networks

Cited by: 42
Authors
Huh, Minyoung [1, 2]
Zhang, Richard [2 ]
Zhu, Jun-Yan [2 ]
Paris, Sylvain [2 ]
Hertzmann, Aaron [2 ]
Affiliations
[1] MIT, CSAIL, Cambridge, MA 02139 USA
[2] Adobe Res, San Francisco, CA USA
Source
COMPUTER VISION - ECCV 2020, PT II | 2020 / Vol. 12347
DOI
10.1007/978-3-030-58536-5_2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a method for projecting an input image into the space of a class-conditional generative neural network. The method optimizes for image transformations to counteract the model biases in generative neural networks. Specifically, we demonstrate that one can solve for image translation, scale, and global color transformation during the projection optimization to address the object-center bias and color bias of a Generative Adversarial Network. This projection process poses a difficult optimization problem, and purely gradient-based optimizations fail to find good solutions. We describe a hybrid optimization strategy that finds good projections by estimating transformations and class parameters. We show the effectiveness of our method on real images and further demonstrate how the corresponding projections lead to better editability of these images. The project page and code are available at https://minyoungg.github.io/GAN-Transform-and-Project/.
Pages: 17-34
Page count: 18