Seeing What a GAN Cannot Generate

被引:233
作者
Bau, David [1 ,2 ]
Zhu, Jun-Yan [1 ]
Wulff, Jonas [1 ]
Peebles, William [1 ]
Strobelt, Hendrik [2 ]
Zhou, Bolei [3 ]
Torralba, Antonio [1 ,2 ]
机构
[1] MIT, CSAIL, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT IBM Watson Lab, Cambridge, MA 02142 USA
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
D O I
10.1109/ICCV.2019.00460
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the success of Generative Adversarial Networks (GANs), mode collapse remains a serious issue during GAN training. To date, little work has focused on understanding and quantifying which modes have been dropped by a model. In this work, we visualize mode collapse at both the distribution level and the instance level. First, we deploy a semantic segmentation network to compare the distribution of segmented objects in the generated images with the target distribution in the training set. Differences in statistics reveal object classes that are omitted by a GAN. Second, given the identified omitted object classes, we visualize the GAN's omissions directly. In particular, we compare specific differences between individual photos and their approximate inversions by a GAN. To this end, we relax the problem of inversion and solve the tractable problem of inverting a GAN layer instead of the entire generator. Finally, we use this framework to analyze several recent GANs trained on multiple datasets and identify their typical failure cases.
引用
收藏
页码:4501 / 4510
页数:10
相关论文
共 52 条
[41]  
Salimans Tim, 2016, NIPS
[42]  
Simonyan K., 2014, P 3 INT C LEARN REPR
[43]  
Smilkov D., 2017, INT C LEARN REPR WOR
[44]  
Springenberg J.T., 2014, STRIVING SIMPLICITY
[45]  
Theis L., 2016, 4 INT C LEARN REPR I
[46]  
Wang Ting-Chun, 2018, NIPS
[47]  
Wang X., 2017, CVPR
[48]  
Wu Y., 2017, ICLR
[49]  
Xiao T., 2018, ECCV
[50]  
Zeiler M.D., 2014, EUROPEAN C COMPUTER