Understanding GANs in the LQG Setting: Formulation, Generalization and Stability

被引：21

作者：

Feizi, Soheil ^{[1
]}

Farnia, Farzan ^{[2
]}

Ginart, Tony ^{[3
]}

Tse, David ^{[3
]}

机构：

[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA

[2] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA

[3] Stanford Univ, Stanford, CA 94305 USA

来源：

IEEE JOURNAL ON SELECTED AREAS IN INFORMATION THEORY | 2020年 / 1卷 / 01期

基金：

美国国家科学基金会;

关键词：

Generative models; Wasserstein distance; PCA; stability; Lyapunov functions;

D O I：

10.1109/JSAIT.2020.2991375

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Generative Adversarial Networks (GANs) have become a popular method to learn a probability model from data. In this paper, we provide an understanding of basic issues surrounding GANs including their formulation, generalization and stability on a simple LQG benchmark where the generator is Linear, the discriminator is Quadratic and the data has a high-dimensional Gaussian distribution. Even in this simple benchmark, the GAN problem has not been well-understood as we observe that existing state-of-the-art GAN architectures may fail to learn a proper generative distribution owing to (1) stability issues (i.e., convergence to bad local solutions or not converging at all), (2) approximation issues (i.e., having improper global GAN optimizers caused by inappropriate GAN's loss functions), and (3) generalizability issues (i.e., requiring large number of samples for training). In this setup, we propose a GAN architecture which recovers the maximum-likelihood solution and demonstrates fast generalization. Moreover, we analyze global stability of different computational approaches for the proposed GAN and highlight their pros and cons. Finally, through experiments on MNIST and CIFAR-10 datasets, we outline extensions of our model-based approach to design GANs in more complex setups than the considered Gaussian benchmark.

引用

页码：304 / 311

页数：8

共 32 条

[1]

Arjovsky M, 2017, Arxiv, DOI [arXiv:1701.07875, 10.48550/arXiv.1701.07875]

[2]

Arora S, 2017, Arxiv, DOI [arXiv:1703.00573, DOI 10.48550/ARXIV.1703.00573]

[3]

Berthelot D, 2017, Arxiv, DOI arXiv:1703.10717

[4] Group invariant Peano curves [J].

Cannon, James W. ;

Thurston, William P. .

GEOMETRY & TOPOLOGY, 2007, 11 :1315-1355

[5]

Canas GD, 2012, Arxiv, DOI arXiv:1209.1077

[6]

Daskalakis C, 2018, Arxiv, DOI arXiv:1711.00141

[7]

Farnia F, 2018, Arxiv, DOI arXiv:1810.11740

[8]

Freund Y., 1996, Proceedings of the Ninth Annual Conference on Computational Learning Theory, P325, DOI 10.1145/238061.238163

[9]

Ge R, 2016, PR MACH LEARN RES, V48

[10]

Genevay A, 2019, Arxiv, DOI arXiv:1810.02733

← 1 2 3 4 →