DFS-GAN: stabilizing training of generative adversarial networks through discarding fake samples

被引：1

作者：

Yang, Lianping ^{[1
]}

Sun, Hao ^{[1
]}

Zhang, Jian ^{[1
]}

Mo, Sijia ^{[1
]}

Jiang, Wuming ^{[2
]}

Zhang, Xiangde ^{[1
]}

机构：

[1] Northeastern Univ, Coll Sci, Shenyang, Peoples R China

[2] Beijing EyeCool Technol Co Ltd, Beijing, Peoples R China

来源：

JOURNAL OF ELECTRONIC IMAGING | 2022年 / 31卷 / 06期

关键词：

generative adversarial network; stabilized training; generated samples; IMAGE SYNTHESIS;

D O I：

10.1117/1.JEI.31.6.063016

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Generative adversarial networks (GANs) are generative models based on game theory. Because the relationship between generator and discriminator must be carefully adjusted during the training process, it is difficult to get stable training. Although some solutions are proposed to alleviate this issue, it is still necessary to discuss how to improve the stability of GANs. We propose a GAN we call the discarding fake samples (DFS)-GAN. During the training process, some generated samples are unable to fool the discriminator and provide a relatively invalid gradient for the discriminator. So, in the stabilized discriminator module (SDM), we discard the fake but easily discriminated samples. At the same time, we propose a new loss function, SGAN-gradient penalty 1. We explain the rationale of SDM and our loss function from a Bayesian decision perspective. We inferred the best number of discarded fake samples and verified the selected parameters' effectiveness by experiments. The Frechet inception distance (FID) value of DFS-GAN is 14.57 +/- 0.19 on Canadian Institute for Advanced Research-10 (CIFAR-10), 20.87 +/- 0.33 on CIFAR-100, and 92.42 +/- 0.43 on ImageNet, which is lower than that of the current optimal method. Moreover, SDM module can be used in many GANs to decrease the FID value if their loss functions fit. (c) 2022 SPIE and IS&T

引用

页数：21

共 33 条

[1] [Anonymous], 2016, P BRIT MACHINE VISIO
[2] Arjovsky M., 2017, P INT C LEARN REPR, P1
[3] Arjovsky M, 2017, PR MACH LEARN RES, V70
[4] Berthelot D, 2017, ARXIV
[5] THE LAPLACIAN PYRAMID AS A COMPACT IMAGE CODE
BURT, PJ
ADELSON, EH
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1983, 31 (04) : 532 - 540
[6] Denton E, 2015, ADV NEUR IN, V28
[7] Generative Adversarial Networks
Goodfellow, Ian
Pouget-Abadie, Jean
Mirza, Mehdi
Xu, Bing
Warde-Farley, David
Ozair, Sherjil
Courville, Aaron
Bengio, Yoshua
[J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144
[8] Hensel M, 2017, ADV NEUR IN, V30
[9] Lim JH, 2017, Arxiv, DOI [arXiv:1705.02894, 10.48550/arXiv.1705.02894]
[10] Gulrajani I, 2017, ADV NEUR IN, V30

← 1 2 3 4 →