An Empirical Analysis of Generative Adversarial Network Training Times with Varying Batch

被引：0

作者：

Ghosh, Bhaskar ^{[1
]}

Dutta, Indira Kalyan ^{[1
]}

Carlson, Albert

Totaro, Michael ^{[1
]}

Bayoumi, Magdy ^{[1
]}

机构：

[1] Univ Louisiana Lafayette, Lafayette, LA 70504 USA

来源：

2020 11TH IEEE ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON) | 2020年

关键词：

Generative Adversarial Networks; Training; Hyper-parameter; Neural Networks; Artificial Intelligence;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Increasing the performance of a Generative Adversarial Network (GAN) requires experimentation in choosing the suitable training hyper-parameters of learning rate and batch size. There is no consensus on learning rates or batch sizes in GANs, which makes it a "trial-and-error" process to get acceptable output. Researchers have differing views regarding the effect of batch sizes on run time. This paper investigates the impact of these training parameters of GANs with respect to actual elapsed training time. In our initial experiments, we study the effects of batch sizes, learning rates, loss function, and optimization algorithm on training using the MNIST dataset over 30,000 epochs. The simplicity of the MNIST dataset allows for a starting point in initial studies to understand if the parameter changes have any significant impact on the training times. The goal is to analyze and understand the results of varying loss functions, batch sizes, optimizer algorithms, and learning rates on GANs and address the key issue of batch size and learning rate selection.

引用

页码：643 / 648

页数：6