An Empirical Analysis of Generative Adversarial Network Training Times with Varying Batch

被引:0
作者
Ghosh, Bhaskar [1 ]
Dutta, Indira Kalyan [1 ]
Carlson, Albert
Totaro, Michael [1 ]
Bayoumi, Magdy [1 ]
机构
[1] Univ Louisiana Lafayette, Lafayette, LA 70504 USA
来源
2020 11TH IEEE ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON) | 2020年
关键词
Generative Adversarial Networks; Training; Hyper-parameter; Neural Networks; Artificial Intelligence;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Increasing the performance of a Generative Adversarial Network (GAN) requires experimentation in choosing the suitable training hyper-parameters of learning rate and batch size. There is no consensus on learning rates or batch sizes in GANs, which makes it a "trial-and-error" process to get acceptable output. Researchers have differing views regarding the effect of batch sizes on run time. This paper investigates the impact of these training parameters of GANs with respect to actual elapsed training time. In our initial experiments, we study the effects of batch sizes, learning rates, loss function, and optimization algorithm on training using the MNIST dataset over 30,000 epochs. The simplicity of the MNIST dataset allows for a starting point in initial studies to understand if the parameter changes have any significant impact on the training times. The goal is to analyze and understand the results of varying loss functions, batch sizes, optimizer algorithms, and learning rates on GANs and address the key issue of batch size and learning rate selection.
引用
收藏
页码:643 / 648
页数:6
相关论文
共 50 条
[21]   An Automatic Control Perspective on Parameterizing Generative Adversarial Network [J].
Mu, Jinzhen ;
Xin, Ming ;
Li, Shuang ;
Jiang, Bin .
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (03) :1854-1867
[22]   Generative Adversarial Network for Multi Facial Attributes Translation [J].
Liu, Jun ;
Liu, Xiaoyang ;
Feng, Yanjun .
IEEE ACCESS, 2021, 9 (09) :129375-129384
[23]   PSGAN: A Generative Adversarial Network for Remote Sensing Image Pan-Sharpening [J].
Liu, Qingjie ;
Zhou, Huanyu ;
Xu, Qizhi ;
Liu, Xiangyu ;
Wang, Yunhong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (12) :10227-10242
[24]   EGANS: Evolutionary Generative Adversarial Network Search for Zero-Shot Learning [J].
Chen, Shiming ;
Chen, Shuhuang ;
Hou, Wenjin ;
Ding, Weiping ;
You, Xinge .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2024, 28 (03) :582-596
[25]   SIA-GAN: Scrambling Inversion Attack Using Generative Adversarial Network [J].
Madono, Koki ;
Tanaka, Masayuki ;
Onishi, Masaki ;
Ogawa, Tetsuji .
IEEE ACCESS, 2021, 9 :129385-129393
[26]   An Empirical Study of the Effects of Sample-Mixing Methods for Efficient Training of Generative Adversarial Networks [J].
Takamoto, Makoto ;
Morishita, Yusuke .
2021 IEEE 4TH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL, MIPR, 2021, :49-55
[27]   GANE: A Generative Adversarial Network Embedding [J].
Hong, Huiting ;
Li, Xin ;
Wang, Mingzhong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) :2325-2335
[28]   The Generative Adversarial Random Neural Network [J].
Serrano, Will .
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2021, 2021, 627 :567-580
[29]   Compressive Privacy Generative Adversarial Network [J].
Tseng, Bo-Wei ;
Wu, Pei-Yuan .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 :2499-2513
[30]   CapsuleGAN: Generative Adversarial Capsule Network [J].
Jaiswal, Ayush ;
AbdAlmageed, Wael ;
Wu, Yue ;
Natarajan, Premkumar .
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT III, 2019, 11131 :526-535