Optimizing the neural network hyperparameters utilizing genetic algorithm

被引:56
作者
Nikbakht, Saeid [3 ]
Anitescu, Cosmin [3 ]
Rabczuk, Timon [1 ,2 ]
机构
[1] Ton Duc Thang Univ, Div Computat Mech, Ho Chi Minh City, Vietnam
[2] Ton Duc Thang Univ, Fac Civil Engn, Ho Chi Minh City, Vietnam
[3] Bauhaus Univ Weimar, Inst Struct Mech, D-99423 Weimar, Germany
来源
JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A | 2021年 / 22卷 / 06期
关键词
Machine learning; Neural network (NN); Hyperparameters; Genetic algorithm; TP18; OPTIMIZATION;
D O I
10.1631/jzus.A2000384
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Neural networks (NNs), as one of the most robust and efficient machine learning methods, have been commonly used in solving several problems. However, choosing proper hyperparameters (e.g. the numbers of layers and neurons in each layer) has a significant influence on the accuracy of these methods. Therefore, a considerable number of studies have been carried out to optimize the NN hyperparameters. In this study, the genetic algorithm is applied to NN to find the optimal hyperparameters. Thus, the deep energy method, which contains a deep neural network, is applied first on a Timoshenko beam and a plate with a hole. Subsequently, the numbers of hidden layers, integration points, and neurons in each layer are optimized to reach the highest accuracy to predict the stress distribution through these structures. Thus, applying the proper optimization method on NN leads to significant increase in the NN prediction accuracy after conducting the optimization in various examples.
引用
收藏
页码:407 / 426
页数:20
相关论文
共 26 条
[1]  
Alhichri H, 2018, INT CONF ELECTRO INF, P113, DOI 10.1109/EIT.2018.8500107
[2]   Recovery-based error estimation and adaptivity using high-order splines over hierarchical T-meshes [J].
Anitescu, Cosmin ;
Hossain, Md Naim ;
Rabczuk, Timon .
COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2018, 328 :638-662
[3]  
[Anonymous], 2018, P 7 ANN WORLD C SOC
[4]   The use of Timoshenko's exact solution for a cantilever beam in adaptive analysis [J].
Augarde, Charles E. ;
Deeks, Andrew J. .
FINITE ELEMENTS IN ANALYSIS AND DESIGN, 2008, 44 (9-10) :595-601
[5]   Optimizing Convolutional Neural Network Hyperparameters by Enhanced Swarm Intelligence Metaheuristics [J].
Bacanin, Nebojsa ;
Bezdan, Timea ;
Tuba, Eva ;
Strumberger, Ivana ;
Tuba, Milan .
ALGORITHMS, 2020, 13 (03)
[6]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[7]  
Dalto M, 2015, 2015 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1657, DOI 10.1109/ICIT.2015.7125335
[8]   Particle swarm optimization of deep neural networks architectures for image classification [J].
Fernandes Junior, Francisco Erivaldo ;
Yen, Gary G. .
SWARM AND EVOLUTIONARY COMPUTATION, 2019, 49 :62-74
[9]   Transfer learning enhanced physics informed neural network for phase-field modeling of fracture [J].
Goswami, Somdatta ;
Anitescu, Cosmin ;
Chakraborty, Souvik ;
Rabczuk, Timon .
THEORETICAL AND APPLIED FRACTURE MECHANICS, 2020, 106
[10]   The Tabu_Genetic Algorithm: A Novel Method for Hyper-Parameter Optimization of Learning Algorithms [J].
Guo, Baosu ;
Hu, Jingwen ;
Wu, Wenwen ;
Peng, Qingjin ;
Wu, Fenghe .
ELECTRONICS, 2019, 8 (05)