Application of Meta-Heuristic Algorithms for Training Neural Networks and Deep Learning Architectures: A Comprehensive Review

被引:0
作者
Mehrdad Kaveh
Mohammad Saadi Mesgari
机构
[1] K. N. Toosi University of Technology,Department of Geodesy and Geomatics
来源
Neural Processing Letters | 2023年 / 55卷
关键词
Deep learning (DL); Artificial neural networks (ANN); Meta-heuristics (MH); Hyper-parameters optimization; Training; And gradient-based back propagation (BP) learning algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
The learning process and hyper-parameter optimization of artificial neural networks (ANNs) and deep learning (DL) architectures is considered one of the most challenging machine learning problems. Several past studies have used gradient-based back propagation methods to train DL architectures. However, gradient-based methods have major drawbacks such as stucking at local minimums in multi-objective cost functions, expensive execution time due to calculating gradient information with thousands of iterations and needing the cost functions to be continuous. Since training the ANNs and DLs is an NP-hard optimization problem, their structure and parameters optimization using the meta-heuristic (MH) algorithms has been considerably raised. MH algorithms can accurately formulate the optimal estimation of DL components (such as hyper-parameter, weights, number of layers, number of neurons, learning rate, etc.). This paper provides a comprehensive review of the optimization of ANNs and DLs using MH algorithms. In this paper, we have reviewed the latest developments in the use of MH algorithms in the DL and ANN methods, presented their disadvantages and advantages, and pointed out some research directions to fill the gaps between MHs and DL methods. Moreover, it has been explained that the evolutionary hybrid architecture still has limited applicability in the literature. Also, this paper classifies the latest MH algorithms in the literature to demonstrate their effectiveness in DL and ANN training for various applications. Most researchers tend to extend novel hybrid algorithms by combining MHs to optimize the hyper-parameters of DLs and ANNs. The development of hybrid MHs helps improving algorithms performance and capable of solving complex optimization problems. In general, the optimal performance of the MHs should be able to achieve a suitable trade-off between exploration and exploitation features. Hence, this paper tries to summarize various MH algorithms in terms of the convergence trend, exploration, exploitation, and the ability to avoid local minima. The integration of MH with DLs is expected to accelerate the training process in the coming few years. However, relevant publications in this way are still rare.
引用
收藏
页码:4519 / 4622
页数:103
相关论文
共 1660 条
  • [61] Wang XF(2009)Group search optimizer: an optimization algorithm inspired by animal searching behavior IEEE Trans Evol Comput 6 132-184
  • [62] Huang DS(2009)GSA: a gravitational search algorithm Inf Sci 15 1116-1183
  • [63] Du JX(2009)The intelligent water drops algorithm: a nature-inspired swarm-based optimization algorithm Int J Bio-inspired Comput 11 5508-112
  • [64] Xu H(2011)Principal components analysis by the galaxy-based search algorithm: a novel metaheuristic for continuous optimisation Int J Comput Sci Eng 17 4831-125
  • [65] Heutte L(2011)Spiral dynamics inspired optimization J Adv Comput Intell Intell Inform 46 229-18
  • [66] Luo H(2011)Cuckoo optimization algorithm Appl Soft Comput 13 2592-11
  • [67] Yang Y(2012)Krill herd: a new bio-inspired optimization algorithm Commun Nonlinear Sci Numer Simul 222 175-145
  • [68] Tong B(2012)Transforming geocentric cartesian coordinates to geodetic coordinates by using differential search algorithm Comput Geosci 53 1168-2720
  • [69] Wu F(2013)Mine blast algorithm: a new population based algorithm for solving constrained engineering optimization problems Appl Soft Comput 139 98-1073
  • [70] Fan B(2013)Black hole: a new heuristic optimization approach for data clustering Inf Sci 55 99-93