Using Ant Colony Optimization to Optimize Long Short-Term Memory Recurrent Neural Networks

被引:21
作者
ElSaid, AbdElRahman [1 ]
El Jamiy, Fatima [1 ]
Higgins, James [2 ]
Wild, Brandon [2 ]
Desell, Travis [1 ]
机构
[1] Univ North Dakota, Dept Comp Sci, Grand Forks, ND 58202 USA
[2] Univ North Dakota, Dept Aviat, Grand Forks, ND 58202 USA
来源
GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE | 2018年
关键词
Neuroevolution; Recurrent Neural Networks; Long Short-Term Memory; Ant Colony Optimization; Time Series Data Prediction;
D O I
10.1145/3205455.3205637
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work examines the use of ant colony optimization (ACO) to improve long short-term memory (LSTM) recurrent neural networks (RNNs) by re.ning their cellular structure. The evolved networks were trained on a large database of.ight data records obtained from an airline containing.ights that su.ered from excessive vibration. Results were obtained using MPI (Message Passing Interface) on a high performance computing (HPC) cluster, which evolved 1000 di.erent LSTM cell structures using 208 cores over 5 days. The new evolved LSTM cells showed an improvement in prediction accuracy of 1.37%, reducing the mean prediction error from 6.38% to 5.01% when predicting excessive engine vibrations 10 seconds in the future, while at the same time dramatically reducing the number of trainable weights from 21,170 to 11,650. The ACO optimized LSTM also performed signi.cantly better than traditional Nonlinear Output Error (NOE), Nonlinear AutoRegression with eXogenous (NARX) inputs, and Nonlinear Box-Jenkins (NBJ) models, which only reached error rates of 11.45%, 8.47% and 9.77%, respectively. The ACO algorithm employed could be utilized to optimize LSTM RNNs for any time series data prediction task.
引用
收藏
页码:13 / 20
页数:8
相关论文
共 27 条
[1]  
Annunziato M, 2002, P EUNITE02 ALB PORT
[2]  
[Anonymous], 2021, Nonlinear System Identification, from Classical Approaches to Neural Networks, Fuzzy Models, and Gaussian Processes
[3]  
[Anonymous], 2014, arXiv
[4]  
[Anonymous], 2005, P 4 INT JOINT C AUT, DOI DOI 10.1145/1082473.1082794
[5]  
[Anonymous], 2017, ARXIV170300548
[6]  
[Anonymous], 1997, Neural Computation
[7]  
[Anonymous], 2015, ABS151105942 CORR
[8]  
[Anonymous], 2000, PRINCIPLES NEURAL SC, DOI DOI 10.1007/SPRINGERREFERENCE_183113
[9]  
Byoung-Tzk Zhang, 1993, Complex Systems, V7, P199
[10]   MPI for Python']Python:: Performance improvements and MPI-2 extensions [J].
Dalcin, Lisandro ;
Paz, Rodrigo ;
Storti, Mario ;
D'Elia, Jorge .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (05) :655-662