Time-series prediction using a regularized self-organizing long short-term memory neural network

被引:8
作者
Duan, Hao-shan
Meng, Xi
Tang, Jian
Qiao, Jun-fei [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Time -series prediction; Long short-term memory; Regularization; Self -organizing algorithm; Structural design; LEARNING ALGORITHM; IDENTIFICATION; MODEL;
D O I
10.1016/j.asoc.2023.110553
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Industrial process data are naturally in the form of complex time-series with high nonlinearities and dynamics. Long short-term memory (LSTM) networks are suitable for developing prediction models to handle nonlinear and dynamic process. However, LSTM neural networks have typically large and predefined structures, which may result in overfitting, and an optimal hidden neurons for a given problem cannot be automatically obtained. For this reason, a regularized self-organizing LSTM (RSOLSTM) is proposed to optimize both the structure and the parameters of the network. First, an adaptive learning algorithm based on l2-norm regularization is introduced for parameter adjustment. Thereafter, both the prediction accuracy and weight dispersion are considered to avoid overfitting. Second, a growing strategy is designed based on hidden neuronal sensitivity. The structure of the LSTM can then be determined automatically with improved compactness. Finally, a convergence analysis is performed to ensure the feasibility of RSO-LSTM. To demonstrate the merits of the proposed RSO-LSTM for timeseries prediction, its results for three benchmark experiments and real industrial data of a municipal solid waste incineration process were examined and compared with those of other methods. The results indicated the superiority and potential of RSO-LSTM for industrial applications. & COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 41 条
[1]   Exploiting dynamic spatio-temporal graph convolutional neural networks for citywide traffic flows prediction [J].
Ali, Ahmad ;
Zhu, Yanmin ;
Zakarya, Muhammad .
NEURAL NETWORKS, 2022, 145 :233-247
[2]   LSTM-MSNet: Leveraging Forecasts on Sets of Related Time Series With Multiple Seasonal Patterns [J].
Bandara, Kasun ;
Bergmeir, Christoph ;
Hewamalage, Hansika .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (04) :1586-1599
[3]   Automatic robust estimation for exponential smoothing: Perspectives from statistics and machine learning [J].
Barrow, Devon ;
Kourentzes, Nikolaos ;
Sandberg, Rickard ;
Niklewski, Jacek .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 160
[4]   Robust Deep Gaussian Process-Based Probabilistic Electrical Load Forecasting Against Anomalous Events [J].
Cao, Di ;
Zhao, Junbo ;
Hu, Weihao ;
Zhang, Yingchen ;
Liao, Qishu ;
Chen, Zhe ;
Blaabjerg, Frede .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (02) :1142-1153
[5]   Demand Forecasting of the Fused Magnesia Smelting Process With System Identification and Deep Learning [J].
Chai, Tianyou ;
Zhang, Jingwen ;
Yang, Tao .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (12) :8387-8396
[6]   Forecasting day-ahead electricity load using a multiple equation time series approach [J].
Clements, A. E. ;
Hurn, A. S. ;
Li, Z. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2016, 251 (02) :522-530
[7]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[8]   Convergent newton method and neural network for the electric energy usage prediction [J].
de Jesus Rubio, Jose ;
Antonio Islas, Marco ;
Ochoa, Genaro ;
Ricardo Cruz, David ;
Garcia, Enrique ;
Pacheco, Jaime .
INFORMATION SCIENCES, 2022, 585 :89-112
[9]   A self-organizing recurrent fuzzy neural network based on multivariate time series analysis [J].
Ding, Haixu ;
Li, Wenjing ;
Qiao, Junfei .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (10) :5089-5109
[10]   Deep Air Quality Forecasting Using Hybrid Deep Learning Framework [J].
Du, Shengdong ;
Li, Tianrui ;
Yang, Yan ;
Horng, Shi-Jinn .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (06) :2412-2424