DAFA-BiLSTM: Deep Autoregression Feature Augmented Bidirectional LSTM network for time series prediction

被引:43
作者
Wang, Heshan [1 ]
Zhang, Yiping [1 ]
Liang, Jing [1 ,3 ]
Liu, Lili [2 ]
机构
[1] Zhengzhou Univ, Coll Elect Engn, Zhengzhou 450001, Peoples R China
[2] Shaanxi Normal Univ, Sch Phys & Informat Technol, Xian 710119, Peoples R China
[3] Kexue Rd 100, Zhengzhou 450001, Peoples R China
基金
中国国家自然科学基金;
关键词
Time series prediction; Long short-term memory; Deep recurrent neural network; Feature augmented; Vector autoregression transformation;
D O I
10.1016/j.neunet.2022.10.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series forecasting models that use the past information of exogenous or endogenous sequences to forecast future series play an important role in the real world because most real-world time series datasets are rich in time-dependent information. Most conventional prediction models for time series datasets are time-consuming and fraught with complex limitations because they usually fail to adequately exploit the latent spatial dependence between pairs of variables. As a successful variant of recurrent neural networks, the long short-term memory network (LSTM) has been demonstrated to have stronger nonlinear dynamics to store sequential data than traditional machine learning models. Nevertheless, the common shallow LSTM architecture has limited capacity to fully extract the transient characteristics of long interval sequential datasets. In this study, a novel deep autoregression feature augmented bidirectional LSTM network (DAFA-BiLSTM) is proposed as a new deep BiLSTM architecture for time series prediction. Initially, the input vectors are fed into a vector autoregression (VA) transformation module to represent the time-delayed linear and nonlinear properties of the input signals in an unsupervised way. Then, the learned nonlinear combination vectors of VA are progressively fed into different layers of BiLSTM and the output of the previous BiLSTM module is also concatenated with the time-delayed linear vectors of the VA as an augmented feature to form new additional input signals for the next adjacent BiLSTM layer. Extensive real-world time series applications are addressed to demonstrate the superiority and robustness of the proposed DAFA-BiLSTM. Comparative experimental results and statistical analysis show that the proposed DAFA-BiLSTM has good adaptive performance as well as robustness even in noisy environment. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页码:240 / 256
页数:17
相关论文
共 45 条
  • [1] Crude oil time series prediction model based on LSTM network with chaotic Henry gas solubility optimization
    Altan, Aytac
    Karasu, Seckin
    [J]. ENERGY, 2022, 242
  • [2] A new hybrid model for wind speed forecasting combining long short-term memory neural network, decomposition methods and grey wolf optimizer
    Altan, Aytac
    Karasu, Seckin
    Zio, Enrico
    [J]. APPLIED SOFT COMPUTING, 2021, 100
  • [3] Time series trend detection and forecasting using complex network topology analysis
    Anghinoni, Leandro
    Zhao, Liang
    Ji, Donghong
    Pan, Heng
    [J]. NEURAL NETWORKS, 2019, 117 : 295 - 306
  • [4] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT
    BENGIO, Y
    SIMARD, P
    FRASCONI, P
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 157 - 166
  • [5] Dual-orthogonal radial basis function networks for nonlinear time series prediction
    Billings, SA
    Hong, X
    [J]. NEURAL NETWORKS, 1998, 11 (03) : 479 - 493
  • [6] Reservoir computing and extreme learning machines for non-linear time-series data analysis
    Butcher, J. B.
    Verstraeten, D.
    Schrauwen, B.
    Day, C. R.
    Haycock, P. W.
    [J]. NEURAL NETWORKS, 2013, 38 : 76 - 89
  • [7] Multi-dimensional recurrent neural network for remaining useful life prediction under variable operating conditions and multiple fault modes
    Cheng, Yiwei
    Wang, Chao
    Wu, Jun
    Zhu, Haiping
    Lee, C. K. M.
    [J]. APPLIED SOFT COMPUTING, 2022, 118
  • [8] Continual learning for recurrent neural networks: An empirical evaluation
    Cossu, Andrea
    Carta, Antonio
    Lomonaco, Vincenzo
    Bacciu, Davide
    [J]. NEURAL NETWORKS, 2021, 143 : 607 - 627
  • [9] Stacked bidirectional and unidirectional LSTM recurrent neural network for forecasting network-wide traffic state with missing values
    Cui, Zhiyong
    Ke, Ruimin
    Pu, Ziyuan
    Wang, Yinhai
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 118
  • [10] Temporal Attention-Augmented Bilinear Network for Financial Time-Series Data Analysis
    Dat Thanh Tran
    Iosifidis, Alexandros
    Kanniainen, Juho
    Gabbouj, Moncef
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (05) : 1407 - 1418