DSTP-RNN: A dual-stage two-phase attention-based recurrent neural network for long-term and multivariate time series prediction

被引:244
作者
Liu, Yeqi [1 ,2 ,3 ]
Gong, Chuanyang [1 ,2 ,3 ]
Yang, Ling [1 ,2 ,3 ]
Chen, Yingyi [1 ,2 ,3 ,4 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] Minist Agr, Key Lab Agr Informat Acquisit Technol, Beijing 100083, Peoples R China
[3] Beijing Engn & Technol Res Ctr Internet Things Ag, Beijing 100083, Peoples R China
[4] China Agr Univ, Natl Innovat Ctr Digital Fishery, Beijing 100083, Peoples R China
关键词
Time series prediction; Spatio-temporal relationship; Attention mechanism; Dual-stage two-phase model; Deep attention network; MODEL;
D O I
10.1016/j.eswa.2019.113082
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Long-term prediction of multivariate time series is still an important but challenging problem. The key to solve this problem is capturing (1) the spatial correlations at the same time, (2) the spatio-temporal relationships at different times, and (3) long-term dependency of the temporal relationships between different series. Attention-based recurrent neural networks (RNN) can effectively represent and learn the dynamic spatio-temporal relationships between exogenous series and target series, but they only perform well in one-step time prediction and short-term time prediction. In this paper, inspired by human attention mechanism including the dual-stage two-phase (DSTP) model and the influence mechanism of target information and non-target information, we propose DSTP-based RNN (DSTP-RNN) and DSTP-RNN-II respectively for long-term time series prediction. Specifically, we first propose the DSTP-based structure to enhance the spatial correlations between exogenous series. The first phase produces violent but decentralized response weight, while the second phase leads to stationary and concentrated response weight. Then, we employ multiple attentions on target series to boost the long-term dependency. Finally, we study the performance of deep spatial attention mechanism and provide interpretation. Experimental results demonstrate that the present work can be successfully used to develop expert or intelligent systems for a wide range of applications, with state-of-the-art performances superior to nine baseline methods on four datasets in the fields of energy, finance, environment and medicine, respectively. Overall, the present work carries a significant value not merely in the domain of machine intelligence and deep learning, but also in the fields of many applications. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 54 条
  • [1] Allen-Zhu Zeyuan, 2018, ARXIV181103962
  • [2] ARIMA-based decoupled time series forecasting of electric vehicle charging demand for stochastic power system operation
    Amini, M. Hadi
    Kargarian, Amin
    Karabasoglu, Orkun
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2016, 140 : 378 - 390
  • [3] [Anonymous], ARXIV180600685
  • [4] [Anonymous], ARXIV180809772V2
  • [5] [Anonymous], 2014, ABS14091259 CORR
  • [6] [Anonymous], NEURAL COMPUTING APP
  • [7] [Anonymous], IEEE T IMAGE PROCESS
  • [8] [Anonymous], KNOWLEDGE BASED SYST
  • [9] [Anonymous], 2015, INT C LEARN REPR ICL
  • [10] [Anonymous], 2018, ARXIV180606384