A Novel LSTM for Multivariate Time Series with Massive Missingness

被引:20
作者
Fouladgar, Nazanin [1 ]
Fraemling, Kary [1 ,2 ]
机构
[1] Umea Univ, Dept Comp Sci, S-90187 Umea, Sweden
[2] Aalto Univ, Sch Sci & Technol, POB 15500, Aalto 00076, Finland
关键词
multivariate time series; regression; massive missingness; LSTM; PREDICTION; IMPUTATION; MODEL; VALUES;
D O I
10.3390/s20102832
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Multivariate time series with missing data is ubiquitous when the streaming data is collected by sensors or any other recording instruments. For instance, the outdoor sensors gathering different meteorological variables may encounter low material sensitivity to specific situations, leading to incomplete information gathering. This is problematic in time series prediction with massive missingness and different missing rate of variables. Contribution addressing this problem on the regression task of meteorological datasets by employing Long Short-Term Memory (LSTM), capable of controlling the information flow with its memory unit, is still missing. In this paper, we propose a novel model called forward and backward variable-sensitive LSTM (FBVS-LSTM) consisting of two decay mechanisms and some informative data. The model inputs are mainly the missing indicator, time intervals of missingness in both forward and backward direction and missing rate of each variable. We employ this information to address the so-called missing not at random (MNAR) mechanism. Separately learning the features of each parameter, the model becomes adapted to deal with massive missingness. We conduct our experiment on three real-world datasets for the air pollution forecasting. The results demonstrate that our model performed well along with other LSTM-derivation models in terms of prediction accuracy.
引用
收藏
页数:17
相关论文
共 41 条
  • [1] Decision Trees in Time Series Reconstruction Problems
    Amato, Alberto
    Calabrese, Marco
    Di Lecce, Vincenzo
    [J]. 2008 IEEE INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, VOLS 1-5, 2008, : 895 - +
  • [2] [Anonymous], 2006, 2006 2 IET INT C INT
  • [3] [Anonymous], ARXIV1905 00125
  • [4] [Anonymous], 1997, Neural Computation
  • [5] [Anonymous], 2019, 11 INT C MOD ID CONT
  • [6] [Anonymous], KNOWLEDGE ENG MANAGE
  • [7] [Anonymous], ARXIV190704155
  • [8] [Anonymous], 2015, APPL MATH
  • [9] Handling missing data in multivariate time series using a vector autoregressive model-imputation (VAR-IM) algorithm
    Bashir, Faraj
    Wei, Hua-Liang
    [J]. NEUROCOMPUTING, 2018, 276 : 23 - 30
  • [10] Time series prediction with recurrent neural networks trained by a hybrid PSO-EA algorithm
    Cai, Xindi
    Zhang, Nian
    Venayagamoorthy, Ganesh K.
    Wunsch, Donald C., II
    [J]. NEUROCOMPUTING, 2007, 70 (13-15) : 2342 - 2353