Autoregressive forests for multivariate time series modeling

被引:70
作者
Tuncel, Kerem Sinan [1 ]
Baydogan, Mustafa Gokce [1 ]
机构
[1] Bogazici Univ, Dept Ind Engn, TR-34342 Bebek, Turkey
关键词
Multivariate time series; Vector autoregression; Time series representation; Ensemble learning; Classification; REPRESENTATION;
D O I
10.1016/j.patcog.2017.08.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multivariate Time Series (MTS) modeling has received significant attention in the last decade because of the complex nature of the data. Efficient representations are required to deal with the high dimensionality due to the increase in the number of variables and duration of the time series in different applications. For example, model-based approaches such as Hidden Markov Models (HMM) or autoregressive (AR) models focus on finding a model to represent the series with the model parameters to handle this problem. Both HMM and AR models are known to be very successful in the representation of the time series however most of the HMM approaches assume independence and traditional AR models consider linear dependence between the variables of MTS. As most of the real systems exhibit nonlinear relations, traditional approaches fail to represent the time series. To handle these problems, we propose an autoregressive tree-based ensemble approach that can model the nonlinear behavior embedded in the time series with the help of tree-based learning. Multivariate autoregressive forest, namely mv-ARF, is a nonparametric vector autoregression approach which provides an easy and efficient representation that scales well with large datasets. An error-based representation based on the learned models is the basis of the proposed approach. This is very similar to time series kernels used for multivariate time series classification problems. We test mv-ARF on MTS classification problems and show that mv-ARF provides fast and competitive results on benchmark datasets from several domains. Furthermore, mv-ARF provides a research direction for vector autoregressive models that breaks from the linear dependency models to potentially foster other promising nonlinear approaches. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:202 / 215
页数:14
相关论文
共 28 条
  • [1] Correlation based dynamic time warping of multivariate time series
    Banko, Zoltan
    Abonyi, Janos
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (17) : 12814 - 12823
  • [2] Bashir F, 2005, IEEE IMAGE PROC, P3401
  • [3] Baydogan M.G., 2016, LPS
  • [4] Time series representation and similarity based on local autopatterns
    Baydogan, Mustafa Gokce
    Runger, George
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2016, 30 (02) : 476 - 509
  • [5] Learning a symbolic representation for multivariate time series classification
    Baydogan, Mustafa Gokce
    Runger, George
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 29 (02) : 400 - 422
  • [6] Berndt D.J., 1994, P KDD WORKSH SEATTL, P359, DOI DOI 10.5555/3000850.3000887
  • [7] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [8] Efficient time series matching by wavelets
    Chan, KP
    Fu, AWC
    [J]. 15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 126 - 133
  • [9] Chandra B, 2008, IEEE SYS MAN CYBERN, P891
  • [10] CMU, 2012, CMU GRAPH LAB MOT CA