A time series forest for classification and feature extraction

被引:402
作者
Deng, Houtao [1 ]
Runger, George [2 ]
Tuv, Eugene [3 ]
Vladimir, Martyanov [3 ]
机构
[1] Intuit, Mountain View, CA USA
[2] Arizona State Univ, Tempe, AZ USA
[3] Intel Corp, Chandler, AZ 85226 USA
关键词
Decision tree; Ensemble; Entrance gain; Interpretability; Large margin; Time series classification;
D O I
10.1016/j.ins.2013.02.030
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A tree-ensemble method, referred to as time series forest (TSF), is proposed for time series classification. TSF employs a combination of entropy gain and a distance measure, referred to as the Entrance (entropy and distance) gain, for evaluating the splits. Experimental studies show that the Entrance gain improves the accuracy of TSF. TSF randomly samples features at each tree node and has computational complexity linear in the length of time series, and can be built using parallel computing techniques. The temporal importance curve is proposed to capture the temporal characteristics useful for classification. Experimental studies show that TSF using simple features such as mean, standard deviation and slope is computationally efficient and outperforms strong competitors such as one-nearest-neighbor classifiers with dynamic time warping. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:142 / 153
页数:12
相关论文
共 25 条
  • [1] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [2] Constrained mixture estimation for analysis and robust classification of clinical time series
    Costa, Ivan G.
    Schoenhuth, Alexander
    Hafemeister, Christoph
    Schliep, Alexander
    [J]. BIOINFORMATICS, 2009, 25 (12) : I6 - I14
  • [3] Demsar J, 2006, J MACH LEARN RES, V7, P1
  • [4] MULTIPLE COMPARISONS AMONG MEANS
    DUNN, OJ
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1961, 56 (293) : 52 - &
  • [5] Eruhimov V, 2007, LECT NOTES ARTIF INT, V4702, P414
  • [6] A comparison of alternative tests of significance for the problem of m rankings
    Friedman, M
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1940, 11 : 86 - 92
  • [7] Geurts P., 2001, P 5 EUR C PRINC DAT, P115, DOI [DOI 10.1007/3-540-44794-6_10, 10.1007/3-540-44794-610, DOI 10.1007/3-540-44794-610]
  • [8] Weighted dynamic time warping for time series classification
    Jeong, Young-Seon
    Jeong, Myong K.
    Omitaomu, Olufemi A.
    [J]. PATTERN RECOGNITION, 2011, 44 (09) : 2231 - 2240
  • [9] Keogh E., 2006, The ucr time series classification/clustering home-page
  • [10] Lines Jason, 2012, SIGKDD, P289