Time series cluster kernel for learning similarities between multivariate time series with missing data

被引:72
作者
Mikalsen, Karl Oyvind [1 ,2 ]
Bianchi, Filippo Maria [2 ,3 ]
Soguero-Ruiz, Cristina [2 ,4 ]
Jenssen, Robert [2 ,3 ]
机构
[1] UiT Arctic Univ Norway, Dept Math & Stat, Tromso, Norway
[2] UiT Machine Learning Grp, Tromso, Norway
[3] UiT, Dept Phys & Technol, Tromso, Norway
[4] Univ Rey Juan Carlos, Dept Signal Theory & Commun, Telemat & Comp, Fuenlabrada, Spain
关键词
Multivariate time series; Similarity measures; Kernel methods; Missing data; Gaussian mixture models; Ensemble learning; PATTERN-RECOGNITION; CLASSIFICATION; REPRESENTATION; MODELS; DISCOVERY;
D O I
10.1016/j.patcog.2017.11.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Similarity-based approaches represent a promising direction for time series analysis. However, many such methods rely on parameter tuning, and some have shortcomings if the time series are multivariate (MTS), due to dependencies between attributes, or the time series contain missing data. In this paper, we address these challenges within the powerful context of kernel methods by proposing the robust time series cluster kernel (TCK). The approach taken leverages the missing data handling properties of Gaussian mixture models (GMM) augmented with informative prior distributions. An ensemble learning approach is exploited to ensure robustness to parameters by combining the clustering results of many GMM to form the final kernel. We evaluate the TCK on synthetic and real data and compare to other state-of-the-art techniques. The experimental results demonstrate that the TCK is robust to parameter choices, provides competitive results for MTS without missing data and outstanding results for missing data. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:569 / 581
页数:13
相关论文
共 98 条
[1]   Time-series clustering - A decade review [J].
Aghabozorgi, Saeed ;
Shirkhorshidi, Ali Seyed ;
Teh Ying Wah .
INFORMATION SYSTEMS, 2015, 53 :16-38
[2]  
Anava O, 2015, PR MACH LEARN RES, V37, P2191
[3]  
[Anonymous], 2001, Learning with Kernels |
[4]  
[Anonymous], 2007, EM ALGORITHM EXTENSI
[5]  
[Anonymous], 2004, KERNEL METHODS PATTE
[6]  
[Anonymous], 2017, TIME SERIES CLUSTER
[7]  
[Anonymous], 2009, Kernel methods for remote sensing data analysis
[8]  
[Anonymous], 1984, GRAD TEXTS MATH
[9]  
[Anonymous], 1994, USING DYNAMIC TIME W
[10]  
[Anonymous], 1983, Applied time series and Box-Jenkins models / Walter Vandaele