Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data

被引:0
|
作者
Hallac, David [1 ]
Vare, Sagar [1 ]
Boyd, Stephen [1 ]
Leskovec, Jure [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Subsequence clustering of multivariate time series is a useful tool for discovering repeated patterns in temporal data. Once these patterns have been discovered, seemingly complicated datasets can be interpreted as a temporal sequence of only a small number of states, or clusters. However, discovering these patterns is challenging because it requires simultaneous segmentation and clustering of the time series. Here we propose a new method of model-based clustering, which we call Toeplitz Inverse Covariance-based Clustering (TICC). Each cluster in the TICC method is defined by a correlation network, or Markov random field (MRF), characterizing the interdependencies between different observations in a typical subsequence of that cluster. Based on this graphical representation, TICC simultaneously segments and clusters the time series data. We solve the TICC problem through a scalable algorithm that is able to efficiently solve for tens of millions of observations. We validate our approach by comparing TICC to several stateof-the-art baselines in a series of synthetic experiments, and we then demonstrate on an automobile dataset how TICC can be used to learn interpretable clusters in real-world scenarios.
引用
收藏
页码:5254 / 5258
页数:5
相关论文
共 50 条
  • [41] Surveillance of the covariance matrix of multivariate nonlinear time series
    Sliwa, P
    Schmid, W
    STATISTICS, 2005, 39 (03) : 221 - 246
  • [42] Feature Representation and Similarity Measure Based on Covariance Sequence for Multivariate Time Series
    Li, Hailin
    Lin, Chunpei
    Wan, Xiaoji
    Li, Zhengxin
    IEEE ACCESS, 2019, 7 : 67018 - 67026
  • [43] Robust estimation for the covariance matrix of multivariate time series based on normal mixtures
    Kim, Byungsoo
    Lee, Sangyeol
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 57 (01) : 125 - 140
  • [44] Multivariate time series clustering based on common principal component analysis
    Li, Hailin
    NEUROCOMPUTING, 2019, 349 : 239 - 247
  • [45] Multivariate time-series clustering based on component relationship networks
    Li, Hailin
    Du, Tian
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 173
  • [46] Structure-based statistical features and multivariate time series clustering
    Wang, Xiaozhe
    Wirth, Anthony
    Wang, Liang
    ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 351 - 360
  • [47] Spider algorithm for clustering multivariate time series
    Department of Computational Intelligence and Systems Science, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Midori-ku, Yokohama 226-8503, Japan
    WSEAS Trans. Inf. Sci. Appl., 2006, 3 (485-492):
  • [48] An ensemble solution for multivariate time series clustering
    Vazquez, Iago
    Villar, Jose R.
    Sedano, Javier
    de la Cal, Enrique
    Simic, Svetlana
    NEUROCOMPUTING, 2021, 457 (457) : 182 - 192
  • [49] An Efficient Clustering Algorithm for Multivariate Time Series
    Zhou, Da-Zhuo
    Zhang, Bo
    EBM 2010: INTERNATIONAL CONFERENCE ON ENGINEERING AND BUSINESS MANAGEMENT, VOLS 1-8, 2010, : 5190 - 5193
  • [50] A Preliminary Study on Multivariate Time Series Clustering
    Vaquez, Iago
    Villar, Jose R.
    Sedano, Javier
    Simic, Svetlana
    14TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS (SOCO 2019), 2020, 950 : 473 - 480