Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data

被引:0
|
作者
Hallac, David [1 ]
Vare, Sagar [1 ]
Boyd, Stephen [1 ]
Leskovec, Jure [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Subsequence clustering of multivariate time series is a useful tool for discovering repeated patterns in temporal data. Once these patterns have been discovered, seemingly complicated datasets can be interpreted as a temporal sequence of only a small number of states, or clusters. However, discovering these patterns is challenging because it requires simultaneous segmentation and clustering of the time series. Here we propose a new method of model-based clustering, which we call Toeplitz Inverse Covariance-based Clustering (TICC). Each cluster in the TICC method is defined by a correlation network, or Markov random field (MRF), characterizing the interdependencies between different observations in a typical subsequence of that cluster. Based on this graphical representation, TICC simultaneously segments and clusters the time series data. We solve the TICC problem through a scalable algorithm that is able to efficiently solve for tens of millions of observations. We validate our approach by comparing TICC to several stateof-the-art baselines in a series of synthetic experiments, and we then demonstrate on an automobile dataset how TICC can be used to learn interpretable clusters in real-world scenarios.
引用
收藏
页码:5254 / 5258
页数:5
相关论文
共 50 条
  • [31] A Sample Covariance-Based Approach For Spatial Binary Data
    Sahar Zarmehri
    Ephraim M. Hanks
    Lin Lin
    Journal of Agricultural, Biological and Environmental Statistics, 2021, 26 : 220 - 249
  • [32] A Distance Covariance-based Kernel for Nonlinear Causal Clustering in Heterogeneous Populations
    Markham, Alex
    Das, Richeek
    Grosse-Wentrup, Moritz
    CONFERENCE ON CAUSAL LEARNING AND REASONING, VOL 177, 2022, 177
  • [33] Discrimination and clustering for multivariate time series
    Kakizawa, Y
    Shumway, RH
    Taniguchi, M
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1998, 93 (441) : 328 - 340
  • [34] Tennis Multivariate Time Series Clustering
    Skublewska-Paszkowska, Maria
    Karczmarek, Pawel
    Lukasik, Edyta
    IEEE CIS INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS 2021 (FUZZ-IEEE), 2021,
  • [35] A Model-Based Multivariate Time Series Clustering Algorithm
    Zhou, Pei-Yuan
    Chan, Keith C. C.
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2014, 8643 : 805 - 817
  • [36] Contact State Clustering Analysis Based on Multivariate Time Series
    Liu N.-L.
    Zhou X.-D.
    Liu Z.-M.
    Cui L.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2020, 49 (05): : 660 - 665
  • [37] The clustering algorithm based on multivariate time series with unequal length
    Du, Haizhou
    Journal of Computational Information Systems, 2011, 7 (16): : 5798 - 5805
  • [38] Evaluation of multivariate time series clustering for imputation of air pollution data
    Alahamade, Wedad
    Lake, Iain
    Reeves, Claire E.
    De La Iglesia, Beatriz
    GEOSCIENTIFIC INSTRUMENTATION METHODS AND DATA SYSTEMS, 2021, 10 (02) : 265 - 285
  • [39] Fuzzy clustering based on feature weights for multivariate time series
    Li, Hailin
    Wei, Miao
    KNOWLEDGE-BASED SYSTEMS, 2020, 197
  • [40] Clustering of Multivariate Time Series Data Using Particle Swarm Optimization
    Ahmadi, Abbas
    Mozafarinia, Atefeh
    Mohebi, Azadeh
    2015 INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2015, : 176 - 181