Efficiency of Time Series Clustering Method Based on Distribution of Difference Using Several Distances

被引:0
作者
Thanakulkairid, Phudit [1 ]
Trakulthongchai, Tanupat [1 ]
Prabpon, Naruesorn [1 ]
Vatiwutipong, Pat [2 ]
机构
[1] Kamnoetvidya Sci Acad, Rayong 21210, Thailand
[2] Kamnoetvidya Sci Acad, Dept Math & Comp Sci, Rayong 21210, Thailand
来源
2022 19TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2022) | 2022年
关键词
time series clustering; distance measurement; first order difference; graph partitioning;
D O I
10.1109/JCSSE54890.2022.9836279
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Clustering is a machine learning method widely used in time series analysis. In this work, we cluster time series by applying four distance functions: Euclidean distance, Kullback-Leibler divergence, Wasserstein distance, and dynamic time warping. We consider the distribution of the first-order difference of time series and compare time series using such distributions under each of the four distances. Then, we model each time series as a vertex of a graph and the distance between each pair of time series as a weighted edge. Graph partitioning is performed as a clustering method. The advantages and draw-backs of each method are discussed. The experimental results show that Euclidean distance and Kullback-Leibler divergence perform better and more efficient clustering than the other two.
引用
收藏
页数:6
相关论文
共 12 条
  • [1] Time-series clustering - A decade review
    Aghabozorgi, Saeed
    Shirkhorshidi, Ali Seyed
    Teh Ying Wah
    [J]. INFORMATION SYSTEMS, 2015, 53 : 16 - 38
  • [2] Berthold MR, 2016, Arxiv, DOI arXiv:1601.02213
  • [3] Faloutsos C., 1994, SIGMOD Record, V23, P419, DOI 10.1145/191843.191925
  • [4] Weighted dynamic time warping for time series classification
    Jeong, Young-Seon
    Jeong, Myong K.
    Omitaomu, Olufemi A.
    [J]. PATTERN RECOGNITION, 2011, 44 (09) : 2231 - 2240
  • [5] Kantorovich LV, 1958, Vestnik Leningradskogo Universiteta, V13, P52
  • [6] ON INFORMATION AND SUFFICIENCY
    KULLBACK, S
    LEIBLER, RA
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1951, 22 (01): : 79 - 86
  • [7] Lee T., 2014, CLUSTERING TIME SERI
  • [8] Mi L., 2018, PROC EUR C COMPUT VI, P322, DOI DOI 10.1007/978-3-030-01267-0-20
  • [9] Muller Meinard, 2007, Information retrieval for music and motion, P69, DOI DOI 10.1007/978-3-540-74048-34
  • [10] Ho N, 2017, PR MACH LEARN RES, V70