Clustering Structure Analysis in Time-Series Data With Density-Based Clusterability Measure

被引:23
|
作者
Jokinen, Juho [1 ]
Raty, Tomi [1 ]
Lintonen, Timo [1 ]
机构
[1] VTT Tech Res Ctr Finland, Vuorimiehentie 3 POB 1000, Espoo 02044, Finland
关键词
Clustering; exploratory data analysis; time-series; unsupervised learning; REPRESENTATION; OPTIMIZATION; ALGORITHM; TESTS; TREE;
D O I
10.1109/JAS.2019.1911744
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is used to gain an intuition of the structures in the data. Most of the current clustering algorithms produce a clustering structure even on data that do not possess such structure. In these cases, the algorithms force a structure in the data instead of discovering one. To avoid false structures in the relations of data, a novel clusterability assessment method called density-based clusterability measure is proposed in this paper. It measures the prominence of clustering structure in the data to evaluate whether a cluster analysis could produce a meaningful insight to the relationships in the data. This is especially useful in time-series data since visualizing the structure in time-series data is hard. The performance of the clusterability measure is evaluated against several synthetic data sets and time-series data sets, which illustrate that the density-based clusterability measure can successfully indicate clustering structure of time-series data.
引用
收藏
页码:1332 / 1343
页数:12
相关论文
共 50 条
  • [1] Clustering Structure Analysis in Time-Series Data With Density-Based Clusterability Measure
    Juho Jokinen
    Tomi R?ty
    Timo Lintonen
    IEEE/CAAJournalofAutomaticaSinica, 2019, 6 (06) : 1332 - 1343
  • [2] ChronoClust: Density-based clustering and cluster high-dimensional time-series data
    Putri, Givanna H.
    Read, Mark N.
    Koprinska, Irena
    Singh, Deeksha
    Rohm, Uwe
    Ashhurst, Thomas M.
    King, Nicholas J. C.
    KNOWLEDGE-BASED SYSTEMS, 2019, 174 : 9 - 26
  • [3] A density-based time-series data analysis methodology for shadow detection in rooftop photovoltaic systems
    Tsafarakis, Odysseas
    van Sark, Wilfried G. J. H. M.
    PROGRESS IN PHOTOVOLTAICS, 2023, 31 (05): : 506 - 523
  • [4] Time-Series Clustering for Data Analysis in Smart Grid
    Maurya, Akanksha
    Akyurek, Alper Sinan
    Aksanli, Baris
    Rosing, Tajana Simunic
    2016 IEEE INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2016,
  • [5] A new clustering method using wavelet based probability density functions for identifying patterns in time-series data
    Kordestani, Mojtaba
    Alkhateeb, Abedalrhman
    Rezaeian, Iman
    Rueda, Luis
    Saif, Mehrdad
    2016 IEEE EMBS INTERNATIONAL STUDENT CONFERENCE (ISC), 2016,
  • [6] Efficient layered density-based clustering of categorical data
    Andreopoulos, Bill
    An, Aijun
    Wang, Xiaogang
    Labudde, Dirk
    JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (02) : 365 - 376
  • [7] Time-series clustering - A decade review
    Aghabozorgi, Saeed
    Shirkhorshidi, Ali Seyed
    Teh Ying Wah
    INFORMATION SYSTEMS, 2015, 53 : 16 - 38
  • [8] Clustering multivariate time-series data
    Singhal, A
    Seborg, DE
    JOURNAL OF CHEMOMETRICS, 2005, 19 (08) : 427 - 438
  • [9] Adaptive Density-Based Spatial Clustering for Massive Data Analysis
    Cai, Zihao
    Wang, Jian
    He, Kejing
    IEEE ACCESS, 2020, 8 : 23346 - 23358
  • [10] Novel density-based and hierarchical density-based clustering algorithms for uncertain data
    Zhang, Xianchao
    Liu, Han
    Zhang, Xiaotong
    NEURAL NETWORKS, 2017, 93 : 240 - 255