Clustering Structure Analysis in Time-Series Data With Density-Based Clusterability Measure

被引:23
|
作者
Jokinen, Juho [1 ]
Raty, Tomi [1 ]
Lintonen, Timo [1 ]
机构
[1] VTT Tech Res Ctr Finland, Vuorimiehentie 3 POB 1000, Espoo 02044, Finland
关键词
Clustering; exploratory data analysis; time-series; unsupervised learning; REPRESENTATION; OPTIMIZATION; ALGORITHM; TESTS; TREE;
D O I
10.1109/JAS.2019.1911744
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is used to gain an intuition of the structures in the data. Most of the current clustering algorithms produce a clustering structure even on data that do not possess such structure. In these cases, the algorithms force a structure in the data instead of discovering one. To avoid false structures in the relations of data, a novel clusterability assessment method called density-based clusterability measure is proposed in this paper. It measures the prominence of clustering structure in the data to evaluate whether a cluster analysis could produce a meaningful insight to the relationships in the data. This is especially useful in time-series data since visualizing the structure in time-series data is hard. The performance of the clusterability measure is evaluated against several synthetic data sets and time-series data sets, which illustrate that the density-based clusterability measure can successfully indicate clustering structure of time-series data.
引用
收藏
页码:1332 / 1343
页数:12
相关论文
共 50 条
  • [41] The analysis of chaotic time-series data
    Kostelich, EJ
    SYSTEMS & CONTROL LETTERS, 1997, 31 (05) : 313 - 319
  • [42] Density-based clustering for data containing two types of points
    Pei, Tao
    Wang, Weiyi
    Zhang, Hengcai
    Ma, Ting
    Du, Yunyan
    Zhou, Chenghu
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2015, 29 (02) : 175 - 193
  • [43] Clustering Time-Series Gene Expression Data with Unequal Time Intervals
    Rueda, Luis
    Bari, Ataul
    Ngom, Alioune
    TRANSACTIONS ON COMPUTATIONAL SYSTEMS BIOLOGY X, 2008, 5410 : 100 - 123
  • [44] Time-Series Data Mining
    Esling, Philippe
    Agon, Carlos
    ACM COMPUTING SURVEYS, 2012, 45 (01)
  • [45] Mobile Networks Classification Based on Time-Series Clustering
    Lu, Shun
    Qian, Bing
    Zhao, Long-Gang
    Sun, Qiong
    2022 IEEE 5TH INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION ENGINEERING, ICECE, 2022, : 65 - 71
  • [46] Incremental Clustering of Time-Series by Fuzzy Clustering
    Aghabozorgi, Saeed
    Saybani, Mahmoud Reza
    Teh, Ying Wah
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2012, 28 (04) : 671 - 688
  • [47] A hybrid shape-based image clustering using time-series analysis
    Mondal, Atreyee
    Dey, Nilanjan
    Fong, Simon
    Ashour, Amira S.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (03) : 3793 - 3808
  • [48] Fourier Magnitude-Based Privacy-Preserving Clustering on Time-Series Data
    Kim, Hea-Suk
    Moon, Yang-Sae
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (06): : 1648 - 1651
  • [49] Comparative Analysis Review of Pioneering DBSCAN and Successive Density-Based Clustering Algorithms
    Bushra, Adil Abdu
    Yi, Gangman
    IEEE ACCESS, 2021, 9 : 87918 - 87935
  • [50] Robust Local Triangular Kernel Density-based Clustering for High-dimensional Data
    Musdholifah, Aina
    Hashim, Siti Zaiton Mohd
    2013 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2013, : 24 - 32