Clustering Structure Analysis in Time-Series Data With Density-Based Clusterability Measure

被引:23
|
作者
Jokinen, Juho [1 ]
Raty, Tomi [1 ]
Lintonen, Timo [1 ]
机构
[1] VTT Tech Res Ctr Finland, Vuorimiehentie 3 POB 1000, Espoo 02044, Finland
关键词
Clustering; exploratory data analysis; time-series; unsupervised learning; REPRESENTATION; OPTIMIZATION; ALGORITHM; TESTS; TREE;
D O I
10.1109/JAS.2019.1911744
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is used to gain an intuition of the structures in the data. Most of the current clustering algorithms produce a clustering structure even on data that do not possess such structure. In these cases, the algorithms force a structure in the data instead of discovering one. To avoid false structures in the relations of data, a novel clusterability assessment method called density-based clusterability measure is proposed in this paper. It measures the prominence of clustering structure in the data to evaluate whether a cluster analysis could produce a meaningful insight to the relationships in the data. This is especially useful in time-series data since visualizing the structure in time-series data is hard. The performance of the clusterability measure is evaluated against several synthetic data sets and time-series data sets, which illustrate that the density-based clusterability measure can successfully indicate clustering structure of time-series data.
引用
收藏
页码:1332 / 1343
页数:12
相关论文
共 50 条
  • [21] Density-based clustering on massive mobile communication data
    Liu, YF
    Tang, SW
    Yang, DQ
    Chen, Y
    Wang, TJ
    Ma, S
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XI, PROCEEDINGS: COMMUNICATION, NETWORK AND CONTROL SYSTEMS, TECHNOLOGIES AND APPLICATIONS: II, 2003, : 251 - 254
  • [22] Density-Based Clustering of Data Streams at Multiple Resolutions
    Wan, Li
    Ng, Wee Keong
    Dang, Xuan Hong
    Yu, Philip S.
    Zhang, Kuan
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2009, 3 (03)
  • [23] Density-based clustering with non-continuous data
    Azzalini, Adelchi
    Menardi, Giovanna
    COMPUTATIONAL STATISTICS, 2016, 31 (02) : 771 - 798
  • [24] SS-DBSCAN: Semi-Supervised Density-Based Spatial Clustering of Applications With Noise for Meaningful Clustering in Diverse Density Data
    Zaki Abdulhameed, Tiba
    Yousif, Suhad A.
    Samawi, Venus W.
    Imad Al-Shaikhli, Hasnaa
    IEEE ACCESS, 2024, 12 : 131507 - 131520
  • [25] Deep Time-Series Clustering: A Review
    Alqahtani, Ali
    Ali, Mohammed
    Xie, Xianghua
    Jones, Mark W.
    ELECTRONICS, 2021, 10 (23)
  • [26] Clustering short time-series microarray
    Ping, Loh Wei
    Abu Hasan, Yahya
    INTERNATIONAL CONFERENCE ON MATHEMATICAL BIOLOGY 2007, 2008, 971 : 39 - 46
  • [27] Contraction Clustering (RASTER) A Big Data Algorithm for Density-Based Clustering in Constant Memory and Linear Time
    Ulm, Gregor
    Gustavsson, Emil
    Jirstrand, Mats
    MACHINE LEARNING, OPTIMIZATION, AND BIG DATA, MOD 2017, 2018, 10710 : 63 - 75
  • [28] A novel clustering algorithm for time-series data based on precise correlation coefficient matching in the IoT
    Li, Haibo
    Tong, Juncheng
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2019, 16 (06) : 6654 - 6671
  • [29] Stability of Density-Based Clustering
    Rinaldo, Alessandro
    Singh, Aarti
    Nugent, Rebecca
    Wasserman, Larry
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 905 - 948
  • [30] Mathematical Programming Formulations and Algorithms for Discrete k-Median Clustering of Time-Series Data
    Seref, Onur
    Fan, Ya-Ju
    Chaovalitwongse, Wanpracha Art
    INFORMS JOURNAL ON COMPUTING, 2014, 26 (01) : 160 - 172