Accelerated Sequential Data Clustering

被引:0
作者
Mortazavi, Reza [1 ]
Enayati, Elham [2 ]
Basiri, Abdolali [3 ]
机构
[1] Damghan Univ, Sch Engn, Damghan 3671645667, Iran
[2] Univ Bojnord, Fac Basic Sci, Dept Comp Sci, Bojnord, Iran
[3] Damghan Univ, Sch Math & Comp Sci, Damghan 3671645667, Iran
关键词
Sequential data clustering; Optimal clustering; Greedy optimization; Time series; ALGORITHM; NETWORK;
D O I
10.1007/s00357-024-09472-4
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Data clustering is an important task in the field of data mining. In many real applications, clustering algorithms must consider the order of data, resulting in the problem of clustering sequential data. For instance, analyzing the moving pattern of an object and detecting community structure in a complex network are related to sequential data clustering. The constraint of the continuous region prevents previous clustering algorithms from being directly applied to the problem. A dynamic programming algorithm was proposed to address the issue, which returns the optimal sequential data clustering. However, it is not scalable and hence the practicality is limited. This paper revisits the solution and enhances it by introducing a greedy stopping condition. This condition halts the algorithm's search process when it is likely that the optimal solution has been found. Experimental results on multiple datasets show that the algorithm is much faster than its original solution while the optimality gap is negligible.
引用
收藏
页码:245 / 263
页数:19
相关论文
共 36 条
  • [1] Fair Clustering via Equitable Group Representations
    Abbasi, Mohsen
    Bhaskara, Aditya
    Venkatasubramanian, Suresh
    [J]. PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 504 - 514
  • [2] A novel time series clustering method with fine-tuned support vector regression for customer behavior analysis
    Abbasimehr, Hossein
    Baghery, Farzam Sheikh
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 204
  • [3] NP-hardness of Euclidean sum-of-squares clustering
    Aloise, Daniel
    Deshpande, Amit
    Hansen, Pierre
    Popat, Preyas
    [J]. MACHINE LEARNING, 2009, 75 (02) : 245 - 248
  • [4] Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
  • [5] Application of self-organizing map (SOM) and K-means clustering algorithms for portraying geochemical anomaly patterns in Moalleman district, NE Iran
    Bigdeli, Amirreza
    Maghsoudi, Abbas
    Ghezelbash, Reza
    [J]. JOURNAL OF GEOCHEMICAL EXPLORATION, 2022, 233
  • [6] Weighted score-driven fuzzy clustering of time series with a financial application
    Cerqueti, Roy
    D'Urso, Pierpaolo
    De Giovanni, Livia
    Giacalone, Massimiliano
    Mattera, Raffaele
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 198
  • [7] An efficient greedy K-means algorithm for global gene trajectory clustering
    Chan, ZSH
    Collins, L
    Kasabov, N
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2006, 30 (01) : 137 - 141
  • [8] MST-GAT: A multimodal spatial-temporal graph attention network for time series anomaly detection
    Ding, Chaoyue
    Sun, Shiliang
    Zhao, Jing
    [J]. INFORMATION FUSION, 2023, 89 : 527 - 536
  • [9] K-centroid link: a novel hierarchical clustering linkage method
    Dogan, Alican
    Birant, Derya
    [J]. APPLIED INTELLIGENCE, 2022, 52 (05) : 5537 - 5560
  • [10] Dupin N., 2018, 7 INT C MET NAT INSP, P1