Dynamic and Static Enhanced BIRCH for Functional Data Clustering

被引:0
|
作者
Li, Wang [1 ]
Li, Hanfang [1 ]
Luo, Youxi [1 ]
机构
[1] Hubei Univ Technol, Sch Sci, Wuhan 430068, Peoples R China
关键词
Clustering algorithms; Heuristic algorithms; Clustering methods; Principal component analysis; Feature extraction; Prediction algorithms; Data models; Functional data; clustering; BIRCH; dynamic and static information fusion; HIGH-DIMENSIONAL DATA; K-MEANS; DENSITY; ALGORITHM;
D O I
10.1109/ACCESS.2023.3322929
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate and efficient clustering of large-scale functional data is of utmost importance in the era of big data. However, the current research falls short in fully considering the differentiability inherent in functional data. To tackle this significant challenge, we propose a novel method, namely Dynamic and Static Enhanced-BIRCH (DSE-BIRCH), which incorporates both the constant and derivate features to simultaneously measure the static and dynamic distances between functional samples. To this end, a novel matrix factorization-based approach is introduced to transform constant features, extracted through principal component analysis, into derivative features. Subsequently, these two sets of features are fused to form global clustering features with different weighting coefficients are assigned to each of them, reflecting their respective importance. Finally, an enhanced BIRCH algorithm is employed to handle both static and dynamic constraints, enabling hierarchical clustering from a more comprehensive perspective. The mathematical definition of the algorithm is rigorously provided. The superior empirical performance of our method on publicly available datasets and simulated datasets fully demonstrates its effective capture of dynamic information and its capability to achieve accurate clustering on real-world data. Further experiments involving noise and complexity attest to the algorithm's robustness and efficiency, highlighting its broad potential for applications in various complex scenarios involving large-scale functional data.
引用
收藏
页码:111448 / 111465
页数:18
相关论文
共 50 条
  • [21] Malware Family Identification with BIRCH Clustering
    Pitolli, Gregorio
    Aniello, Leonardo
    Laurenza, Giuseppe
    Querzoni, Leonardo
    Baldoni, Roberto
    2017 INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2017,
  • [22] Functional data clustering using K-means and random projection with applications to climatological data
    Ashkartizabi, Mehdi
    Aminghafari, Mina
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2018, 32 (01) : 83 - 104
  • [23] Functional data clustering via information maximization
    Li, Xinyu
    Xu, Jianjun
    Cheng, Haoyang
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (16) : 2982 - 3007
  • [24] Addressing class imbalance in functional data clustering
    Higgins, Catherine
    Carey, Michelle
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2024,
  • [25] Functional clustering and identifying substructures of longitudinal data
    Chiou, Jeng-Min
    Li, Pai-Ling
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2007, 69 : 679 - 699
  • [26] MVStream: Multiview Data Stream Clustering
    Huang, Ling
    Wang, Chang-Dong
    Chao, Hong-Yang
    Yu, Philip S.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (09) : 3482 - 3496
  • [27] Graph Enhanced Fuzzy Clustering for Categorical Data Using a Bayesian Dissimilarity Measure
    Zhang, Chuanbin
    Chen, Long
    Zhao, Yin-Ping
    Wang, Yingxu
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2023, 31 (03) : 810 - 824
  • [28] Local-Density Subspace Distributed Clustering for High-Dimensional Data
    Geng, Yangli-ao
    Li, Qingyong
    Liang, Mingfei
    Chi, Chong-Yung
    Tan, Juan
    Huang, Heng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (08) : 1799 - 1814
  • [29] Advances in Rough and Soft Clustering: Meta-Clustering, Dynamic Clustering, Data-Stream Clustering
    Lingras, Pawan
    Triff, Matt
    ROUGH SETS, (IJCRS 2016), 2016, 9920 : 3 - 22
  • [30] Static and Dynamic Community Detection Methods That Optimize a Specific Objective Function: A Survey and Experimental Evaluation
    Taha, Kamal
    IEEE ACCESS, 2020, 8 : 98330 - 98358