Clustering Longitudinal Data: A Review of Methods and Software Packages

被引:1
作者
Lu, Zihang [1 ,2 ]
机构
[1] Queens Univ, Dept Publ Hlth Sci, Kingston, ON, Canada
[2] Queens Univ, Dept Math & Stat, Kingston, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
cluster analysis; longitudinal data; model-based clustering; algorithm-based clustering; functional clustering; FUNCTIONAL DATA-ANALYSIS; LATENT CLASS ANALYSIS; MIXTURE-MODELS; K-MEANS; R PACKAGE; BAYESIAN-INFERENCE; CROSS-VALIDATION; UNKNOWN NUMBER; MIXED MODELS; MISSING DATA;
D O I
10.1111/insr.12588
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Clustering of longitudinal data is becoming increasingly popular in many fields such as social sciences, business, environmental science, medicine and healthcare. However, it is often challenging due to the complex nature of the data, such as dependencies between observations collected over time, missingness, sparsity and non-linearity, making it difficult to identify meaningful patterns and relationships among the data. Despite the increasingly common application of cluster analysis for longitudinal data, many existing methods are still less known to researchers, and limited guidance is provided in choosing between methods and software packages. In this paper, we review several commonly used methods for clustering longitudinal data. These methods are broadly classified into three categories, namely, model-based approaches, algorithm-based approaches and functional clustering approaches. We perform a comparison among these methods and their corresponding R software packages using real-life datasets and simulated datasets under various conditions. Findings from the analyses and recommendations for using these approaches in practice are discussed.
引用
收藏
页数:34
相关论文
共 50 条
  • [1] Joint clustering multiple longitudinal features: A comparison of methods and software packages with practical guidance
    Lu, Zihang
    Ahmadiankalati, Mojtaba
    Tan, Zhiwen
    STATISTICS IN MEDICINE, 2023, 42 (29) : 5513 - 5540
  • [2] kml and kml3d: R Packages to Cluster Longitudinal Data
    Genolini, Christophe
    Alacoque, Xavier
    Sentenac, Mariane
    Arnaud, Catherine
    JOURNAL OF STATISTICAL SOFTWARE, 2015, 65 (04): : 1 - 34
  • [3] Model-based clustering of longitudinal data
    McNicholas, Paul D.
    Murphy, T. Brendan
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (01): : 153 - 168
  • [4] Review of Clustering Methods for Functional Data
    Zhang, Mimi
    Parnell, Andrew
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (07)
  • [5] Profile clustering in clinical trials with longitudinal and functional data methods
    Gong, Hangjun
    Xun, Xiaolei
    Zhou, Yingchun
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2019, 29 (03) : 541 - 557
  • [6] Functional clustering methods for binary longitudinal data with temporal heterogeneity
    Sohn, Jinwon
    Jeong, Seonghyun
    Cho, Young Min
    Park, Taeyoung
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 185
  • [7] A Pseudo-EM Algorithm for Clustering Incomplete Longitudinal Data
    Shaikh, Mateen
    McNicholas, Paul D.
    Desmond, Anthony F.
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2010, 6 (01)
  • [8] A review of nonparametric regression methods for longitudinal data
    Yang, Changxin
    Zhu, Zhongyi
    STATISTICS AND ITS INTERFACE, 2024, 17 (01) : 127 - 142
  • [9] Functional clustering methods for longitudinal data with application to electronic health records
    Zeldow, Bret
    Flory, James
    Stephens-Shields, Alisa
    Raebel, Marsha
    Roy, Jason A.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, 30 (03) : 655 - 670
  • [10] An overview of clustering methods with guidelines for application in mental health research
    Gao, Caroline X.
    Dwyer, Dominic
    Zhu, Ye
    Smith, Catherine L.
    Du, Lan
    Filia, Kate M.
    Bayer, Johanna
    Menssink, Jana M.
    Wang, Teresa
    Bergmeir, Christoph
    Wood, Stephen
    Cotton, Sue M.
    PSYCHIATRY RESEARCH, 2023, 327