Clustering Longitudinal Data: A Review of Methods and Software Packages

被引:1
作者
Lu, Zihang [1 ,2 ]
机构
[1] Queens Univ, Dept Publ Hlth Sci, Kingston, ON, Canada
[2] Queens Univ, Dept Math & Stat, Kingston, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
cluster analysis; longitudinal data; model-based clustering; algorithm-based clustering; functional clustering; FUNCTIONAL DATA-ANALYSIS; LATENT CLASS ANALYSIS; MIXTURE-MODELS; K-MEANS; R PACKAGE; BAYESIAN-INFERENCE; CROSS-VALIDATION; UNKNOWN NUMBER; MIXED MODELS; MISSING DATA;
D O I
10.1111/insr.12588
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Clustering of longitudinal data is becoming increasingly popular in many fields such as social sciences, business, environmental science, medicine and healthcare. However, it is often challenging due to the complex nature of the data, such as dependencies between observations collected over time, missingness, sparsity and non-linearity, making it difficult to identify meaningful patterns and relationships among the data. Despite the increasingly common application of cluster analysis for longitudinal data, many existing methods are still less known to researchers, and limited guidance is provided in choosing between methods and software packages. In this paper, we review several commonly used methods for clustering longitudinal data. These methods are broadly classified into three categories, namely, model-based approaches, algorithm-based approaches and functional clustering approaches. We perform a comparison among these methods and their corresponding R software packages using real-life datasets and simulated datasets under various conditions. Findings from the analyses and recommendations for using these approaches in practice are discussed.
引用
收藏
页数:34
相关论文
共 50 条
[31]   CLUSTERING FOR MULTIVARIATE CONTINUOUS AND DISCRETE LONGITUDINAL DATA [J].
Komarek, Arnost ;
Komarkova, Lenka .
ANNALS OF APPLIED STATISTICS, 2013, 7 (01) :177-200
[32]   A new approach in pattern clustering on longitudinal data [J].
Liu, Yi, 1600, Binary Information Press (10) :6209-6222
[33]   On Comparison of Clustering Methods for Pharmacoepidemiological Data [J].
Feuillet, Fanny ;
Bellanger, Lise ;
Hardouin, Jean-Benoit ;
Victorri-Vigneau, Caroline ;
Sebille, Veronique .
JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2015, 25 (04) :843-856
[34]   Software application profile: tpc and micd-R packages for causal discovery with incomplete cohort data [J].
Andrews, Ryan M. ;
Bang, Christine W. ;
Didelez, Vanessa ;
Witte, Janine ;
Foraita, Ronja .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2024, 53 (05)
[35]   SLIDER: Software for LongItudinal Data Exploration with R [J].
Commenges, Hadrien ;
Pistre, Pierre ;
Cura, Robin .
CYBERGEO-EUROPEAN JOURNAL OF GEOGRAPHY, 2014,
[36]   Review and compare clustering algorithms for navigation data analysis tasks [J].
Ponomareva, A. V. ;
Meyta, R. V. .
PROCEEDINGS OF THE 2016 CONFERENCE ON INFORMATION TECHNOLOGIES IN SCIENCE, MANAGEMENT, SOCIAL SPHERE AND MEDICINE (ITSMSSM), 2016, 51 :270-273
[37]   Clustering longitudinal ordinal data via finite mixture of matrix-variate distributions [J].
Francesco Amato ;
Julien Jacques ;
Isabelle Prim-Allaz .
Statistics and Computing, 2024, 34
[38]   Capabilities of R Package mixAK for Clustering Based on Multivariate Continuous and Discrete Longitudinal Data [J].
Komarek, Arnost ;
Komarkova, Lenka .
JOURNAL OF STATISTICAL SOFTWARE, 2014, 59 (12) :1-38
[39]   A Comparison of Hierarchical Methods for Clustering Functional Data [J].
Ferreira, Laura ;
Hitchcock, David B. .
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2009, 38 (09) :1925-1949
[40]   Sparse and smooth functional data clustering [J].
Centofanti, Fabio ;
Lepore, Antonio ;
Palumbo, Biagio .
STATISTICAL PAPERS, 2024, 65 (02) :795-825