KmL3D: A non-parametric algorithm for clustering joint trajectories

被引:73
作者
Genolini, C. [1 ,2 ]
Pingault, J. B. [3 ,4 ]
Driss, T. [2 ]
Cote, S. [3 ,4 ,5 ,6 ]
Tremblay, R. E. [3 ,4 ,5 ,6 ,9 ]
Vitaro, F. [3 ,4 ,5 ]
Arnaud, C. [1 ]
Falissard, B. [6 ,7 ,8 ]
机构
[1] Univ Toulouse 3, INSERM, U1027, F-31062 Toulouse, France
[2] Univ Paris Ouest Nanterre La Def, UFR STAPS, CeRSM EA 2931, Paris, France
[3] Univ Montreal, Res Unit Childrens Psychosocial Maladjustment, Montreal, PQ, Canada
[4] St Justine Hosp, Montreal, PQ, Canada
[5] Univ Montreal, Int Lab Child & Adolescent Mental Hlth Dev, Montreal, PQ, Canada
[6] INSERM, U669, Paris, France
[7] Univ Paris Sud, Paris, France
[8] Univ Paris 05, Paris, France
[9] Natl Univ Ireland Univ Coll Dublin, Sch Publ Hlth Physiotherapy & Populat Sci, Dublin 4, Ireland
关键词
Longitudinal data; k-means; Cluster analysis; Non-parametric algorithm; Joint trajectories; HYPERACTIVITY; DISORDER;
D O I
10.1016/j.cmpb.2012.08.016
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In cohort studies, variables are measured repeatedly and can be considered as trajectories. A classic way to work with trajectories is to cluster them in order to detect the existence of homogeneous patterns of evolution. Since cohort studies usually measure a large number of variables, it might be interesting to study the joint evolution of several variables (also called joint-variable trajectories). To date, the only way to cluster joint-trajectories is to cluster each trajectory independently, then to cross the partitions obtained. This approach is unsatisfactory because it does not take into account a possible co-evolution of variable-trajectories. KmL3D is an R package that implements a version of k-means dedicated to clustering joint-trajectories. It provides facilities for the management of missing values, offers several quality criteria and its graphic interface helps the user to select the best partition. KmL3D can work with any number of joint-variable trajectories. In the restricted case of two joint trajectories, it proposes 3D tools to visualize the partitioning and then export 3D dynamic rotating-graphs to PDF format. (C) 2012 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:104 / 111
页数:8
相关论文
共 29 条
[1]   Unsupervised curve clustering using B-splines [J].
Abraham, C ;
Cornillon, PA ;
Matzner-Lober, E ;
Molinari, N .
SCANDINAVIAN JOURNAL OF STATISTICS, 2003, 30 (03) :581-595
[2]  
[Anonymous], 1901, B SOCIETE VAUDOISEDE
[3]   A comparison of maximum covariance and k-means cluster analysis in classifying cases into known taxon groups [J].
Beauchaine, TP ;
Beauchaine, RJ .
PSYCHOLOGICAL METHODS, 2002, 7 (02) :245-261
[4]  
Calinski T., 1974, Communications in Statistics-theory and Methods, V3, P1, DOI [10.1080/03610927408827101, DOI 10.1080/03610927408827101]
[5]   A CLASSIFICATION EM ALGORITHM FOR CLUSTERING AND 2 STOCHASTIC VERSIONS [J].
CELEUX, G ;
GOVAERT, G .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1992, 14 (03) :315-332
[6]   The Relative Merits of Risk Ratios and Odds Ratios [J].
Cummings, Peter .
ARCHIVES OF PEDIATRICS & ADOLESCENT MEDICINE, 2009, 163 (05) :438-445
[7]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[8]   Imputation of missing longitudinal data: a comparison of methods [J].
Engels, JM ;
Diehr, P .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2003, 56 (10) :968-976
[9]  
Everitt B. S., 2001, CLUSTER ANAL
[10]  
Feng D, 2008, J STAT SOFTW, V28, P1