Model-based clustering of time series in group-specific functional subspaces

被引:108
作者
Bouveyron, Charles [2 ]
Jacques, Julien [1 ]
机构
[1] Univ Lille 1, UFR Math, INRIA Lille Nord Europe, Lab Paul Painleve,UMR CNRS 8524, F-59655 Villeneuve Dascq, France
[2] Univ Paris 01, Lab SAMM, EA 4543, F-75013 Paris, France
关键词
Functional data; Time series clustering; Model-based clustering; Group-specific functional subspaces; Functional PCA; CLASSIFICATION;
D O I
10.1007/s11634-011-0095-6
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This work develops a general procedure for clustering functional data which adapts the clustering method high dimensional data clustering (HDDC), originally proposed in the multivariate context. The resulting clustering method, called funHDDC, is based on a functional latent mixture model which fits the functional data in group-specific functional subspaces. By constraining model parameters within and between groups, a family of parsimonious models is exhibited which allow to fit onto various situations. An estimation procedure based on the EM algorithm is proposed for determining both the model parameters and the group-specific functional subspaces. Experiments on real-world datasets show that the proposed approach performs better or similarly than classical two-step clustering methods while providing useful interpretations of the groups and avoiding the uneasy choice of the discretization technique. In particular, funHDDC appears to always outperform HDDC applied on spline coefficients.
引用
收藏
页码:281 / 300
页数:20
相关论文
共 24 条