Mixtures of stochastic differential equations with random effects: Application to data clustering

被引:11
作者
Delattre, Maud [1 ,2 ]
Genon-Catalot, Valentine [3 ]
Samson, Adeline [4 ]
机构
[1] AgroParisTech, MIA Paris UMR518, F-75231 Paris 05, France
[2] INRA, MIA Paris UMR518, F-75231 Paris 05, France
[3] Univ Paris 05, Sorbonne Paris Cite, UMR CNRS 8145, Lab MAP5, Paris, France
[4] Univ Grenoble Alpes, UMR CNRS 5224, Lab Jean Kuntzmann, Grenoble, France
关键词
Mixed-effects models; Stochastic differential equations; BIC; Classification; EM algorithm; Mixture distribution; Maximum likelihood estimator; NONPARAMETRIC-ESTIMATION; LIKELIHOOD-ESTIMATION; MAXIMUM-LIKELIHOOD; MIXED MODELS; GROWTH; ALGORITHM;
D O I
10.1016/j.jspi.2015.12.003
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider N independent stochastic processes (X-i(t), t is an element of [0, T-i]), i = 1,..., N, defined by a stochastic differential equation with drift term depending on a random variable phi(i). The distribution of the random effect phi(i) is a Gaussian mixture distribution, depending on unknown parameters which are to be estimated from the continuous observation of the processes X-i. The likelihood of the observation is explicit. When the number of components is known, we prove the consistency of the exact maximum likelihood estimators and use the EM algorithm to compute it. When the number of components is unknown, BIC (Bayesian Information Criterion) is applied to select it. To assign each individual to a class, we define a classification rule based on estimated posterior probabilities. A simulation study illustrates our estimation and classification method on various models. A real data analysis is performed on growth curves with convincing results. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:109 / 124
页数:16
相关论文
共 22 条
  • [1] [Anonymous], 2001, STAT RANDOM PROCESSE
  • [2] [Anonymous], 2000, Sankhya Ser. A, DOI DOI 10.2307/25051289
  • [3] [Anonymous], 2008, EM ALGORITHM EXTENSI
  • [4] Classification of longitudinal data through a semiparametric mixed-effects model based on lasso-type estimators
    Arribas-Gil, Ana
    De la Cruz, Rolando
    Lebarbier, Emilie
    Meza, Cristian
    [J]. BIOMETRICS, 2015, 71 (02) : 333 - 343
  • [5] Mixture of linear mixed models for clustering gene expression profiles from repeated microarray experiments
    Celeux, G
    Martin, O
    Lavergne, C
    [J]. STATISTICAL MODELLING, 2005, 5 (03) : 243 - 267
  • [6] Nonparametric estimation for stochastic differential equations with random effects
    Comte, F.
    Genon-Catalot, V.
    Samson, A.
    [J]. STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2013, 123 (07) : 2522 - 2551
  • [7] Maximum Likelihood Estimation for Stochastic Differential Equations with Random Effects
    Delattre, Maud
    Genon-Catalot, Valentine
    Samson, Adeline
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2013, 40 (02) : 322 - 343
  • [8] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [9] Devijver E., 2014, ARXIV14091333
  • [10] Dion C., 2015, STAT INFERENCE STOCH, V18