Selecting hidden Markov model state number with cross-validated likelihood

被引:103
作者
Celeux, Gilles [2 ]
Durand, Jean-Baptiste [1 ]
机构
[1] Grenoble Univ, Lab Jean Kuntzmann, INRIA Rhone Alpes, F-38041 Grenoble 9, France
[2] Univ Paris 11, Dept Math, INRIA Futurs, F-91405 Orsay, France
关键词
hidden Markov models; model selection; cross-validation; missing values at random; EM algorithm;
D O I
10.1007/s00180-007-0097-1
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The problem of estimating the number of hidden states in a hidden Markov model is considered. Emphasis is placed on cross-validated likelihood criteria. Using cross-validation to assess the number of hidden states allows to circumvent the well-documented technical difficulties of the order identification problem in mixture models. Moreover, in a predictive perspective, it does not require that the sampling distribution belongs to one of the models in competition. However, computing cross-validated likelihood for hidden Markov models for which only one training sample is available, involves difficulties since the data are not independent. Two approaches are proposed to compute cross-validated likelihood for a hidden Markov model. The first one consists of using a deterministic half-sampling procedure, and the second one consists of an adaptation of the EM algorithm for hidden Markov models, to take into account randomly missing values induced by cross-validation. Numerical experiments on both simulated and real data sets compare different versions of cross-validated likelihood criterion and penalised likelihood criteria, including BIC and a penalised marginal likelihood criterion. Those numerical experiments highlight a promising behaviour of the deterministic half-sampling criterion.
引用
收藏
页码:541 / 564
页数:24
相关论文
共 31 条
[1]  
Akaike H., 1973, 2 INT S INFORM THEOR, P267, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-0_15]
[2]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[3]  
Bernardo J., 2009, Bayesian theory
[4]   Assessing a mixture model for clustering with the integrated completed likelihood [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (07) :719-725
[5]   Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 41 (3-4) :561-575
[6]  
CELEUX G, 1992, ACT JOURN THEM APPR, P5
[7]  
CHURCHILL GA, 1989, B MATH BIOL, V51, P79
[8]   HEART-RATE-VARIABILITY IN NORMAL SLEEPING FULL-TERM AND PRETERM NEONATES [J].
CLAIRAMBAULT, J ;
CURZIDASCALOVA, L ;
KAUFFMANN, F ;
MEDIGUE, C ;
LEFFLER, C .
EARLY HUMAN DEVELOPMENT, 1992, 28 (02) :169-183
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]   BAUM FORWARD - BACKWARD ALGORITHM REVISITED [J].
DEVIJVER, PA .
PATTERN RECOGNITION LETTERS, 1985, 3 (06) :369-373