Inference in finite state space non parametric Hidden Markov Models and applications

被引:43
作者
Gassiat, E. [1 ,2 ]
Cleynen, A. [3 ,4 ]
Robin, S. [3 ,4 ]
机构
[1] Univ Paris Sud, Math Lab, Orsay, France
[2] CNRS, Math Lab, F-91405 Orsay, France
[3] AgroParisTech, MIA 518, Paris, France
[4] INRA, MIA 518, Paris, France
关键词
Identifiability; Hidden Markov Models; Non-parametric; SEMIPARAMETRIC ESTIMATION; NONPARAMETRIC-ESTIMATION; HMM; IDENTIFIABILITY; LIKELIHOOD; COMPONENT;
D O I
10.1007/s11222-014-9523-8
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Hidden Markov models (HMMs) are intensively used in various fields to model and classify data observed along a line (e.g. time). The fit of such models strongly relies on the choice of emission distributions that are most often chosen among some parametric family. In this paper, we prove that finite state space non parametric HMMs are identifiable as soon as the transition matrix of the latent Markov chain has full rank and the emission probability distributions are linearly independent. This general result allows the use of semi-or non-parametric emission distributions. Based on this result we present a series of classification problems that can be tackled out of the strict parametric framework. We derive the corresponding inference algorithms. We also illustrate their use on few biological examples, showing that they may improve the classification performances.
引用
收藏
页码:61 / 71
页数:11
相关论文
共 37 条
[1]   IDENTIFIABILITY OF PARAMETERS IN LATENT STRUCTURE MODELS WITH MANY OBSERVED VARIABLES [J].
Allman, Elizabeth S. ;
Matias, Catherine ;
Rhode, John A. .
ANNALS OF STATISTICS, 2009, 37 (6A) :3099-3132
[2]  
[Anonymous], NONPARAMETR IN PRESS
[3]  
[Anonymous], WILEY SERIES PROBABI
[4]  
[Anonymous], 2013, TECHNICAL REPORT
[5]  
[Anonymous], TECHNICAL REPORT
[6]  
[Anonymous], P NAT C ART INT MENL
[7]  
[Anonymous], TECHNICAL REPORT
[8]   Combining Mixture Components for Clustering [J].
Baudry, Jean-Patrick ;
Raftery, Adrian E. ;
Celeux, Gilles ;
Lo, Kenneth ;
Gottardo, Raphael .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2010, 19 (02) :332-353
[9]   An EM-Like Algorithm for Semi- and Nonparametric Estimation in Multivariate Mixtures [J].
Benaglia, Tatiana ;
Chauveau, Didier ;
Hunter, David R. .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2009, 18 (02) :505-526
[10]   Unsupervised Classification for Tiling Arrays: ChIP-chip and Transcriptome [J].
Berard, Caroline ;
Martin-Magniette, Marie-Laure ;
Brunaud, Veronique ;
Aubourg, Sebastien ;
Robin, Stephane .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)