Efficient and Effective Learning of HMMs Based on Identification of Hidden States

被引:5
作者
Liu, Tingting [1 ]
Lemeire, Jan [1 ,2 ,3 ]
机构
[1] VUB, Dept Elect & Informat ETRO, Pl Laan 2, B-1050 Brussels, Belgium
[2] VUB, Dept Ind Sci INDI, Pl Laan 2, B-1050 Brussels, Belgium
[3] iMinds, Dept Data Sci, Technol Pk 19, B-9052 Zwijnaarde, Belgium
关键词
PROBABILISTIC FUNCTIONS; MARKOV; MODEL;
D O I
10.1155/2017/7318940
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The predominant learning algorithm for Hidden Markov Models (HMMs) is local search heuristics, of which the Baum-Welch (BW) algorithm is mostly used. It is an iterative learning procedure starting with a predefined size of state spaces and randomly chosen initial parameters. However, wrongly chosen initial parameters may cause the risk of falling into a local optimum and a low convergence speed. To overcome these drawbacks, we propose to use amore suitable model initialization approach, a Segmentation Clustering and Transient analysis (SCT) framework, to estimate the number of states and model parameters directly from the input data. Based on an analysis of the information flow through HMMs, we demystify the structure of models and show that high-impact states are directly identifiable from the properties of observation sequences. States having a high impact on the log-likelihood make HMMs highly specific. Experimental results show that even though the identification accuracy drops to 87.9% when randommodels are considered, the SCT method is around 50 to 260 times faster than the BWalgorithm with 100% correct identification for highly specific models whose specificity is greater than 0.06.
引用
收藏
页数:26
相关论文
共 30 条
[1]  
Akaike H., 1998, Selected papers of Hirotugu Akaike, P199, DOI [10.1007/978-1-4612-1694-0_15, DOI 10.1007/978-1-4612-1694-0_15]
[2]  
[Anonymous], 2008, Proceedings of the 25th international conference on Machine learning, DOI DOI 10.1145/1390156.1390196
[3]  
Balasubramanian V., 1993, THESIS
[4]   STATISTICAL INFERENCE FOR PROBABILISTIC FUNCTIONS OF FINITE STATE MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T .
ANNALS OF MATHEMATICAL STATISTICS, 1966, 37 (06) :1554-&
[5]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[6]   Detecting homogeneous segments in DNA sequences by using hidden Markov models [J].
Boys, RJ ;
Henderson, DA ;
Wilkinson, DJ .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2000, 49 :269-285
[7]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]   Sticky Hidden Markov Modeling of Comparative Genomic Hybridization [J].
Du, Lan ;
Chen, Minhua ;
Lucas, Joseph ;
Carin, Lawrence .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (10) :5353-5368
[10]  
Finesso L., 1990, THESIS