Efficient backward decoding of high-order hidden Markov models

Cited by: 14
Authors
Engelbrecht, H. A. [1 ]
du Preez, J. A. [1 ]
Affiliations
[1] Univ Stellenbosch, Dept Elect & Elect Engn, ZA-7600 Stellenbosch, South Africa
Keywords
Hidden Markov model; Decoding; High-order; Search; PROBABILISTIC FUNCTIONS; RECOGNITION;
DOI
10.1016/j.patcog.2009.06.004
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The forward-backward search (FBS) algorithm [S. Austin, R. Schwartz, P. Placeway, The forward-backward search algorithm, in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 1991, pp. 697-700] has resulted in speed increases of up to a factor of 40 in expensive time-synchronous beam searches in hidden Markov model (HMM) based speech recognition [R. Schwartz, S. Austin, Efficient, high-performance algorithms for N-best search, in: Proceedings of the Workshop on Speech and Natural Language, 1990, pp. 6-11; L. Nguyen, R. Schwartz, F. Kubala, P. Placeway, Search algorithms for software-only real-time recognition with very large vocabularies, in: Proceedings of the Workshop on Human Language Technology, 1993, pp. 91-95; A. Sixtus, S. Ortmanns, High-quality word graphs using forward-backward pruning, in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 1999, pp. 593-596]. This is typically achieved by using a simplified forward search to decrease computation in the following detailed backward search. FBS implicitly assumes that forward and backward searches of HMMs are computationally equivalent. In this paper we present experimental results, obtained on the CallFriend database, that show that this assumption is incorrect for conventional high-order HMMs. Therefore, any improvement in computational efficiency that is gained by using conventional low-order HMMs in the simplified forward search of FBS is lost. We solve this problem by presenting a new definition of HMMs, termed a right-context HMM, which is equivalent to conventional HMMs. We show that the computational expense of backward Viterbi-beam decoding of right-context HMMs is similar to that of forward decoding of conventional HMMs. Though not the subject of this paper, this allows us to decode high-order HMMs more efficiently, by capitalising on the improvements in computational efficiency that are obtained by using the FBS algorithm.
(C) 2009 Elsevier Ltd. All rights reserved.
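The abstract's premise can be illustrated on a first-order HMM, where forward and backward Viterbi decoding are symmetric and recover the same best-path score (the paper's point is that this symmetry breaks down for conventional high-order HMMs). The sketch below is a hypothetical toy illustration, not the paper's right-context algorithm; the two-state model and observation sequence are invented for the example.

```python
# Toy comparison of forward vs. backward Viterbi decoding of a
# first-order HMM. Both directions compute the same best-path score.

def viterbi_forward(pi, A, B, obs):
    """delta[j] = score of the best state path ending in state j at time t."""
    n = len(pi)
    delta = [pi[i] * B[i][obs[0]] for i in range(n)]
    for t in range(1, len(obs)):
        delta = [max(delta[i] * A[i][j] for i in range(n)) * B[j][obs[t]]
                 for j in range(n)]
    return max(delta)

def viterbi_backward(pi, A, B, obs):
    """beta[i] = score of the best path suffix starting in state i at time t."""
    n = len(pi)
    beta = [1.0] * n
    for t in range(len(obs) - 2, -1, -1):
        beta = [max(A[i][j] * B[j][obs[t + 1]] * beta[j] for j in range(n))
                for i in range(n)]
    # Fold in the initial distribution and the first emission.
    return max(pi[i] * B[i][obs[0]] * beta[i] for i in range(n))

pi = [0.6, 0.4]                    # initial state probabilities
A = [[0.7, 0.3], [0.4, 0.6]]       # state transition matrix
B = [[0.9, 0.1], [0.2, 0.8]]       # emission matrix
obs = [0, 0, 1, 0, 1]              # observation sequence

fwd = viterbi_forward(pi, A, B, obs)
bwd = viterbi_backward(pi, A, B, obs)
assert abs(fwd - bwd) < 1e-12      # same best-path score in both directions
```

For a conventional HMM of order n, the backward recursion must condition on n preceding states rather than one, which is what destroys this symmetry and motivates the paper's right-context reformulation.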
Pages: 99-112
Page count: 14
References (32 in total)
[1] Austin, S.; Schwartz, R.; Placeway, P. The forward-backward search algorithm. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 1991, pp. 697-700. DOI: 10.1109/ICASSP.1991.150435
[2] Aycard, O. International Journal of Advanced Robotic Systems, 2004, 1:231.
[3] Bahl, L.R.; Jelinek, F.; Mercer, R.L. A maximum-likelihood approach to continuous speech recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1983, 5(2):179-190.
[4] Bahl, L.R., 1987, IBM Technical Report RC13123.
[5] Baum, L.E.; Petrie, T. Statistical inference for probabilistic functions of finite state Markov chains. Annals of Mathematical Statistics, 1966, 37(6):1554-&.
[6] Baum, L.E.; Eagon, J.A. An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology. Bulletin of the American Mathematical Society, 1967, 73(3):360-&.
[7] Baum, L.E.; Petrie, T.; Soules, G.; Weiss, N. A maximization technique occurring in statistical analysis of probabilistic functions of Markov chains. Annals of Mathematical Statistics, 1970, 41(1):164-&.
[8] Bellman, R.E. Dynamic Programming. Princeton Landmarks in Mathematics, 1957.
[9] Bengio, Y. Neural Computing Surveys, 1999, vol. 2.
[10] Berchtold, A. High-order extensions of the double chain Markov model. Stochastic Models, 2002, 18(2):193-227.