Improving Viterbi Bayesian predictive classification via sequential Bayesian learning in robust speech recognition

被引：11

作者：

Jiang, H

Hirose, K

Huo, Q

机构：

[1] Univ Tokyo, Dept Informat & Commun Engn, Bunkyo Ku, Tokyo 1138656, Japan

[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada

[3] Univ Hong Kong, Dept Informat Syst & Comp Sci, Hong Kong, Peoples R China

来源：

SPEECH COMMUNICATION | 1999年 / 28卷 / 04期

关键词：

Bayesian predictive classification (BPC); Viterbi BPC (VBPC); sequential Bayesian learning; robust speech recognition; natural conjugate prior;

D O I：

10.1016/S0167-6393(99)00018-7

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we extend our proposed Viterbi Bayesian predictive classification (VBPC) algorithm to a new class of prior probability density function(pdf), namely a family of natural conjugate prior pdf's of the complete-data density in continuous density hidden Markov model (CDHMM) and their mixtures. In this way, we can on-line adapt the prior pdf via a sequential Bayesian learning algorithm when some new data are available, so that the performance of VBPC can be continuously improved. Moreover, we also study a sequential Bayesian learning strategy for CDHMM based on a finite mixture approximation of its prior/posterior density which attempts to derive a more accurate prior pdf to describe the unknown mismatches. The experimental results on a speaker-independent recognition task of isolated Japanese digits confirm the viability and the usefulness of the proposed method. (C) 1999 Elsevier Science B.V. All rights reserved.

引用

页码：313 / 326

页数：14

共 15 条

[1]

Bernardo J., 1988, BAYESIAN STAT, V3, P67

[2]

Furui S., 1997, P ESCA NATO TUT RES, P11

[3] Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains [J].

Gauvain, Jean-Luc ;

Lee, Chin-Hui .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02) :291-298

[4]

Huo Q, 1997, IEEE T SPEECH AUDI P, V5, P161, DOI 10.1109/89.554778

[5]

HUO Q, 1997, P INT C AC SPEECH SI

[6]

HUO Q, 1997, UNPUB IEEE T SPEECH

[7]

HUO Q, 1997, P EUR C SPEECH COMM, P1847

[8]

JIANG H, 1999, IN PRESS IEEE T SPEE, V7

[9]

JIANG H, 1997, P INT C AC SPEECH SI

[10] On stochastic feature and model compensation approaches to robust speech recognition [J].

Lee, CH .

SPEECH COMMUNICATION, 1998, 25 (1-3) :29-47

← 1 2 →