N-best rescoring for speech recognition using penalized logistic regression machines with garbage class

被引：0

作者：

Birkenes, Oystein ^{[1
,2
]}

Matsui, Tomoko ^{[1
]}

Tanabe, Kunio ^{[3
]}

Myrvoll, Tor Andre ^{[1
,2
]}

机构：

[1] Inst Stat Math, Minato Ku, Tokyo 106, Japan

[2] Norwegian Univ Sci & Technol, Dept Elect & Telecommun, Trondheim, Norway

[3] Waseda Univ, Tokyo, Japan

来源：

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3 | 2007年

关键词：

speech recognition; N-best rescoring; PLRM; garbage class; Aurora2;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

State-of-the-art pattern recognition approaches like neural networks or kernel methods have only had limited success in speech recognition. The difficulties often encountered include the varying lengths of speech signals as well as how to deal with sequences of labels (e.g., digit strings) and unknown segmentation. In this paper we present a combined hidden Markov model (HMM) and penalized logistic regression machine (PLRM) approach to continuous speech recognition that can cope with both of these difficulties. The key ingredients of our approach are N-best rescoring and PLRM with garbage class. Experiments on the Aurora2 connected digits database show significant increase in recognition accuracy relative to a purely HMM-based system.

引用

页码：449 / +

页数：2