Improving Speech Recognition Accuracy with Multi-Confidence Thresholding

被引:0
作者
Chang, Shuangyu [1 ]
机构
[1] Tellme Networks Inc, Mountain View, CA 94041 USA
来源
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年
关键词
speech recognition; confidence threshold; IVR system; pattern recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Confidence-based thresholding plays an important role in practical speech recognition applications. Most previous works have focused on directly improving confidence estimation within the recognition engine. A complementary approach that does not require access to recognizer internal is to optimize confidence threshold settings. This paper describes a general multi-confidence thresholding algorithm that automatically learns different confidence thresholds for different utterances, based on discreet or continuous features associated with a speech utterance. The algorithm can be applied to any speech recognition engine with a confidence output. A learned multi-threshold setting is guaranteed to perform at least as well as a baseline single-threshold system on training data. A significant improvement on overall accuracy can often be obtained on test data, as demonstrated with experiments on two real-world applications.
引用
收藏
页码:1610 / 1613
页数:4
相关论文
共 7 条
[1]  
CHEN Z, 2003, P 7 INT C DOC AN REC, V2
[2]  
GUNAWARDANA A, 1998, P ICSLP98
[3]  
GUPTA S, 1998, P ICSLP98 SYDN
[4]  
*NIST, SC LIT SPEECH REC SC
[5]  
Russell S., 2010, ARTIF INTELL
[6]   Confidence measures for large vocabulary continuous speech recognition [J].
Wessel, F ;
Schlüter, R ;
Macherey, K ;
Ney, H .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (03) :288-298
[7]  
WILLIAMS DAG, 1999, THESIS U SHEFFIELD