Simultaneous Estimation of Confidence and Error Cause in Speech Recognition Using Discriminative Model

被引:0
作者
Ogawa, Atsunori [1 ]
Nakamura, Atsushi [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Kyoto, Japan
来源
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年
关键词
speech recognition; confidence; error cause; discriminative model;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since recognition errors are unavoidable in speech recognition, confidence scoring, which accurately estimates the reliability of recognition results, is a critical function for speech recognition engines. In addition to achieving accurate confidence estimation, if we are to develop speech recognition systems that will be widely used by the public, speech recognition engines must be able to report the causes of errors properly, namely they must offer a reason for any failure to recognize input utterances. This paper proposes a method that simultaneously estimates both confidences and causes of errors in speech recognition results by using discriminative models. We evaluated the proposed method in an initial speech recognition experiment, and confirmed its promising performance with respect to confidence and error cause estimation.
引用
收藏
页码:1203 / 1206
页数:4
相关论文
共 19 条
  • [1] BERGER AL, 1996, COMPUT LING, V22
  • [2] DUTA N, 2006, IEEE T ASLP, V14
  • [3] Furui S, 2005, LECT NOTES ARTIF INT, V3658, P9
  • [4] HORI T, 2007, IEEE T ASLP, V15
  • [5] Imai T, 2006, INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, P1602
  • [6] ITOU K, 1998, P ICSLP, P3261
  • [7] Confidence measures for speech recognition: A survey
    Jiang, H
    [J]. SPEECH COMMUNICATION, 2005, 45 (04) : 455 - 470
  • [8] JIANG L, 1998, P ICSLP
  • [9] KOBAYASHI A, 2005, P INT, P1453
  • [10] KOBAYASHI T, 2008SLP7419 IPSJ SIG