Robust numeric recognition in spoken language dialogue

被引:7
作者
Rahim, M [1 ]
Riccardi, G [1 ]
Saul, L [1 ]
Wright, J [1 ]
Buntschuh, B [1 ]
Gorin, A [1 ]
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
关键词
robustness; spoken dialogue system; speech recognition; utterance verification; discriminative training; understanding; language modeling; numeric recognition; digits;
D O I
10.1016/S0167-6393(00)00054-6
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the problem of automatic numeric recognition and understanding in spoken language dialogue. We show that accurate numeric understanding in fluent unconstrained speech demands maintaining robustness at several different levels of system design, including acoustic, language, understanding and dialogue. We describe a robust system for numeric recognition and present algorithms for feature extraction, acoustic and language modeling, discriminative training, utterance verification and numeric understanding and validation. Experimental results from a field-trial of a spoken dialogue system are presented that include customers' responses to credit card and telephone number requests. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:195 / 212
页数:18
相关论文
共 34 条
  • [21] RAHIM M, 1999, P EUR C SPEECH COMM, P495
  • [22] Rahim MG, 1996, IEEE T SPEECH AUDI P, V4, P19
  • [23] Discriminative utterance verification for connected digits recognition
    Rahim, MG
    Lee, CH
    Juang, BH
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03): : 266 - 277
  • [24] Signal conditioning techniques for robust speech recognition
    Rahim, MG
    Juang, BH
    Chou, W
    Buhrke, E
    [J]. IEEE SIGNAL PROCESSING LETTERS, 1996, 3 (04) : 107 - 109
  • [25] RAMASWAMY G, 1999, P EUR C SPEECH COMM, P2662
  • [26] Stochastic automata for language modeling
    Riccardi, G
    Pieraccini, R
    Bocchieri, E
    [J]. COMPUTER SPEECH AND LANGUAGE, 1996, 10 (04) : 265 - 293
  • [27] Stochastic language adaptation over time and state in natural spoken dialog systems
    Riccardi, G
    Gorin, AL
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (01): : 3 - 10
  • [28] RICCARDI G, 1998, ACL WORKSH VER LARG, P188
  • [29] Maximum-likelihood approach to stochastic matching for robust speech recognition
    Sankar, A
    Lee, CH
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (03): : 190 - 202
  • [30] SHARP RD, 1997, P INT C AC SPEECH SI, P4065