Robust numeric recognition in spoken language dialogue

被引：7

作者：

Rahim, M ^{[1
]}

Riccardi, G ^{[1
]}

Saul, L ^{[1
]}

Wright, J ^{[1
]}

Buntschuh, B ^{[1
]}

Gorin, A ^{[1
]}

机构：

[1] AT&T Labs Res, Florham Pk, NJ 07932 USA

来源：

SPEECH COMMUNICATION | 2001年 / 34卷 / 1-2期

关键词：

robustness; spoken dialogue system; speech recognition; utterance verification; discriminative training; understanding; language modeling; numeric recognition; digits;

D O I：

10.1016/S0167-6393(00)00054-6

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper addresses the problem of automatic numeric recognition and understanding in spoken language dialogue. We show that accurate numeric understanding in fluent unconstrained speech demands maintaining robustness at several different levels of system design, including acoustic, language, understanding and dialogue. We describe a robust system for numeric recognition and present algorithms for feature extraction, acoustic and language modeling, discriminative training, utterance verification and numeric understanding and validation. Experimental results from a field-trial of a spoken dialogue system are presented that include customers' responses to credit card and telephone number requests. (C) 2001 Elsevier Science B.V. All rights reserved.

引用

页码：195 / 212

页数：18

共 34 条

[21] RAHIM M, 1999, P EUR C SPEECH COMM, P495
[22] Rahim MG, 1996, IEEE T SPEECH AUDI P, V4, P19
[23] Discriminative utterance verification for connected digits recognition
Rahim, MG
Lee, CH
Juang, BH
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03): : 266 - 277
[24] Signal conditioning techniques for robust speech recognition
Rahim, MG
Juang, BH
Chou, W
Buhrke, E
[J]. IEEE SIGNAL PROCESSING LETTERS, 1996, 3 (04) : 107 - 109
[25] RAMASWAMY G, 1999, P EUR C SPEECH COMM, P2662
[26] Stochastic automata for language modeling
Riccardi, G
Pieraccini, R
Bocchieri, E
[J]. COMPUTER SPEECH AND LANGUAGE, 1996, 10 (04) : 265 - 293
[27] Stochastic language adaptation over time and state in natural spoken dialog systems
Riccardi, G
Gorin, AL
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (01): : 3 - 10
[28] RICCARDI G, 1998, ACL WORKSH VER LARG, P188
[29] Maximum-likelihood approach to stochastic matching for robust speech recognition
Sankar, A
Lee, CH
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (03): : 190 - 202
[30] SHARP RD, 1997, P INT C AC SPEECH SI, P4065

← 1 2 3 4 →