Combining language models in the input interface of a spoken dialogue system

Cited by: 17

Authors:

Lopez-Cozar, R. [1]

Callejas, Z. [1]

Affiliation:

[1] Univ Granada, Fac Comp Sci, Dept Languages & Comp Syst, E-18071 Granada, Spain

Source:

COMPUTER SPEECH AND LANGUAGE | 2006 / Volume 20 / Issue 04

DOI:

10.1016/j.csl.2005.05.003

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper presents a new technique for enhancing the performance of the input interface of spoken dialogue systems. The technique combines, during speech recognition, the advantages of prompt-dependent language models with those of a language model that is independent of the prompts generated by the dialogue system. It proposes a new speech recognizer, termed the contextual speech recognizer, that uses a prompt-independent language model to allow recognition of any kind of sentence permitted in the application domain and, at the same time, uses contextual information (in the form of prompt-dependent language models) to take into account that some sentences are more likely to be uttered than others at a particular moment of the dialogue. The experiments show that the technique clearly enhances the performance of the input interface of a previously developed dialogue system based exclusively on prompt-dependent language models. Most importantly, in comparison with a standard speech recognizer that uses just one prompt-independent language model without contextual information, the proposed recognizer increases the word accuracy and sentence understanding rates by 4.09% and 4.19% absolute, respectively. These scores are slightly better than those obtained using linear interpolation of the prompt-independent and prompt-dependent language models used in the experiments. (c) 2005 Elsevier Ltd. All rights reserved.

Pages: 420-440

Page count: 21
