Speech recognition for a travel reservation system

Cited by: 0
Authors
Erdogan, H [1]
Affiliation
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
Source
IC-AI'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS I-III | 2001
Keywords
speech recognition; spoken dialog systems; language modeling; context free grammars;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we present our work on speech recognition for a spoken dialog system for automatic travel reservations. The system can receive speech input from an analog line or from IP telephony through a web site. The system uses speech recognition, natural language understanding, air travel database access, dialog management, natural language generation and speech synthesis technologies to perform the task of an automated travel agent. Language modeling for a mixed-initiative spoken dialog system is a challenging problem. We explore class-based n-gram language models (LMs) for this domain and describe how to design the LM classes and assign non-uniform probabilities within them. We point out disadvantages of class-based LMs and introduce compound-word vocabularies to overcome the problems observed. We also explore dialog-state-dependent LMs, which enable incorporating semantic context into the language models, and we analyze the use of embedded context-free grammar objects within LMs, pointing out their advantages and disadvantages. The resulting LMs are implemented and evaluated using the IBM telephony speech recognition engine. Our efforts are shown to decrease the word error rate from 24% to 17% on an evaluation test set.
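The class-based n-gram idea named in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a bigram model of the standard form P(w | w_prev) = P(c | c_prev) * P(w | c), estimated by maximum likelihood with no smoothing. The `CITY` class, the toy corpus, and the helper names `to_class`, `train`, and `prob` are all hypothetical. Estimating P(w | c) from member counts is one simple way to obtain non-uniform probabilities within a class.

```python
from collections import defaultdict

# Hypothetical class map and corpus, invented for illustration only.
WORD_TO_CLASS = {"boston": "CITY", "denver": "CITY", "austin": "CITY"}
CORPUS = ["fly to boston", "fly to boston", "fly to denver", "going to boston"]


def to_class(token):
    """Map a word to its class; words outside any class act as their own class."""
    return WORD_TO_CLASS.get(token, token)


def train(sentences):
    """Collect class-bigram counts and per-class member counts."""
    class_bigram = defaultdict(lambda: defaultdict(int))
    member = defaultdict(lambda: defaultdict(int))
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for prev, cur in zip(tokens, tokens[1:]):
            class_bigram[to_class(prev)][to_class(cur)] += 1
        for word in tokens:
            member[to_class(word)][word] += 1
    return class_bigram, member


def prob(word, prev, class_bigram, member):
    """Unsmoothed estimate of P(word | prev) = P(c | c_prev) * P(word | c)."""
    c_prev, c = to_class(prev), to_class(word)
    context_total = sum(class_bigram[c_prev].values())
    class_total = sum(member[c].values())
    if context_total == 0 or class_total == 0:
        return 0.0
    p_class = class_bigram[c_prev][c] / context_total
    p_word = member[c][word] / class_total  # non-uniform within the class
    return p_class * p_word


class_bigram, member = train(CORPUS)
for city in ("boston", "denver", "austin"):
    print(city, prob(city, "to", class_bigram, member))
```

In this toy corpus every city shares the class-level context statistics, while the within-class term separates frequent members from rare ones. A class member never seen in training ("austin" here) gets probability zero, so a practical model would need smoothing or hand-assigned within-class weights.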
Pages: 1505-1511
Number of pages: 7