Integration of speech recognition and machine translation: Speech recognition word lattice translation

被引:5
作者
Zhang, RQ [1 ]
Kikui, G [1 ]
机构
[1] ATR Spoken Language Translat Res Labs, Kyoto 6190288, Japan
关键词
Algorithms - Decoding - Graph theory - Probability - Translation (languages);
D O I
10.1016/j.specom.2005.06.007
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An important issue in speech translation is to minimize the negative effect of speech recognition errors on machine translation. We propose a novel statistical machine translation decoding algorithm for speech translation to improve speech translation quality. The algorithm can translate the speech recognition word lattice, where more hypotheses are utilized to bypass the misrecognized single-best hypothesis. The decoding involves converting the recognition word lattice to a translation word graph by a graph-based search, followed by a fine rescoring by an A* search. We show that a speech recognition confidence measure implemented by posterior probability is effective to improve speech translation. The proposed techniques were tested in a Japanese-to-English speech translation task, in which we measured the translation results in terms of a number of automatic evaluation metrics. The experimental results demonstrate a consistent and significant improvement in speech translation achieved by the proposed techniques. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:321 / 334
页数:14
相关论文
共 25 条
[1]  
AKIBA Y, 2004, P IWSLT04 ATR KYOT J
[2]  
[Anonymous], 2003, MACH TRANSL
[3]  
BERGER A, 1994, P ARPA HLT
[4]  
BOITET C, 1994, P COL 1994
[5]  
Brown P. F., 1993, Computational Linguistics, V19, P263
[6]  
Casacuberta F., 2002, P WORKSH SPEECH TO S, P39
[7]  
Doddington G., 2002, P ARPA WORKSH HUM LA
[8]  
Gao Y., 2003, P EUR 2003 GEN, P365
[9]  
Kikui G., 2003, P 8 EUR C SPEECH COM, P381
[10]  
KOEHN P, 2004, P AMTA 2004 WASH DC