Decorated Phrase Model and Syntax-Based Reordering Model for Statistical Machine Translation

被引:0
作者
Liang, Huashen [1 ]
Xue, Yongzeng [2 ]
Zhao, Tiejun [1 ]
机构
[1] Harbin Inst Technol, MOE MS Key Lab Nat Language Proc & Speech, Harbin 150001, Peoples R China
[2] Harbin Inst Technol, Dept New Media Technol & Art, Harbin 150001, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
phrase-based statistical machine translation; reordering model; syntactic structure; syntax encapsulated phrase model;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the past few years, much attention has been paid on extending phrase-based statistical machine translation with syntactic structures. In this paper, we introduce a novel phrase model, in which treebank tags are employed to decorate the bilingual phrase pairs. We use tag sequences, instead of phrase pairs, to train the lexicalized reordering model. Since the number of treebank tags is much smaller than the number of words, the tag sequence based reordering model is smaller and more accurate than the phrase based reordering model. Experiments were carried out on three types of models: the phrase model, the POS tag encapsulated phrase (PTEP) model and the syntactic tag encapsulated phrase (STEP) model. The STEP model obtained higher BLEU-4 score than other models on NIST MT tasks.
引用
收藏
页码:314 / 319
页数:6
相关论文
共 17 条
[11]  
Mi Haitao., 2008, Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL-08), P192
[12]   The alignment template approach to statistical machine translation [J].
Och, FJ ;
Ney, H .
COMPUTATIONAL LINGUISTICS, 2004, 30 (04) :417-449
[13]  
Och FJ, 2003, 41ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P160
[14]  
Och FJ, 2000, 38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P440
[15]   BLEU: a method for automatic evaluation of machine translation [J].
Papineni, K ;
Roukos, S ;
Ward, T ;
Zhu, WJ .
40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, :311-318
[16]  
Stolcke A., 2002, P INTERSPEECH, P901
[17]  
Tillmann C., 2005, P ACL, P557