Topic-aware pivot language approach for statistical machine translation

被引:0
作者
Jinsong SU [1 ,2 ]
Xiaodong SHI [3 ]
Yanzhou HUANG [3 ]
Yang LIU [4 ]
Qingqiang WU [1 ,2 ]
Yidong CHEN [3 ]
Huailin DONG [1 ]
机构
[1] Software School, Xiamen University
[2] Center for Digital Media Computing, Xiamen University
[3] Cognitive Science Department, Xiamen University
[4] Department of Computer Science and Technology, Tsinghua
关键词
D O I
暂无
中图分类号
TP391.2 [翻译机];
学科分类号
摘要
The pivot language approach for statistical machine translation(SMT) is a good method to break the resource bottleneck for certain language pairs. However, in the implementation of conventional approaches, pivotside context information is far from fully utilized, resulting in erroneous estimations of translation probabilities. In this study, we propose two topic-aware pivot language approaches to use different levels of pivot-side context. The first method takes advantage of document-level context by assuming that the bridged phrase pairs should be similar in the document-level topic distributions. The second method focuses on the effect of local context. Central to this approach are that the phrase sense can be reflected by local context in the form of probabilistic topics, and that bridged phrase pairs should be compatible in the latent sense distributions. Then, we build an interpolated model bringing the above methods together to further enhance the system performance. Experimental results on French-Spanish and French-German translations using English as the pivot language demonstrate the effectiveness of topic-based context in pivot-based SMT.
引用
收藏
页码:241 / 253
页数:13
相关论文
共 5 条
  • [1] Bilingual LSA-based adaptation for statistical machine translation[J] Yik-Cheung Tam;Ian Lane;Tanja Schultz Machine Translation 2008,
  • [2] Pivot language approach for phrase-based statistical machine translation[J] Hua Wu;Haifeng Wang Machine Translation 2008,
  • [3] Semi-supervised model adaptation for statistical machine translation[J] Nicola Ueffing;Gholamreza Haffari;Anoop Sarkar Machine Translation 2008, 2
  • [4] A systematic comparison of various statistical alignment models Franz Josef Och;Hermann Ney; Comput. Linguist 2003, 01
  • [5] B1EU: a method for automatic evaluation of machine translation Kishore Papineni;Salim Roukos;Todd Ward;Wei-Jing Zhu; Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics 2002,