Example-based machine translation based on tree-string correspondence and statistical generation

被引:6
|
作者
Liu, Zhangyi [1 ]
Wang, Haifeng [1 ]
Wu, Hua [1 ]
机构
[1] Toshiba China Res & Dev Ctr, 501,Tower W2,Oriental Plaza,1,East Chang An Ave, Beijing 100738, Peoples R China
关键词
Example-based machine translation; Translation example; Tree-string correspondence; Statistical generation;
D O I
10.1007/s10590-006-9016-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes an example-based machine translation (EBMT) method based on tree-string correspondence (TSC) and statistical generation. In this method, the translation example is represented as a TSC, which is a triple consisting of a parse tree in the source language, a string in the target language, and the correspondence between the leaf node of the source-language tree and the sub-string of the target-language string. For an input sentence to be translated, it is first parsed into a tree. Then the TSC forest which best matches the input tree is searched for. Finally the translation is generated using a statistical generation model to combine the target-language strings of the TSCs. The generation model consists of three features: the semantic similarity between the tree in the TSC and the input tree, the translation probability of translating the source word into the target word, and the language-model probability for the target-language string. Based on the above method, we build an English-to-Chinese MT system. Experimental results indicate that the performance of our system is comparable with phrase-based statistical MT systems.
引用
收藏
页码:25 / 41
页数:17
相关论文
empty
未找到相关数据