A General Approach for Word Reordering in English-Vietnamese-English Statistical Machine Translation

被引:2
作者
Nguyen, Nhung T. H. [1 ]
Le, Vinh Q. [1 ]
Minh-Quoc Nghiem [1 ]
Dien Dinh [1 ]
机构
[1] Univ Sci, Fac Informat Technol, Hochiminh City, Vietnam
关键词
Statistical machine translation; word order; reordering rule;
D O I
10.1142/S0218213015500244
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word ordering is among the most important problems in machine translation. In this paper, we describe a general approach to solve this problem in English-Vietnamese-English statistical machine translation. Our model automatically extracts short-range and long-range reordering rules based on part-of-speech tags and alignment information. Our method, therefore, covers both local and global word order, and is more versatile than other methods. To obtain a better set of reordering rules, we omit generated rules if their weight is lower than a threshold a. The experimental results have shown that the translation quality has been improved significantly compared to the distance-based reordering model and comparable to the lexicalized model. Our approach is not only suitable for English-Vietnamese but also for language pairs which have many differences in syntax, such as English-Chinese and Chinese-Vietnamese.
引用
收藏
页数:23
相关论文
共 30 条
[1]  
[Anonymous], PRICAI 2008 TRENDS A
[2]  
[Anonymous], P EMNLP
[3]  
[Anonymous], P 2003 C N AM CHAPT
[4]  
[Anonymous], P 1 INT WORKSH VIETN
[5]  
[Anonymous], P 6 IEEE INT C COMP
[6]  
[Anonymous], 2002, P INT C SPOKEN LANGU
[7]  
[Anonymous], JORN TECHL HABL
[8]  
[Anonymous], P C HUM LANG TECHN E
[9]  
[Anonymous], P INT WORKSH SPOK LA
[10]  
[Anonymous], P EMNLP CONLL 2007