A syntactically informed reordering model for statistical machine translation

Cited by: 6
Authors
Farzi, Saeed [1]
Faili, Heshaam [1]
Khadivi, Shahram [2]
Affiliations
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran, Iran
[2] Amirkabir Univ Technol, Comp Engn & IT Dept, Human Language Technol & Machine Learning Lab, Tehran, Iran
Keywords
phrasal dependency tree; statistical reordering models; machine translation
DOI
10.1080/0952813X.2014.971439
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Word reordering is one of the challenging problems of machine translation, and it is an important factor in the quality and efficiency of machine translation systems. In this paper, we introduce a novel reordering model based on a new structure named the phrasal dependency tree. The phrasal dependency tree is a syntactic structure built on dependency relationships between contiguous non-syntactic phrases. The proposed model integrates syntactic and statistical information within a log-linear framework to deal with reordering problems. It draws on phrase dependencies, translation directions (orientations) and translation discontinuity between translated phrases. Compared with well-known reordering models such as the distortion, lexicalised and hierarchical models, the experimental study demonstrates the superiority of our model in terms of translation quality. Performance is evaluated for Persian -> English and English -> German translation tasks using the Tehran parallel corpus and the WMT07 benchmarks, respectively. The results show improvements of 1.54/1.7 and 1.98/3.01 points over the baseline in terms of BLEU/TER on the Persian -> English and German -> English translation tasks, respectively. On average, our model significantly improves precision while keeping recall comparable to that of the lexicalised and distortion models.
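For readers unfamiliar with the "orientations" mentioned in the abstract, the sketch below shows how orientation is commonly defined in the standard lexicalised reordering model that the paper uses as a point of comparison. It is not the phrasal dependency tree model itself, whose details the abstract does not give. The function name `orientation` and the `Span` type are hypothetical, and source spans are assumed to be inclusive word-index pairs.

```python
from typing import Tuple

# Assumption: a source phrase is an inclusive (start, end) pair of word indices.
Span = Tuple[int, int]


def orientation(prev: Span, curr: Span) -> str:
    """Classify the source span of the current phrase relative to the
    source span of the phrase translated immediately before it."""
    if curr[0] == prev[1] + 1:
        # Current phrase starts right after the previous one in the source.
        return "monotone"
    if curr[1] == prev[0] - 1:
        # Current phrase ends right before the previous one in the source.
        return "swap"
    # Any other placement leaves a gap between the two source spans.
    return "discontinuous"
```

A reordering model of this family typically estimates a probability for each orientation label and adds its logarithm as a feature in the log-linear model.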
Pages: 449-469
Page count: 21