A syntactically informed reordering model for statistical machine translation

Cited by: 6
Authors
Farzi, Saeed [1 ]
Faili, Heshaam [1 ]
Khadivi, Shahram [2 ]
Affiliations
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran, Iran
[2] Amirkabir Univ Technol, Comp Engn & IT Dept, Human Language Technol & Machine Learning Lab, Tehran, Iran
Keywords
phrasal dependency tree; statistical reordering models; machine translation;
DOI
10.1080/0952813X.2014.971439
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Word reordering is one of the challenging problems in machine translation, and an important factor in the quality and efficiency of machine translation systems. In this paper, we introduce a novel reordering model based on an innovative structure named the phrasal dependency tree. The phrasal dependency tree is a syntactic structure based on dependency relationships between contiguous non-syntactic phrases. The proposed model integrates syntactic and statistical information within a log-linear model to deal with reordering problems. It exploits phrase dependencies, translation directions (orientations) and translation discontinuity between translated phrases. Compared with well-known reordering models such as the distortion, lexicalised and hierarchical models, the experimental study demonstrates the superiority of our model in terms of translation quality. Performance is evaluated on Persian -> English and English -> German translation tasks using the Tehran parallel corpus and WMT07 benchmarks, respectively. The results show improvements of 1.54/1.7 and 1.98/3.01 points over the baseline in terms of BLEU/TER on the Persian -> English and German -> English translation tasks, respectively. On average, our model had a significant impact on precision, with recall comparable to that of the lexicalised and distortion models.
Pages: 449-469
Number of pages: 21