A syntactically informed reordering model for statistical machine translation

被引:6
作者
Farzi, Saeed [1 ]
Faili, Heshaam [1 ]
Khadivi, Shahram [2 ]
机构
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran, Iran
[2] Amirkabir Univ Technol, Comp Engn & IT Dept, Human Language Technol & Machine Learning Lab, Tehran, Iran
关键词
phrasal dependency tree; statistical reordering models; machine translation;
D O I
10.1080/0952813X.2014.971439
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word reordering is one of the challengeable problems of machine translation. It is an important factor of quality and efficiency of machine translation systems. In this paper, we introduce a novel reordering model based on an innovative structure, named, phrasal dependency tree. The phrasal dependency tree is a modern syntactic structure which is based on dependency relationships between contiguous non-syntactic phrases. The proposed model integrates syntactical and statistical information in the context of log-linear model aimed at dealing with the reordering problems. It benefits from phrase dependencies, translation directions (orientations) and translation discontinuity between translated phrases. In comparison with well-known and popular reordering models such as distortion, lexicalised and hierarchical models, the experimental study demonstrates the superiority of our model in terms of translation quality. Performance is evaluated for Persian -> English and English -> German translation tasks using Tehran parallel corpus and WMT07 benchmarks, respectively. The results report 1.54/1.7 and 1.98/3.01 point improvements over the baseline in terms of BLEU/TER metrics on Persian -> English and German -> English translation tasks, respectively. On average our model retrieved a significant impact on precision with comparable recall value with respect to the lexicalised and distortion models.
引用
收藏
页码:449 / 469
页数:21
相关论文
共 50 条
  • [31] MTIL2017: Machine Translation Using Recurrent Neural Network on Statistical Machine Translation
    Mahata, Sainik Kumar
    Das, Dipankar
    Bandyopadhyay, Sivaji
    JOURNAL OF INTELLIGENT SYSTEMS, 2019, 28 (03) : 447 - 453
  • [32] Source-side Reordering to Improve Machine Translation between Languages with Distinct Word Orders
    Arora, Karunesh Kumar
    Agrawal, Shyam Sunder
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (04)
  • [33] Improved feature decay algorithms for statistical machine translation
    Poncelas, Alberto
    Wenniger, Gideon Maillette de Buy
    Way, Andy
    NATURAL LANGUAGE ENGINEERING, 2022, 28 (01) : 71 - 91
  • [34] Measuring word alignment quality for statistical machine translation
    Fraser, Alexander
    Marcu, Daniel
    COMPUTATIONAL LINGUISTICS, 2007, 33 (03) : 293 - 303
  • [35] FACTORED PHRASE-BASED STATISTICAL MACHINE TRANSLATION
    Tufis, Dan
    Ceausu, Alexandru
    FROM SPEECH PROCESSING TO SPOKEN LANGUAGE TECHNOLOGY, 2009, : 115 - 124
  • [36] Collecting and Using Comparable Corpora for Statistical Machine Translation
    Skadina, Inguna
    Aker, Ahmet
    Mastropavlos, Nikos
    Su, Fangzhong
    Tufis, Dan
    Verlic, Mateja
    Vasiljevs, Andrejs
    Babych, Bogdan
    Clough, Paul
    Gaizauskas, Robert
    Glaros, Nikos
    Paramita, Monica Lestari
    Pinnis, Marcis
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 438 - 445
  • [37] Better Addressing Word Deletion for Statistical Machine Translation
    Li, Qiang
    Zhang, Dongdong
    Li, Mu
    Xiao, Tong
    Zhu, Jingbo
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 91 - 102
  • [38] Statistical machine translation based on weighted syntax–semantics
    Debajyoty Banik
    Asif Ekbal
    Pushpak Bhattacharyya
    Sādhanā, 2020, 45
  • [39] A Statistical Method for Selecting Noun Sense in Machine Translation
    Choe, Changil
    Kim, Hyonil
    Choe, Yongjin
    2012 THIRD INTERNATIONAL CONFERENCE ON THEORETICAL AND MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE (ICTMF 2012), 2013, 38 : 705 - 709
  • [40] Machine Learning Based Optimized Pruning Approach for Decoding in Statistical Machine Translation
    Banik, Debajyoty
    Ekbal, Asif
    Bhattacharyya, Pushpak
    IEEE ACCESS, 2019, 7 : 1736 - 1751