Context Sensitive Word Deletion Model for Statistical Machine Translation

Authors
Li, Qiang [1]
Han, Yaqian [1]
Xiao, Tong [1]
Zhu, Jingbo [1]
Affiliation
[1] Northeastern Univ, Sch Comp Sci & Engn, NiuTrans Lab, Shenyang, Liaoning, Peoples R China
Source
CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2017 | 2017 / Vol. 10565
Funding
US National Science Foundation
Keywords
Natural language processing; Statistical machine translation; Word deletion; Alignment
DOI
10.1007/978-3-319-69005-6_7
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Word deletion (WD) errors can obscure the meaning of the source sentence in translations produced by phrase-based statistical machine translation (SMT), and they have a critical impact on the adequacy of the translation results generated by SMT systems. In this paper, we first classify word deletions into two categories: wanted and unwanted. To handle both kinds, we propose a maximum entropy based word deletion model that improves translation quality in phrase-based SMT. The proposed model is based on features automatically learned from a real-world bitext. In experiments on Chinese-to-English news and web translation tasks, our approach generates more adequate translations than the baseline system, and the proposed word deletion model yields a +0.99 BLEU improvement and a 2.20-point TER reduction on the NIST machine translation evaluation corpora.
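The abstract does not give the feature set or the training procedure, so the following is only a rough sketch of the general idea: a binary maximum entropy classifier (equivalent to logistic regression) that scores whether deleting a source word is wanted or unwanted. The feature strings, the toy examples, and the names `train_maxent` and `prob_wanted_deletion` are all hypothetical and are not taken from the paper.

```python
import math
from collections import defaultdict

def prob_wanted_deletion(weights, feats):
    """P(deletion is wanted | features) under a binary maximum entropy model."""
    score = sum(weights.get(f, 0.0) for f in feats)
    return 1.0 / (1.0 + math.exp(-score))

def train_maxent(examples, epochs=200, lr=0.5):
    """Fit the feature weights by batch gradient ascent on the log-likelihood."""
    weights = defaultdict(float)
    for _ in range(epochs):
        grad = defaultdict(float)
        for feats, label in examples:
            p = prob_wanted_deletion(weights, feats)
            for f in feats:
                # Gradient of the log-likelihood: observed minus expected count.
                grad[f] += label - p
        for f, g in grad.items():
            weights[f] += lr * g / len(examples)
    return weights

# Hypothetical toy data: label 1 = wanted deletion, 0 = unwanted deletion.
examples = [
    (["word=的", "pos=DEG"], 1),   # function word, often safe to drop
    (["word=了", "pos=AS"], 1),    # aspect marker, often safe to drop
    (["word=经济", "pos=NN"], 0),  # content noun, should be translated
    (["word=访问", "pos=VV"], 0),  # content verb, should be translated
]

model = train_maxent(examples)
print(prob_wanted_deletion(model, ["word=的", "pos=DEG"]))
print(prob_wanted_deletion(model, ["word=经济", "pos=NN"]))
```

In a phrase-based decoder, a score of this kind would typically enter the log-linear model as one more feature, but how the paper integrates it is not stated in this record.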
Pages: 73-84
Page count: 12