Context Sensitive Word Deletion Model for Statistical Machine Translation

被引:0
|
作者
Li, Qiang [1 ]
Han, Yaqian [1 ]
Xiao, Tong [1 ]
Zhu, Jingbo [1 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, NiuTrans Lab, Shenyang, Liaoning, Peoples R China
来源
CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2017 | 2017年 / 10565卷
基金
美国国家科学基金会;
关键词
Natural language processing; Statistical machine translation; Word deletion; ALIGNMENT;
D O I
10.1007/978-3-319-69005-6_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word deletion (WD) errors can lead to poor comprehension of the meaning of source translated sentences in phrase-based statistical machine translation (SMT), and have a critical impact on the adequacy of the translation results generated by SMT systems. In this paper, first we classify the word deletion into two categories, wanted and unwanted word deletions. For these two kinds of word deletions, we propose a maximum entropy based word deletion model to improve the translation quality in phrase-based SMT. Our proposed model are based on features automatically learned from a real-word bitext. In our experiments on Chinese-to-English news and web translation tasks, the results show that our approach is capable of generating more adequate translations compared with the baseline system, and our proposed word deletion model yields a +0.99 BLEU improvement and a -2.20 TER reduction on the NIST machine translation evaluation corpora.
引用
收藏
页码:73 / 84
页数:12
相关论文
共 50 条
  • [1] Better Addressing Word Deletion for Statistical Machine Translation
    Li, Qiang
    Zhang, Dongdong
    Li, Mu
    Xiao, Tong
    Zhu, Jingbo
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 91 - 102
  • [2] Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation
    Wang, Xing
    Tu, Zhaopeng
    Zhang, Min
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (12) : 2255 - 2266
  • [3] Grammatical and context-sensitive error correction using a statistical machine translation framework
    Ehsan, Nava
    Faili, Heshaam
    SOFTWARE-PRACTICE & EXPERIENCE, 2013, 43 (02): : 187 - 206
  • [4] A Novel Word Reordering Method for Statistical Machine Translation
    Zang, Shuo
    Zhao, Hai
    Wu, Chunyang
    Wang, Rui
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 843 - 848
  • [5] HMM word and phrase alignment for statistical machine translation
    Deng, Yonggang
    Byrne, William
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 494 - 507
  • [6] A Neural Approach to Source Dependence Based Context Model for Statistical Machine Translation
    Chen, Kehai
    Zhao, Tiejun
    Yang, Muyun
    Liu, Lemao
    Tamura, Akihiro
    Wang, Rui
    Utiyama, Masao
    Sumita, Eiichiro
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (02) : 266 - 280
  • [7] Statistical Machine Translation
    Vatsa, Mukesh G. S.
    Joshi, Nikita
    Goswami, Sumit
    DESIDOC JOURNAL OF LIBRARY & INFORMATION TECHNOLOGY, 2010, 30 (04): : 25 - 32
  • [8] Translation Model of Myanmar Phrases for Statistical Machine Translation
    Zin, Thet Thet
    Soe, Khin Mar
    Thein, Ni Lar
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 235 - +
  • [9] Syntactic Pattern Based Word Alignment for Statistical Machine Translation
    Le, Quang-Hung
    Le, Anh-Cuong
    INTERNATIONAL JOURNAL OF KNOWLEDGE AND SYSTEMS SCIENCE, 2014, 5 (03) : 36 - 45
  • [10] Statistical System of Word Re-ordering in Machine Translation
    Costa-Jussa, Marta R.
    Fonollosa, Jose A. R.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2006, (37): : 249 - 255