Identifying bilingual Multi-Word Expressions for Statistical Machine Translation

被引:0
作者
Bouamor, Dhouha [1 ,2 ,3 ]
Semmar, Nasredine [1 ]
Zweigenbaum, Pierre [2 ,3 ]
机构
[1] CEA, LIST, Vis & Content Engn Lab, F-91191 Gif Sur Yvette, France
[2] CNRS, LIMSI, F-91403 Orsay, France
[3] Univ Paris 11, Orsay, France
来源
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2012年
关键词
bilingual Multi-Word Expression; Vector Space Model; Statistical Machine Translation;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
MultiWord Expressions (MWEs) repesent a key issue for numerous applications in Natural Language Processing (NLP) especially for Machine Translation (MT). In this paper, we describe a strategy for detecting translation pairs of MWEs in a French-English parallel corpus. In addition we introduce three methods aiming to integrate extracted bilingual MWES in MOSES, a phrase based Statistical Machine Translation (SMT) system. We experimentally show that these textual units can improve translation quality.
引用
收藏
页码:674 / 679
页数:6
相关论文
共 50 条
  • [41] Migrating Code with Statistical Machine Translation
    Anh Tuan Nguyen
    Tung Thanh Nguyen
    Nguyen, Tien N.
    [J]. 36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE COMPANION 2014), 2014, : 544 - 547
  • [42] Spatial Ontology in Statistical Machine Translation
    Skadins, Raivis
    [J]. DATABASES AND INFORMATION SYSTEMS, 2010, : 409 - 421
  • [43] Topic Adaptation for Statistical Machine Translation
    Taraghi, Mina
    Khadivi, Shahram
    [J]. 2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 2147 - 2152
  • [44] Improving English-Arabic statistical machine translation with morpho-syntactic and semantic word class
    Khemakhem I.T.
    Jamoussi S.
    Hamadou A.B.
    [J]. International Journal of Intelligent Systems Technologies and Applications, 2020, 19 (02) : 172 - 190
  • [45] TermFinder: log-likelihood comparison and phrase-based statistical machine translation models for bilingual terminology extraction
    Haque, Rejwanul
    Penkale, Sergio
    Way, Andy
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2018, 52 (02) : 365 - 400
  • [46] TermFinder: log-likelihood comparison and phrase-based statistical machine translation models for bilingual terminology extraction
    Rejwanul Haque
    Sergio Penkale
    Andy Way
    [J]. Language Resources and Evaluation, 2018, 52 : 365 - 400
  • [47] A Topic-Triggered Translation Model for Statistical Machine Translation
    SU Jinsong
    WANG Zhihao
    WU Qingqiang
    YAO Junfeng
    LONG Fei
    ZHANG Haiying
    [J]. ChineseJournalofElectronics, 2017, 26 (01) : 65 - 72
  • [48] A Topic-Triggered Translation Model for Statistical Machine Translation
    Su Jinsong
    Wang Zhihao
    Wu Qingqiang
    Yao Junfeng
    Long Fei
    Zhang Haiying
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2017, 26 (01) : 65 - 72
  • [49] The design and evaluation of a Statistical Machine Translation syllabus for translation students
    Doherty, Stephen
    Kenny, Dorothy
    [J]. INTERPRETER AND TRANSLATOR TRAINER, 2014, 8 (02) : 295 - 315
  • [50] Max margin learning for statistical machine translation: Toward improvement of machine translation accuracy
    Katsuhiko H.
    Taro W.
    Hajime T.
    Hideki I.
    Seiichi Y.
    [J]. Transactions of the Japanese Society for Artificial Intelligence, 2010, 25 (05) : 593 - 601