Identifying bilingual Multi-Word Expressions for Statistical Machine Translation

被引:0
作者
Bouamor, Dhouha [1 ,2 ,3 ]
Semmar, Nasredine [1 ]
Zweigenbaum, Pierre [2 ,3 ]
机构
[1] CEA, LIST, Vis & Content Engn Lab, F-91191 Gif Sur Yvette, France
[2] CNRS, LIMSI, F-91403 Orsay, France
[3] Univ Paris 11, Orsay, France
来源
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2012年
关键词
bilingual Multi-Word Expression; Vector Space Model; Statistical Machine Translation;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
MultiWord Expressions (MWEs) repesent a key issue for numerous applications in Natural Language Processing (NLP) especially for Machine Translation (MT). In this paper, we describe a strategy for detecting translation pairs of MWEs in a French-English parallel corpus. In addition we introduce three methods aiming to integrate extracted bilingual MWES in MOSES, a phrase based Statistical Machine Translation (SMT) system. We experimentally show that these textual units can improve translation quality.
引用
收藏
页码:674 / 679
页数:6
相关论文
共 50 条
  • [31] Discourse in Statistical Machine Translation
    Hardmeier, Christian
    DISCOURS-REVUE DE LINGUISTIQUE PSYCHOLINGUISTIQUE ET INFORMATIQUE, 2012, (11):
  • [32] A critique of Statistical Machine Translation
    Way, Andy
    LINGUISTICA ANTVERPIENSIA NEW SERIES-THEMES IN TRANSLATION STUDIES, 2009, 8 : 17 - 41
  • [33] Improvement of Word Alignment in Thai-English Statistical Machine Translation by Grammatical Attributes Identification
    Phodong, Kanyalag
    Kongkachandra, Rachada
    2016 8TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI), 2016,
  • [34] Reduction of Neural Machine Translation Failures by Incorporating Statistical Machine Translation
    Dugonik, Jani
    Maucec, Mirjam Sepesy
    Verber, Domen
    Brest, Janez
    MATHEMATICS, 2023, 11 (11)
  • [35] ANALYSIS ON BILINGUAL MACHINE TRANSLATION SYSTEMS FOR ENGLISH AND TAMIL
    Sangavi, G.
    Mrinalini, K.
    Vijayalakshmi, P.
    2016 INTERNATIONAL CONFERENCE ON COMPUTATION OF POWER, ENERGY INFORMATION AND COMMUNICATION (ICCPEIC), 2016, : 245 - 250
  • [36] Syntax-Based Chinese-Vietnamese Tree-to-Tree Statistical Machine Translation with Bilingual Features
    Gao, Shengxiang
    Huang, Jihao
    Xue, Mingya
    Yu, Zhengtao
    Wang, Zhuo
    Zhang, Yang
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (04)
  • [37] STATISTICAL VERSUS NEURAL MACHINE TRANSLATION - A CASE STUDY FOR A MEDIUM SIZE DOMAIN-SPECIFIC BILINGUAL CORPUS
    Jassem, Krzysztof
    Dwojak, Tomasz
    POZNAN STUDIES IN CONTEMPORARY LINGUISTICS, 2019, 55 (02) : 491 - 519
  • [38] Translation Model of Myanmar Phrases for Statistical Machine Translation
    Zin, Thet Thet
    Soe, Khin Mar
    Thein, Ni Lar
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 235 - +
  • [39] Statistical machine translation for Indic languages
    Das, Sudhansu Bala
    Panda, Divyajyoti
    Mishra, Tapas Kumar
    Patra, Bidyut Kr.
    NATURAL LANGUAGE PROCESSING, 2025, 31 (02): : 328 - 345
  • [40] Paraphrase Lattice for Statistical Machine Translation
    Onishi, Takashi
    Utiyama, Masao
    Sumita, Eiichiro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (06) : 1299 - 1305