Identifying bilingual Multi-Word Expressions for Statistical Machine Translation

被引:0
|
作者
Bouamor, Dhouha [1 ,2 ,3 ]
Semmar, Nasredine [1 ]
Zweigenbaum, Pierre [2 ,3 ]
机构
[1] CEA, LIST, Vis & Content Engn Lab, F-91191 Gif Sur Yvette, France
[2] CNRS, LIMSI, F-91403 Orsay, France
[3] Univ Paris 11, Orsay, France
来源
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2012年
关键词
bilingual Multi-Word Expression; Vector Space Model; Statistical Machine Translation;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
MultiWord Expressions (MWEs) repesent a key issue for numerous applications in Natural Language Processing (NLP) especially for Machine Translation (MT). In this paper, we describe a strategy for detecting translation pairs of MWEs in a French-English parallel corpus. In addition we introduce three methods aiming to integrate extracted bilingual MWES in MOSES, a phrase based Statistical Machine Translation (SMT) system. We experimentally show that these textual units can improve translation quality.
引用
收藏
页码:674 / 679
页数:6
相关论文
共 50 条
  • [1] Extracting Bilingual Multi-word Expressions for Low-resource Statistical Machine Translation
    Wei, Linyu
    Li, Miao
    Chen, Lei
    Yang, Zhenxin
    Sun, Kai
    Yuan, Man
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 21 - 24
  • [2] Utilization of Multi-word Expressions to Improve Statistical Machine Translation of Statutory Sentences
    Sakamoto, Satomi
    Ogawa, Yasuhiro
    Nakamura, Makoto
    Ohno, Tomohiro
    Toyama, Katsuhiko
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2017, 10091 : 249 - 264
  • [3] Machine translation and human translation of multi-word expressions: peeling this pineapple
    Rebechi, Rozane Rodrigues
    Marcon, Nathalia Oliva
    Faller, Guilherme de Almeida
    REVISTA VIRTUAL DE ESTUDOS DA LINGUAGEM-REVEL, 2025, 23 (44): : 346 - 380
  • [4] Multi-word Expressions in English-Latvian Machine Translation
    Skadina, Inguna
    BALTIC JOURNAL OF MODERN COMPUTING, 2016, 4 (04): : 811 - 825
  • [5] Framework for Handling Rare Word Problems in Neural Machine Translation System Using Multi-Word Expressions
    Garg, Kamal Deep
    Shekhar, Shashi
    Kumar, Ajit
    Goyal, Vishal
    Sharma, Bhisham
    Chengoden, Rajeswari
    Srivastava, Gautam
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [6] Verbal Multi-Word Expressions in Yiddish
    Liebeskind, Chaya
    HaCohen-Kerner, Yaakov
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2018), 2018, 10859 : 205 - 216
  • [7] Integrating Multi-source Bilingual Information for Chinese Word Segmentation in Statistical Machine Translation
    Chen, Wei
    Wei, Wei
    Chen, Zhenbiao
    Xu, Bo
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, 2013, 8208 : 61 - 72
  • [8] Automatic Translation of Multi-word Labels
    Protaziuk, Grzegorz
    Kaczynski, Marcin
    Bembenik, Robert
    MACHINE INTELLIGENCE AND BIG DATA IN INDUSTRY, 2016, 19 : 99 - 109
  • [9] The variability of multi-word verbal expressions in Estonian
    Kadri Muischnek
    Heiki-Jaan Kaalep
    Language Resources and Evaluation, 2010, 44 : 115 - 135
  • [10] A framework for the inclusion of multi-word expressions in ELT
    Martinez, Ron
    ELT JOURNAL, 2013, 67 (02) : 184 - 198