Comparability of Corpora in Human and Machine Translation

Cited by: 0
Authors
Lapshinova-Koltunski, Ekaterina [1]
Pal, Santanu [1]
Affiliation
[1] Univ Saarland, D-66123 Saarbrucken, Germany
Source
LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2014
Keywords
comparable corpora; paraphrases; machine translation; register analysis; registerial features;
DOI
Not available
Chinese Library Classification (CLC) number
H0 [Linguistics];
Discipline classification code
030303 ; 0501 ; 050102 ;
Abstract
In this study, we demonstrate a negative result from a work on comparable corpora which forces us to address a problem of comparability in both human and machine translation. We state that it is not always defined similarly, and comparable corpora used in contrastive linguistics or human translation analysis cannot always be applied for statistical machine translation (SMT). So, we revise the definition of comparability and show that some notions from translatology, i.e. registerial features, should also be considered in machine translation (MT).
Pages: 7