Regression for machine translation evaluation at the sentence level

Cited by: 10
Authors
Albrecht, Joshua S. [1]
Hwa, Rebecca [1]
Affiliation
[1] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
Keywords
Machine translation; Evaluation metrics; Machine learning
DOI
10.1007/s10590-008-9046-1
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Machine learning offers a systematic framework for developing metrics that use multiple criteria to assess the quality of machine translation (MT). However, learning introduces additional complexities that may affect the resulting metric's effectiveness. First, a learned metric is more reliable for translations that are similar to its training examples; this calls into question whether it is as effective in evaluating translations from systems that are not its contemporaries. Second, metrics trained from different sets of training examples may exhibit variations in their evaluations. Third, expensive developmental resources (such as translations that have been evaluated by humans) may be needed as training examples. This paper investigates these concerns in the context of using regression to develop metrics for evaluating machine-translated sentences. We track a learned metric's reliability across a five-year period to measure the extent to which the learned metric can evaluate sentences produced by other systems. We compare metrics trained under different conditions to measure their variations. Finally, we present an alternative formulation of metric training in which the features are based on comparisons against pseudo-references in order to reduce the demand on human-produced resources. Our results confirm that regression is a useful approach for developing new metrics for MT evaluation at the sentence level.
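Illustrative sketch (not from the paper): the general idea of a regression-trained, sentence-level metric can be shown in a few lines of Python. Each translated sentence is mapped to a small feature vector by comparing it against one or more (pseudo-)references, and a regressor is fit to human quality scores. Everything here is an assumption made for illustration: the abstract does not name the features or the learner, so unigram precision and recall and a ridge-regularized least-squares fit are used as stand-ins, and the function names and toy training data are hypothetical.

```python
from collections import Counter


def overlap_features(hypothesis, references):
    """Return [bias, mean unigram precision, mean unigram recall].

    `references` may be human references or pseudo-references (for example,
    outputs of other MT systems); the feature choice is an assumption.
    """
    hyp = Counter(hypothesis.lower().split())
    hyp_len = sum(hyp.values())
    precisions, recalls = [], []
    for ref in references:
        ref_counts = Counter(ref.lower().split())
        ref_len = sum(ref_counts.values())
        matched = sum((hyp & ref_counts).values())  # clipped unigram matches
        precisions.append(matched / hyp_len if hyp_len else 0.0)
        recalls.append(matched / ref_len if ref_len else 0.0)
    n = max(len(references), 1)
    return [1.0, sum(precisions) / n, sum(recalls) / n]


def fit_ridge(rows, targets, lam=1e-3):
    """Solve (X^T X + lam*I) w = X^T y by Gaussian elimination."""
    d = len(rows[0])
    # Build the augmented matrix [X^T X + lam*I | X^T y].
    a = [
        [sum(r[i] * r[j] for r in rows) + (lam if i == j else 0.0) for j in range(d)]
        + [sum(r[i] * t for r, t in zip(rows, targets))]
        for i in range(d)
    ]
    for col in range(d):
        # Partial pivoting for numerical stability.
        pivot = max(range(col, d), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, d):
            factor = a[r][col] / a[col][col]
            for c in range(col, d + 1):
                a[r][c] -= factor * a[col][c]
    # Back substitution.
    w = [0.0] * d
    for i in reversed(range(d)):
        w[i] = (a[i][d] - sum(a[i][j] * w[j] for j in range(i + 1, d))) / a[i][i]
    return w


def learned_metric(weights, hypothesis, references):
    """Score one sentence with the learned linear metric."""
    feats = overlap_features(hypothesis, references)
    return sum(w * f for w, f in zip(weights, feats))


# Made-up toy data: (MT output, pseudo-references, human score in [0, 1]).
refs = ["the cat sat on the mat", "a cat sat on the mat"]
training = [
    ("the cat sat on the mat", refs, 1.0),
    ("the cat on mat", refs, 0.7),
    ("cat the mat sat", refs, 0.5),
    ("dog runs fast", refs, 0.0),
]
X = [overlap_features(h, r) for h, r, _ in training]
y = [s for _, _, s in training]
weights = fit_ridge(X, y)
print(learned_metric(weights, "the cat sat on a mat", refs))
```

Because the features only require some set of comparison sentences, the same code runs unchanged whether `references` holds human translations or pseudo-references, which is the property the abstract points to as a way of reducing the need for human-produced resources.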
Pages: 1-27
Page count: 27