Regression for machine translation evaluation at the sentence level

被引:10
作者
Albrecht, Joshua S. [1 ]
Hwa, Rebecca [1 ]
机构
[1] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
关键词
Machine translation; Evaluation metrics; Machine learning;
D O I
10.1007/s10590-008-9046-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning offers a systematic framework for developing metrics that use multiple criteria to assess the quality of machine translation (MT). However, learning introduces additional complexities that may impact on the resulting metric's effectiveness. First, a learned metric is more reliable for translations that are similar to its training examples; this calls into question whether it is as effective in evaluating translations from systems that are not its contemporaries. Second, metrics trained from different sets of training examples may exhibit variations in their evaluations. Third, expensive developmental resources (such as translations that have been evaluated by humans) may be needed as training examples. This paper investigates these concerns in the context of using regression to developmetrics for evaluating machine-translated sentences. We track a learned metric's reliability across a 5 year period to measure the extent to which the learned metric can evaluate sentences produced by other systems. We compare metrics trained under different conditions to measure their variations. Finally, we present an alternative formulation of metric training in which the features are based on comparisons against pseudo-references in order to reduce the demand on human produced resources. Our results confirm that regression is a useful approach for developing new metrics for MT evaluation at the sentence level.
引用
收藏
页码:1 / 27
页数:27
相关论文
共 50 条
  • [21] SBSim: A Sentence-BERT Similarity-Based Evaluation Metric for Indian Language Neural Machine Translation Systems
    Mrinalini, K.
    Vijayalakshmi, P.
    Nagarajan, T.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1396 - 1406
  • [22] Converting the Format of the Generalized Action Sentence in Chinese-English Machine Translation
    Liu, Zhi-ying
    Jin, Yao-hong
    Zhu, Yun
    Guo, Yan-bo
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFTWARE ENGINEERING (AISE 2014), 2014, : 236 - 240
  • [23] LONG SENTENCE PARTITIONING USING TOP-DOWN ANALYSIS FOR MACHINE TRANSLATION
    Yin, Baosheng
    Zuo, Junjun
    Ye, Na
    2012 IEEE 2nd International Conference on Cloud Computing and Intelligent Systems (CCIS) Vols 1-3, 2012, : 1425 - 1429
  • [24] A Survey on Evaluation Metrics for Machine Translation
    Lee, Seungjun
    Lee, Jungseob
    Moon, Hyeonseok
    Park, Chanjun
    Seo, Jaehyung
    Eo, Sugyeong
    Koo, Seonmin
    Lim, Heuiseok
    MATHEMATICS, 2023, 11 (04)
  • [25] Non-native Language Reading Support with Display of Machine Translation Based on Eye-Tracking and Sentence-Level Mapping
    Ho, Tien-Yu
    Wang, Hao-Chuan
    Lai, Shong-Hong
    PROCEEDINGS OF CHINESE CHI 2018: SIXTH INTERNATIONAL SYMPOSIUM OF CHINESE CHI (CHINESE CHI 2018), 2018, : 57 - 63
  • [26] Machine translation evaluation with neural networks
    Guzman, Francisco
    Joty, Shafiq
    Marquez, Lluis
    Nakov, Preslav
    COMPUTER SPEECH AND LANGUAGE, 2017, 45 : 180 - 200
  • [27] Improving the Rule based Machine Translation System using Sentence Simplification (English to Tamil)
    Kavirajan, B.
    Kumar, Anand M.
    Soman, K. P.
    Rajendran, S.
    Vaithehi, S.
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 957 - 963
  • [28] Building Sentiment Lexicons for Mainland Scandinavian Languages Using Machine Translation and Sentence Embeddings
    Liu, Peng
    Marco, Cristina
    Gulla, Jon Atle
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 2816 - 2825
  • [29] The machine translationness: a concept applied to the evaluation of machine translation systems
    More Lopez, Joaquim
    Climent Roca, Salvador
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2006, (37): : 233 - 240
  • [30] A review of machine transliteration, translation, evaluation metrics and datasets in Indian Languages
    Jha, Abhinav
    Patil, Hemprasad Yashwant
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (15) : 23509 - 23540