Parallel feature weight decay algorithms for fast development of machine translation models

被引:1
|
作者
Bicici, Ergun
机构
[1] Istanbul, Turkey
关键词
Machine translation; Natural language processing; Instance selection;
D O I
10.1007/s10590-021-09275-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Parallel feature weight decay algorithms, parfwd, are engineered for language- and task-adaptive instance selection to build distinct machine translation (MT) models and enable the fast development of accurate MT using less data and computation. parfwd decay the weights of both source and target features to increase their average coverage. In a conference on MT (WMT), parfwd achieved the lowest translation error rate from French to English in 2015, and a rate 11.7% less than the top phrase-based statistical MT (PBSMT) in 2017. parfwd also achieved a rate 5.8% less than the top in TweetMT and the top from Catalan to English. BLEU upper bounds identify the translation directions that offer the largest room for relative improvement and MT models that use additional data. Performance trend angles show the power of MT models to convert unit data into unit translation results or more BLEU for an increase in coverage. The source coverage angle of parfwd in the 2013-2019 WMT reached + 6 degrees better than the top with 35 degrees for translation into English, and it was + 1.4 degrees better than the top with 22 degrees overall.
引用
收藏
页码:239 / 263
页数:25
相关论文
共 50 条
  • [1] Improved feature decay algorithms for statistical machine translation
    Poncelas, Alberto
    Wenniger, Gideon Maillette de Buy
    Way, Andy
    NATURAL LANGUAGE ENGINEERING, 2022, 28 (01) : 71 - 91
  • [2] Optimizing Instance Selection for Statistical Machine Translation with Feature Decay Algorithms
    Bicici, Ergun
    Yuret, Deniz
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (02) : 339 - 350
  • [3] Linguistically motivated statistical machine translation: models and algorithms
    Vandeghinste, Vincent
    MACHINE TRANSLATION, 2015, 29 (3-4) : 291 - 294
  • [4] Parallel Machine Scheduling Under Uncertainty: Models and Exact Algorithms
    Song, Guopeng
    Leus, Roel
    INFORMS JOURNAL ON COMPUTING, 2022, 34 (06) : 3059 - 3079
  • [5] Feature Selection with Evolving, Fast and Slow Using Two Parallel Genetic Algorithms
    Cetin, Uzay
    Gundogmus, Yunus Emre
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 699 - 703
  • [6] Improving Reordering Models with Phrase Number Feature for Statistical Machine Translation
    Noormohammadi, Neda
    Rahimi, Zahra
    Khadivi, Shahram
    ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING, AISP 2013, 2014, 427 : 227 - 233
  • [7] Co-training framework for feature weight optimization of statistic machine translation
    Liu, Shu-Jie
    Li, Chi-Ho
    Li, Mu
    Zhou, Ming
    Ruan Jian Xue Bao/Journal of Software, 2012, 23 (12): : 3101 - 3114
  • [9] Prediction and feature selection of low birth weight using machine learning algorithms
    Reza, Tasneem Binte
    Salma, Nahid
    JOURNAL OF HEALTH POPULATION AND NUTRITION, 2024, 43 (01)
  • [10] Fast and Robust Neural Network Joint Models for Statistical Machine Translation
    Devlin, Jacob
    Zbib, Rabih
    Huang, Zhongqiang
    Lamar, Thomas
    Schwartz, Richard
    Makhoul, John
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1370 - 1380