Neural machine translation for limited resources English-Nyishi pair

被引:0
作者
Nabam Kakum
Sahinur Rahman Laskar
Koj Sambyo
Partha Pakray
机构
[1] National Institute of Technology,Department of Computer Science and Engineering
[2] Arunachal Pradesh,School of Computer Science
[3] University of Petroleum and Energy Studies,Department of Computer Science and Engineering
[4] National Institute of Technology,undefined
[5] Silchar,undefined
来源
Sādhanā | / 48卷
关键词
English-Nyishi; NMT; low-resource; corpus;
D O I
暂无
中图分类号
学科分类号
摘要
Neural machine translation handles sequential data over the variable length of input and output sentences and accomplishes a state-of-the-art method for the task of machine translation. Although the neural machine translation shows good performance in both low and high-resource language pairs translation, it requires adequate parallel training data. In low-resource language sets, the preparation of the corpus is strenuous and time-consuming. Automatic translation systems like Google and Bing cover under-resourced Indian languages, but lack the support of the Nyishi language. It is due to the lack of a suitable dataset. In this work, we have contributed a parallel corpus of low-resource language pairs, English-Nyishi, and reported comparative experiments on the baseline neural machine translation systems. The results are evaluated for English to Nyishi and vice-versa via well-known automatic evaluation metrics and manual evaluation.
引用
收藏
相关论文
共 24 条
  • [1] Candy L(2021)An improved English-to-Mizo neural machine translation Transactions on Asian and Low-Resource Language Information Processing 20 1-21
  • [2] Badal S(2019)English-mizo machine translation using neural and statistical approaches Neural Computing and Applications 31 7615-7631
  • [3] Partha P(2020)Low resource neural machine translation: A benchmark for five African languages Africa NLP workshop at ICLR 2020 1-10
  • [4] Amarnath P(2021)Neural machine translation of low-resource languages using smt phrase pair injection Natural Language Engineering 27 271-292
  • [5] Partha P(2019)Filtered pseudo-parallel corpus improves low-resource neural machine translation ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 19 1-16
  • [6] Jereemi B(2019)Neural machine translation for Indian languages Journal of Intelligent Systems 28 465-477
  • [7] Surafel ML(2018)Attention-based English to Punjabi neural machine translation Journal of Intelligent & Fuzzy Systems 34 1551-1559
  • [8] Matteo N(2009)The meteor metric for automatic evaluation of machine translation Machine Translation 23 105-115
  • [9] Marco T(undefined)undefined undefined undefined undefined-undefined
  • [10] Sukanta S(undefined)undefined undefined undefined undefined-undefined