Novel algorithm machine translation for language translation tool

被引:0
作者
Velmurugan, K. Jayasakthi [1 ]
Sumathy, G. [2 ]
Pradeep, K. V. [3 ]
机构
[1] Saveetha Inst Med & Tech Sci, Saveetha Sch Engn, Chennai, Tamil Nadu, India
[2] SRM Inst Sci & Technol, Dept Computat Intelligence, Chennai, Tamil Nadu, India
[3] VIT, Sch Comp Sci & Engn, Chennai Campus, Chennai, Tamil Nadu, India
关键词
fuzzy matching; machine translation; neural machine translation; optimization; spider web; translation memories;
D O I
10.1111/coin.12643
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fuzzy matching techniques are the presently used methods in translating the words. Neural machine translation and statistical machine translation are the methods used in MT. In machine translator tool, the strategy employed for translation needs to handle large amount of datasets and therefore the performance in retrieving correct matching output can be affected. In order to improve the matching score of MT, the advanced techniques can be presented by modifying the existing fuzzy based translator and neural machine translator. The conventional process of modifying architectures and encoding schemes are tedious process. Similarly, the preprocessing of datasets also involves more time consumption and memory utilization. In this article, a new spider web based searching enhanced translation is presented to be employed with the neural machine translator. The proposed scheme enables deep searching of available dataset to detect the accurate matching result. In addition, the quality of translation is improved by presenting an optimal selection scheme for using the sentence matches in source augmentation. The matches retrieved using various matching scores are applied to an optimization algorithm. The source augmentation using optimal retrieved matches increases the translation quality. Further, the selection of optimal match combination helps to reduce time requirement, since it is not necessary to test all retrieved matches in finding target sentence. The performance of translation is validated by measuring the quality of translation using BLEU and METEOR scores. These two scores can be achieved for the TA-EN language pairs in different configurations of about 92% and 86%, correspondingly. The results are evaluated and compared with other available NMT methods to validate the work.
引用
收藏
页数:26
相关论文
共 76 条
  • [1] AlRifaie MM., 2014, 2014 Federated Conference on Computer Science and Information Systems
  • [2] Kumar MA, 2014, Arxiv, DOI arXiv:1409.8581
  • [3] [Anonymous], About Us
  • [4] Artetxe M, 2018, Arxiv, DOI arXiv:1710.11041
  • [5] Bicici E., 2008, International Conference on Intelligent Text Processing and Computational Linguistics
  • [6] Neural Machine Translation
    Birch, Alexandra
    [J]. NATURAL LANGUAGE ENGINEERING, 2021, 27 (03) : 377 - 378
  • [7] Bulte B., Neural fuzzy repair: integrating fuzzy matches into neural machine translation. In: 57th Annual Meeting of the AssociationforComputationalLinguistics (ACL)
  • [8] 2019
  • [9] Cao Q., Encoding gated translation memory into neural machine translation. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
  • [10] 2018