Machine translation and its evaluation: a study

被引:8
|
作者
Mondal, Subrota Kumar [1 ]
Zhang, Haoxi [1 ]
Kabir, H. M. Dipu [2 ]
Ni, Kan [1 ]
Dai, Hong-Ning [3 ]
机构
[1] Macau Univ Sci & Technol, Sch Comp Sci & Engn, Taipa 999078, Macao, Peoples R China
[2] Deakin Univ, Geelong, Vic, Australia
[3] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
关键词
Natural Language Processing; Computational linguistics; Statistical machine translation; Neural machine translation; Evaluation methods;
D O I
10.1007/s10462-023-10423-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine translation (namely MT) has been one of the most popular fields in computational linguistics and Artificial Intelligence (AI). As one of the most promising approaches, MT can potentially break the language barrier of people from all over the world. Despite a number of studies in MT, there are few studies in summarizing and comparing MT methods. To this end, in this paper, we principally focus on presenting the two mainstream MT schemes: statistical machine translation (SMT) and neural machine translation (NMT), including their basic rationales and developments. Meanwhile, the detailed translation models are also presented, such as the word-based model, syntax-based model, and phrase-based model in statistical machine translation. Similarly, approaches in NMT, such as the recurrent neural network-based, attention mechanism-based, and transformer-based models are presented. Last but not least, the evaluation approaches also play an important role in helping developers to improve their methods better in MT. The prevailing machine translation evaluation methodologies are also presented in this article.
引用
收藏
页码:10137 / 10226
页数:90
相关论文
共 50 条
  • [41] Study on Machine translation approaches for Indian languages and their challenges
    Sindhu, D., V
    Sagar, B. M.
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2016, : 262 - 267
  • [42] UPC: An Open Word-Sense Annotated Parallel Corpora for Machine Translation Study
    Van-Hai Vu
    Quang-Phuoc Nguyen
    Shin, Joon-Choul
    Ock, Cheol-Young
    APPLIED SCIENCES-BASEL, 2020, 10 (11):
  • [43] Evaluation of Arabic Machine Translation System Based on the Universal Networking Language
    Adly, Noha
    Al Ansary, Sameh
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2010, 5723 : 243 - 257
  • [44] Linguistically Motivated Evaluation of English-Latvian Statistical Machine Translation
    Skadina, Inguna
    Levane-Petrova, Kristine
    Rabante, Guna
    HUMAN LANGUAGE TECHNOLOGIES: THE BALTIC PERSPECTIVE, 2012, 247 : 221 - 229
  • [45] A review of machine transliteration, translation, evaluation metrics and datasets in Indian Languages
    Jha, Abhinav
    Patil, Hemprasad Yashwant
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (15) : 23509 - 23540
  • [46] Optimizing Non-Decomposable Evaluation Metrics for Neural Machine Translation
    Shen, Shi-Qi
    Liu, Yang
    Sun, Mao-Song
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2017, 32 (04) : 796 - 804
  • [47] A review of machine transliteration, translation, evaluation metrics and datasets in Indian Languages
    Abhinav Jha
    Hemprasad Yashwant Patil
    Multimedia Tools and Applications, 2023, 82 : 23509 - 23540
  • [48] Optimizing Non-Decomposable Evaluation Metrics for Neural Machine Translation
    Shi-Qi Shen
    Yang Liu
    Mao-Song Sun
    Journal of Computer Science and Technology, 2017, 32 : 796 - 804
  • [49] A review of Thai-English machine translation
    Lyons, Seamus
    MACHINE TRANSLATION, 2020, 34 (2-3) : 197 - 230
  • [50] Case-Sensitive Neural Machine Translation
    Shi, Xuewen
    Huang, Heyan
    Jian, Ping
    Tang, Yi-Kun
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 : 662 - 674