Transformer Machine Translation Model Incorporating Word Alignment Structure

被引:0
作者
Xi, Haixu [1 ,2 ]
Zhang, Feng [3 ]
Wang, Yintong [4 ]
机构
[1] Jiangsu Univ Technol, Sch Comp Engn, Changzhou 213001, Jiangsu, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Econ & Management, Nanjing 210094, Peoples R China
[3] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Jiangsu, Peoples R China
[4] Nanjing Xiaozhuang Univ, Sch Informat Engn, Nanjing 211171, Jiangsu, Peoples R China
关键词
Training; Decoding; Vocabulary; Transformers; Feeds; Convolution; Acute respiratory distress syndrome; Statistical machine translation; vocabulary alignment; word alignment structure; transformer model; BLOCKCHAIN;
D O I
10.1109/TCE.2023.3332878
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the domain of machine translation (MT) processing, end-to-end neural machine translation (NMT) has emerged as a remarkable breakthrough, surpassing the conventional statistical MT approaches. Inspired by the Internet of Things (IoT) technology, some researchers are exploring how to integrate device-to-device communication patterns into NMT to enhance translation efficiency. However, the current state-of-the-art NMT models predominantly adopt sequence-based representations for both the source language and target language sentences. The lack of natural language sentence structure attributes leads to problems such as unfaithful translation in NMT. To enhance lexical alignment in NMT, the paper proposes a new transformer MT model that incorporates vocabulary alignment structure. The model receives external lexical alignment information during each step of the decoding process in the decoder design to alleviate the problem of missing lexical alignment structures. During the decoding phase of the model, the statistical MT system plays a crucial role by supplying relevant lexical alignment information derived from the decoding information obtained from the NMT. Additionally, the model suggests vocabulary recommendations based on this lexical alignment information. The experimental results provide evidence that this approach successfully integrates the vocabulary knowledge derived from statistical MT, leading to improved translation performance.
引用
收藏
页码:1010 / 1019
页数:10
相关论文
共 43 条
  • [1] [Anonymous], 2013, EMNLP
  • [2] Bahdanau D., 2015, PROC 3 INT C LEARN, P1221
  • [3] Barone AVM, 2017, Arxiv, DOI arXiv:1707.07631
  • [4] SF-FWA: A Self-Adaptive Fast Fireworks Algorithm for effective large-scale optimization
    Chen, Maiyue
    Tan, Ying
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2023, 80
  • [5] Cho KYHY, 2014, Arxiv, DOI [arXiv:1406.1078, 10.48550/arXiv.1406.1078, DOI 10.3115/V1/D14-1179]
  • [6] A Survey of Multilingual Neural Machine Translation
    Dabre, Raj
    Chu, Chenhui
    Kunchukuttan, Anoop
    [J]. ACM COMPUTING SURVEYS, 2020, 53 (05)
  • [7] A Diffused Memetic Optimizer for reactive berth allocation and scheduling at marine container terminals in response to disruptions
    Dulebenets, Maxim A.
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2023, 80
  • [8] An Adaptive Polyploid Memetic Algorithm for scheduling trucks at a cross-docking terminal
    Dulebenets, Maxim A.
    [J]. INFORMATION SCIENCES, 2021, 565 : 390 - 421
  • [9] Feng Z., 2023, Formal Analysis for Natural Language Processing: A Handbook, P597
  • [10] Framework for Handling Rare Word Problems in Neural Machine Translation System Using Multi-Word Expressions
    Garg, Kamal Deep
    Shekhar, Shashi
    Kumar, Ajit
    Goyal, Vishal
    Sharma, Bhisham
    Chengoden, Rajeswari
    Srivastava, Gautam
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (21):