Transformer Machine Translation Model Incorporating Word Alignment Structure

被引：0

作者：

Xi, Haixu ^{[1
,2
]}

Zhang, Feng ^{[3
]}

Wang, Yintong ^{[4
]}

机构：

[1] Jiangsu Univ Technol, Sch Comp Engn, Changzhou 213001, Jiangsu, Peoples R China

[2] Nanjing Univ Sci & Technol, Sch Econ & Management, Nanjing 210094, Peoples R China

[3] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Jiangsu, Peoples R China

[4] Nanjing Xiaozhuang Univ, Sch Informat Engn, Nanjing 211171, Jiangsu, Peoples R China

来源：

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | 2024年 / 70卷 / 01期

关键词：

Training; Decoding; Vocabulary; Transformers; Feeds; Convolution; Acute respiratory distress syndrome; Statistical machine translation; vocabulary alignment; word alignment structure; transformer model; BLOCKCHAIN;

D O I：

10.1109/TCE.2023.3332878

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In the domain of machine translation (MT) processing, end-to-end neural machine translation (NMT) has emerged as a remarkable breakthrough, surpassing the conventional statistical MT approaches. Inspired by the Internet of Things (IoT) technology, some researchers are exploring how to integrate device-to-device communication patterns into NMT to enhance translation efficiency. However, the current state-of-the-art NMT models predominantly adopt sequence-based representations for both the source language and target language sentences. The lack of natural language sentence structure attributes leads to problems such as unfaithful translation in NMT. To enhance lexical alignment in NMT, the paper proposes a new transformer MT model that incorporates vocabulary alignment structure. The model receives external lexical alignment information during each step of the decoding process in the decoder design to alleviate the problem of missing lexical alignment structures. During the decoding phase of the model, the statistical MT system plays a crucial role by supplying relevant lexical alignment information derived from the decoding information obtained from the NMT. Additionally, the model suggests vocabulary recommendations based on this lexical alignment information. The experimental results provide evidence that this approach successfully integrates the vocabulary knowledge derived from statistical MT, leading to improved translation performance.

引用

页码：1010 / 1019

页数：10

共 43 条

[1] [Anonymous], 2013, EMNLP
[2] Bahdanau D., 2015, PROC 3 INT C LEARN, P1221
[3] Barone AVM, 2017, Arxiv, DOI arXiv:1707.07631
[4] SF-FWA: A Self-Adaptive Fast Fireworks Algorithm for effective large-scale optimization
Chen, Maiyue
Tan, Ying
[J]. SWARM AND EVOLUTIONARY COMPUTATION, 2023, 80
[5] Cho KYHY, 2014, Arxiv, DOI [arXiv:1406.1078, 10.48550/arXiv.1406.1078, DOI 10.3115/V1/D14-1179]
[6] A Survey of Multilingual Neural Machine Translation
Dabre, Raj
Chu, Chenhui
Kunchukuttan, Anoop
[J]. ACM COMPUTING SURVEYS, 2020, 53 (05)
[7] A Diffused Memetic Optimizer for reactive berth allocation and scheduling at marine container terminals in response to disruptions
Dulebenets, Maxim A.
[J]. SWARM AND EVOLUTIONARY COMPUTATION, 2023, 80
[8] An Adaptive Polyploid Memetic Algorithm for scheduling trucks at a cross-docking terminal
Dulebenets, Maxim A.
[J]. INFORMATION SCIENCES, 2021, 565 : 390 - 421
[9] Feng Z., 2023, Formal Analysis for Natural Language Processing: A Handbook, P597
[10] Framework for Handling Rare Word Problems in Neural Machine Translation System Using Multi-Word Expressions
Garg, Kamal Deep
Shekhar, Shashi
Kumar, Ajit
Goyal, Vishal
Sharma, Bhisham
Chengoden, Rajeswari
Srivastava, Gautam
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (21):

← 1 2 3 4 5 →