LKMT: Linguistics Knowledge-Driven Multi-Task Neural Machine Translation for Urdu and English

Cited by: 0
Authors
Hassan, Muhammad Naeem Ul [1 ,2 ]
Yu, Zhengtao [1 ,2 ]
Wang, Jian [1 ,2 ]
Li, Ying [1 ,2 ]
Gao, Shengxiang [1 ,2 ]
Yang, Shuwan [1 ,2 ]
Mao, Cunli [1 ,2 ]
Affiliations
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Peoples R China
[2] Kunming Univ Sci & Technol, Yunnan Key Lab Artificial Intelligence, Kunming 650500, Peoples R China
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2024, Vol. 81, No. 01
Funding
National Natural Science Foundation of China
Keywords
Urdu NMT (neural machine translation); Urdu natural language processing; Urdu linguistic features; low-resource language; linguistic-feature pre-trained model
DOI
10.32604/cmc.2024.054673
Chinese Library Classification (CLC)
TP [Automation Technology, Computer Technology]
Subject Classification Code
0812
Abstract
Thanks to the strong representation capability of pre-trained language models, supervised machine translation models have achieved outstanding performance. However, their performance drops sharply when the parallel training corpus is limited. Since pre-trained language models excel at monolingual representation, the key challenge for machine translation is to build an in-depth relationship between the source and target languages by injecting lexical and syntactic information into the pre-trained model. To alleviate the dependence on the parallel corpus, we propose a Linguistics Knowledge-Driven Multi-Task (LKMT) approach that injects part-of-speech and syntactic knowledge into pre-trained models, thus enhancing machine translation performance. On the one hand, we integrate part-of-speech and dependency labels into the embedding layer and exploit a large-scale monolingual corpus to update all parameters of the pre-trained language model, ensuring that the updated model encodes latent lexical and syntactic information. On the other hand, we leverage an extra self-attention layer to explicitly inject linguistic knowledge into the pre-trained-language-model-enhanced machine translation model. Experiments on the benchmark dataset show that our proposed LKMT approach improves Urdu-English translation accuracy by 1.97 points and English-Urdu translation accuracy by 2.42 points, highlighting the effectiveness of the LKMT framework. Detailed ablation experiments confirm the positive impact of part-of-speech tagging and dependency parsing on machine translation.
Pages: 951-969
Number of pages: 19
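The abstract describes two concrete injection points: part-of-speech and dependency labels added at the embedding layer of the pre-trained model, and an extra attention layer that fuses linguistic knowledge into the translation model. The following PyTorch sketch illustrates that idea only; it is not the authors' implementation, and the module names, dimensions (d_model, n_pos_tags, n_dep_labels), the sum-then-LayerNorm fusion at the embedding layer, and the choice to let encoder states attend over the linguistic embeddings are all illustrative assumptions.

```python
# Minimal sketch (assumptions, not the LKMT release) of:
# (1) token + POS + dependency-label embeddings summed at the embedding layer, and
# (2) an extra attention layer fusing linguistic features into the encoder output.
import torch
import torch.nn as nn


class LinguisticEmbedding(nn.Module):
    """Sum token, POS-tag, and dependency-label embeddings (injection point 1)."""

    def __init__(self, vocab_size, n_pos_tags, n_dep_labels, d_model=512):
        super().__init__()
        self.tok = nn.Embedding(vocab_size, d_model)
        self.pos_tag = nn.Embedding(n_pos_tags, d_model)
        self.dep_label = nn.Embedding(n_dep_labels, d_model)
        self.norm = nn.LayerNorm(d_model)

    def forward(self, token_ids, pos_ids, dep_ids):
        # All three id tensors share the shape (batch, seq_len).
        return self.norm(self.tok(token_ids) + self.pos_tag(pos_ids) + self.dep_label(dep_ids))


class LinguisticFusionLayer(nn.Module):
    """Extra attention layer (injection point 2): encoder states attend over linguistic features."""

    def __init__(self, d_model=512, n_heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.norm = nn.LayerNorm(d_model)

    def forward(self, encoder_states, linguistic_states):
        # Attend from the translation encoder states to the linguistic embeddings,
        # then apply a residual connection and layer normalization.
        fused, _ = self.attn(encoder_states, linguistic_states, linguistic_states)
        return self.norm(encoder_states + fused)


if __name__ == "__main__":
    # Toy shapes only; a real setup would reuse the pre-trained model's vocabulary
    # and obtain POS / dependency ids from an Urdu or English tagger and parser.
    emb = LinguisticEmbedding(vocab_size=32000, n_pos_tags=20, n_dep_labels=40)
    fuse = LinguisticFusionLayer()
    tokens = torch.randint(0, 32000, (2, 16))
    pos = torch.randint(0, 20, (2, 16))
    dep = torch.randint(0, 40, (2, 16))
    ling = emb(tokens, pos, dep)
    enc = torch.randn(2, 16, 512)  # stand-in for the pre-trained encoder output
    out = fuse(enc, ling)
    print(out.shape)  # torch.Size([2, 16, 512])
```

In practice the POS and dependency ids would have to be aligned to the pre-trained model's subword tokenization before being fed to the embedding layer; how the paper handles that alignment is not specified in the abstract.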