Adversarial Training for Unknown Word Problems in Neural Machine Translation

被引：4

作者：

Ji, Yatu ^{[1
]}

Hou, Hongxu ^{[1
]}

Chen, Junjie ^{[1
]}

Wu, Nier ^{[1
]}

机构：

[1] Inner Mongolia Univ, Comp Sci Dept, Hohhot 010021, Peoples R China

来源：

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING | 2020年 / 19卷 / 01期

关键词：

Neural machine translation; UNK; generative adversarial network; value iteration;

D O I：

10.1145/3342482

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Nearly all of the work in neural machine translation (NMT) is limited to a quite restricted vocabulary, crudely treating all other words the same as an < unk > symbol. For the translation of language with abundant morphology, unknown (UNK) words also come from the misunderstanding of the translation model to the morphological changes. In this study, we explore two ways to alleviate the UNK problem in NMT: a new generative adversarial network (added value constraints and semantic enhancement) and a preprocessing technique that mixes morphological noise. The training process is like a win-win game in which the players are three adversarial sub models (generator, filter, and discriminator). In this game, the filter is to emphasize the discriminator's attention to the negative generations that contain noise and improve the training efficiency. Finally, the discriminator cannot easily discriminate the negative samples generated by the generator with filter and human translations. The experimental results show that the proposed method significantly improves over several strong baseline models across various language pairs and the newly emerged Mongolian-Chinese task is state-of-the-art.

引用

页数：12

共 26 条

[1] [Anonymous], 2015, NIPS
[2] [Anonymous], 2013, PREPRINT ARXIV 1308
[3] [Anonymous], 2016, Asynchronous methods for deep reinforcement learning
[4] [Anonymous], ARXIV43555
[5] [Anonymous], 2016, ARXIV160905473
[6] [Anonymous], 2017, P 31 INT C NEURAL IN
[7] Bahdanau D., 2014, ABS14090473 CORR
[8] Gehring J, 2017, PR MACH LEARN RES, V70
[9] Pointing the Unknown Words
Gulcehre, Caglar
Ahn, Sungjin
Nallapati, Ramesh
Zhou, Bowen
Bengio, Yoshua
[J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 140 - 149
[10] Jean S, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, P1

← 1 2 3 →