Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey

被引：0

作者：

Saunders D. ^{[1
]}

机构：

[1] Cambridge University Engineering Department, Cambridge

来源：

Journal of Artificial Intelligence Research | 2022年 / 75卷

基金：

英国工程与自然科学研究理事会;

关键词：

461.4 Ergonomics and Human Factors Engineering - 721.1 Computer Theory; Includes Computational Logic; Automata Theory; Switching Theory; Programming Theory - 723.5 Computer Applications;

D O I：

10.1613/jair.1.13566

中图分类号：

学科分类号：

摘要：

The development of deep learning techniques has allowed Neural Machine Translation (NMT) models to become extremely powerful, given sufficient training data and training time. However, systems struggle when translating text from a new domain with a distinct style or vocabulary. Fine-tuning on in-domain data allows good domain adaptation, but requires sufficient relevant bilingual data. Even if this is available, simple fine-tuning can cause overfitting to new data and catastrophic forgetting of previously learned behaviour. We survey approaches to domain adaptation for NMT, particularly where a system may need to translate across multiple domains. We divide techniques into those revolving around data selection or generation, model architecture, parameter adaptation procedure, and inference procedure. We finally highlight the benefits of domain adaptation and multi-domain adaptation techniques to other lines of NMT research. © 2022 AI Access Foundation. All rights reserved.

引用

页码：351 / 424

页数：73