Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey

被引:0
|
作者
Saunders D. [1 ]
机构
[1] Cambridge University Engineering Department, Cambridge
来源
Journal of Artificial Intelligence Research | 2022年 / 75卷
基金
英国工程与自然科学研究理事会;
关键词
461.4 Ergonomics and Human Factors Engineering - 721.1 Computer Theory; Includes Computational Logic; Automata Theory; Switching Theory; Programming Theory - 723.5 Computer Applications;
D O I
10.1613/jair.1.13566
中图分类号
学科分类号
摘要
The development of deep learning techniques has allowed Neural Machine Translation (NMT) models to become extremely powerful, given sufficient training data and training time. However, systems struggle when translating text from a new domain with a distinct style or vocabulary. Fine-tuning on in-domain data allows good domain adaptation, but requires sufficient relevant bilingual data. Even if this is available, simple fine-tuning can cause overfitting to new data and catastrophic forgetting of previously learned behaviour. We survey approaches to domain adaptation for NMT, particularly where a system may need to translate across multiple domains. We divide techniques into those revolving around data selection or generation, model architecture, parameter adaptation procedure, and inference procedure. We finally highlight the benefits of domain adaptation and multi-domain adaptation techniques to other lines of NMT research. © 2022 AI Access Foundation. All rights reserved.
引用
收藏
页码:351 / 424
页数:73
相关论文
共 50 条
  • [1] Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey
    Saunders, Danielle
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 75 : 351 - 424
  • [2] Effective domain awareness and adaptation approach via mask substructure for multi-domain neural machine translation
    Huang, Shuanghong
    Guo, Junjun
    Yu, Zhengtao
    Wen, Yonghua
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (19): : 14047 - 14060
  • [3] Effective domain awareness and adaptation approach via mask substructure for multi-domain neural machine translation
    Shuanghong Huang
    Junjun Guo
    Zhengtao Yu
    Yonghua Wen
    Neural Computing and Applications, 2023, 35 : 14047 - 14060
  • [4] Improving the Quality Trade-Off for Neural Machine Translation Multi-Domain Adaptation
    Hasler, Eva
    Domhan, Tobias
    Trenous, Jonay
    Tran, Ke
    Byrne, Bill
    Hieber, Felix
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8470 - 8477
  • [5] Vocabulary Adaptation for Domain Adaptation in Neural Machine Translation
    Sato, Shoetsu
    Sakuma, Jin
    Yoshinaga, Naoki
    Toyoda, Masashi
    Kitsuregawa, Masaru
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4269 - 4279
  • [6] Learning Domain Specific Sub-layer Latent Variable for Multi-domain Adaptation Neural Machine Translation
    Huang, Shuanghong
    Feng, Chong
    Shi, Ge
    Li, Zhengjun
    Zhao, Xuan
    Li, Xinyan
    Wang, Xiaomei
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (06)
  • [7] Unsupervised Domain Adaptation for Neural Machine Translation
    Yang, Zhen
    Chen, Wei
    Wang, Feng
    Xu, Bo
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 338 - 343
  • [8] A Domain Adaptation Method for Neural Machine Translation
    Tian, Xiaohu
    Liu, Jin
    Pu, Jiachen
    Wang, Jin
    ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING, MUE/FUTURETECH 2018, 2019, 518 : 321 - 326
  • [9] Sentence Embedding for Neural Machine Translation Domain Adaptation
    Wang, Rui
    Finch, Andrew
    Utiyama, Masao
    Sumita, Eiichiro
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 560 - 566
  • [10] Curriculum Learning for Domain Adaptation in Neural Machine Translation
    Zhang, Xuan
    Shapiro, Pamela
    Kumar, Gaurav
    McNamee, Paul
    Carpuat, Marine
    Duh, Kevin
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1903 - 1915