Context-aware Decoder for Neural Machine Translation using a Target-side Document-Level Language Model

被引:0
|
作者
Sugiyama, Amane [1 ,3 ]
Yoshinaga, Naoki [2 ]
机构
[1] Univ Tokyo, Tokyo, Japan
[2] Univ Tokyo, Inst Ind Sci, Tokyo, Japan
[3] Mitsubishi UFJ Morgan Stanley Secur, Tokyo, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although many end-to-end context-aware neural machine translation models have been proposed to incorporate inter-sentential contexts in translation, these models can be trained only in domains where parallel documents with sentential alignments exist. We therefore present a simple method to perform context-aware decoding with any pre-trained sentence-level translation model by using a document-level language model. Our context-aware decoder is built upon sentence-level parallel data and target-side document-level monolingual data. From a theoretical viewpoint, our core contribution is the novel representation of contextual information using point-wise mutual information between context and the current sentence. We demonstrate the effectiveness of our method on English to Russian translation, by evaluating with BLEU and contrastive tests for context-aware translation.
引用
收藏
页码:5781 / 5791
页数:11
相关论文
共 50 条
  • [1] Target-Side Augmentation for Document-Level Machine Translation
    Bao, Guangsheng
    Teng, Zhiyang
    Zhang, Yue
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 10725 - 10740
  • [2] Document-level Mongolian-Chinese Neural Machine Translation Incorporating Target-Side Data Augmentation and Topic Information
    Sun, Kangkang
    Tian, Yonghong
    Zheng, Xiaoli
    Ma, Chang
    2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 204 - 209
  • [3] Document-level Neural Machine Translation Using BERT as Context Encoder
    Guo, Zhiyu
    Minh Le Nguyen
    AACL-IJCNLP 2020: THE 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2020, : 94 - 100
  • [4] Domain-Aware Word Segmentation for Chinese Language: A Document-Level Context-Aware Model
    Huang, Kaiyu
    Xiao, Keli
    Mo, Fengran
    Jin, Bo
    Liu, Zhuang
    Huang, Degen
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (02)
  • [5] Document-Level Neural Machine Translation With Recurrent Context States
    Zhao, Yue
    Liu, Hui
    IEEE ACCESS, 2023, 11 : 27519 - 27526
  • [6] CONTEXT-ADAPTIVE DOCUMENT-LEVEL NEURAL MACHINE TRANSLATION
    Zhang, Linlin
    Zhang, Zhirui
    Chen, Boxing
    Luo, Weihua
    Si, Luo
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6232 - 6236
  • [7] Document Flattening: Beyond Concatenating Context for Document-Level Neural Machine Translation
    Wu, Minghao
    Foster, George
    Qu, Lizhen
    Haffari, Gholamreza
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 448 - 462
  • [8] Routing Based Context Selection for Document-Level Neural Machine Translation
    Fei, Weilun
    Jian, Ping
    Zhu, Xiaoguang
    Lin, Yi
    MACHINE TRANSLATION, CCMT 2021, 2021, 1464 : 77 - 91
  • [9] Document-Level Neural Machine Translation with Hierarchical Modeling of Global Context
    Tan, Xin
    Zhang, Long-Yin
    Zhou, Guo-Dong
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2022, 37 (02) : 295 - 308
  • [10] Toward Understanding Most of the Context in Document-Level Neural Machine Translation
    Choi, Gyu-Hyeon
    Shin, Jong-Hun
    Lee, Yo-Han
    Kim, Young-Kil
    ELECTRONICS, 2022, 11 (15)