Translation-based Supervision for Policy Generation in Simultaneous Neural Machine Translation

被引:0
作者
Alinejad, Ashkan [1 ]
Shavarani, Hassan S. [1 ]
Sarkar, Anoop [1 ]
机构
[1] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC, Canada
来源
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021) | 2021年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In simultaneous machine translation, finding an agent with the optimal action sequence of reads and writes that maintain a high level of translation quality while minimizing the average lag in producing target tokens remains an extremely challenging problem. We propose a novel supervised learning approach for training an agent that can detect the minimum number of reads required for generating each target token by comparing simultaneous translations against full-sentence translations during training to generate oracle action sequences. These oracle sequences can then be used to train a supervised model for action generation at inference time. Our approach provides an alternative to current heuristic methods in simultaneous translation by introducing a new training objective, which is easier to train than previous attempts at training the agent using reinforcement learning techniques for this task. Our experimental results show that our novel training method for action generation produces much higher quality translations while minimizing the average lag in simultaneous translation.
引用
收藏
页码:1734 / 1744
页数:11
相关论文
共 26 条
  • [1] Alinejad A, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P3022
  • [2] [Anonymous], 2016, ABSTR REINF LEARN WO
  • [3] Arivazhagan N, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P1313
  • [4] Arthur P, 2021, 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), P2709
  • [5] Cettolo M., 2014, P 11 INT WORKSHOP SP, V57
  • [6] Chiu C. C., 2018, 6 INT C LEARN REPR I
  • [7] Dalvi Fahim, 2018, PROC ACL, V2, P493, DOI [10.18653/v1/N18-2079, DOI 10.18653/V1/N18-2079]
  • [8] Dyer Chris, 2013, P 2013 C N AM CHAPTE, P644
  • [9] Efficient Wait-k Models for Simultaneous Machine Translation
    Elbayad, Maha
    Besacier, Laurent
    Verbeek, Jakob
    [J]. INTERSPEECH 2020, 2020, : 1461 - 1465
  • [10] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]