Lexically Constrained Decoding for Sequence Generation Using Grid Beam Search

Cited by: 155
Authors
Hokamp, Chris [1]
Liu, Qun [1]
Affiliations
[1] Dublin City Univ, ADAPT Ctr, Dublin, Ireland
Source
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1 | 2017
Funding
Science Foundation Ireland; European Union Horizon 2020
Keywords
DOI
10.18653/v1/P17-1141
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
We present Grid Beam Search (GBS), an algorithm which extends beam search to allow the inclusion of pre-specified lexical constraints. The algorithm can be used with any model that generates a sequence $\hat{y} = \{y_0 \ldots y_T\}$ by maximizing $p(y \mid x) = \prod_t p(y_t \mid x; \{y_0 \ldots y_{t-1}\})$. Lexical constraints take the form of phrases or words that must be present in the output sequence. This is a very general way to incorporate additional knowledge into a model's output without requiring any modification of the model parameters or training data. We demonstrate the feasibility and flexibility of Lexically Constrained Decoding by conducting experiments on Neural Interactive-Predictive Translation, as well as Domain Adaptation for Neural Machine Translation. Experiments show that GBS can provide large improvements in translation quality in interactive scenarios, and that, even without any user input, GBS can be used to achieve significant gains in performance in domain adaptation scenarios.
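The idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it handles single-word constraints only, keeps one beam per count of satisfied constraints, and assumes a hypothetical `step_logprobs` callback that returns next-token log-probabilities for a given prefix. The names `constrained_beam_search` and `toy_model`, and the toy probabilities, are all made up for illustration.

```python
import math

def constrained_beam_search(step_logprobs, constraints, max_len,
                            beam_size=3, eos="</s>"):
    """Simplified lexically constrained beam search (single-word constraints).

    step_logprobs(prefix) is an assumed callback returning a dict that maps
    every vocabulary token to log p(token | prefix).
    """
    n = len(constraints)
    # grid[c] is the beam of open hypotheses that cover c constraints.
    # A hypothesis is (log-prob, tokens, indices of covered constraints).
    grid = [[] for _ in range(n + 1)]
    grid[0] = [(0.0, (), frozenset())]
    finished = []
    for _ in range(max_len):
        new_grid = [[] for _ in range(n + 1)]
        for c, beam in enumerate(grid):
            for score, toks, covered in beam:
                logp = step_logprobs(toks)
                # Free generation: coverage count stays the same.
                for tok, lp in logp.items():
                    hyp = (score + lp, toks + (tok,), covered)
                    if tok == eos:
                        # Only hypotheses covering all constraints may end.
                        if c == n:
                            finished.append(hyp)
                    else:
                        new_grid[c].append(hyp)
                # Forced step: emit an uncovered constraint, move up one row.
                for i, word in enumerate(constraints):
                    if i not in covered:
                        new_grid[c + 1].append(
                            (score + logp[word], toks + (word,), covered | {i}))
        # Prune every row of the grid to the beam size.
        grid = [sorted(b, key=lambda h: h[0], reverse=True)[:beam_size]
                for b in new_grid]
    if not finished:
        raise ValueError("no hypothesis satisfied all constraints")
    best = max(finished, key=lambda h: h[0])
    return best[0], list(best[1])

def toy_model(prefix):
    # Hypothetical context-independent distribution, for illustration only.
    probs = {"the": 0.4, "cat": 0.3, "dog": 0.05, "</s>": 0.25}
    return {tok: math.log(p) for tok, p in probs.items()}

score, tokens = constrained_beam_search(toy_model, ["dog"], max_len=4)
print(tokens)  # ['dog', '</s>']
```

Although "dog" is the least likely word under the toy model, the output must contain it, because a hypothesis may only terminate from the row in which every constraint is covered. Multi-token phrase constraints, which the abstract also mentions, would need extra bookkeeping that this sketch omits.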
Pages: 1535-1546
Number of pages: 12