Bandit Structured Prediction for Neural Sequence-to-Sequence Learning

Cited by: 19
Authors
Kreutzer, Julia [1]
Sokolov, Artem [1]
Riezler, Stefan [1, 2]
Affiliations
[1] Heidelberg Univ, Computat Linguist, Heidelberg, Germany
[2] Heidelberg Univ, IWR, Heidelberg, Germany
Source
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1 | 2017
DOI
10.18653/v1/P17-1138
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Bandit structured prediction describes a stochastic optimization framework where learning is performed from partial feedback. This feedback is received in the form of a task loss evaluation to a predicted output structure, without having access to gold standard structures. We advance this framework by lifting linear bandit learning to neural sequence-to-sequence learning problems using attention-based recurrent neural networks. Furthermore, we show how to incorporate control variates into our learning algorithms for variance reduction and improved generalization. We present an evaluation on a neural machine translation task that shows improvements of up to 5.89 BLEU points for domain adaptation from simulated bandit feedback.
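The learning setup in the abstract can be illustrated with a toy sketch. It is not the authors' neural sequence-to-sequence system: it assumes a softmax policy over a handful of discrete candidate outputs, a score-function gradient of the expected loss, and a running-average baseline as one simple kind of control variate. The function names, the loss values, and the hyperparameters are all hypothetical.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def train_bandit(losses, steps=5000, lr=0.1, use_baseline=True, seed=0):
    """Minimise expected loss from bandit feedback only (toy assumption).

    `losses[k]` is the task loss of candidate output k. The learner never
    sees the whole list, only the loss of the output it sampled.
    """
    rng = random.Random(seed)
    theta = [0.0] * len(losses)   # one score per candidate output
    baseline = 0.0                # running mean of previously observed losses
    for t in range(1, steps + 1):
        probs = softmax(theta)
        # Sample one output from the current policy.
        y = rng.choices(range(len(losses)), weights=probs)[0]
        loss = losses[y]          # partial feedback for the sampled output
        # Control variate: subtract a baseline built from past losses only,
        # which leaves the gradient estimate unbiased.
        signal = loss - baseline if use_baseline else loss
        for k in range(len(theta)):
            # d/d theta_k of log p(y) for a softmax policy.
            grad_log_p = (1.0 if k == y else 0.0) - probs[k]
            theta[k] -= lr * signal * grad_log_p
        baseline += (loss - baseline) / t
    return softmax(theta)


if __name__ == "__main__":
    final_probs = train_bandit([0.9, 0.2, 0.6])
    print(final_probs)
```

In this sketch the baseline changes only the variance of the update, not its expectation, because it does not depend on the output sampled at the current step.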
Pages: 1503-1513
Page count: 11