Bandit Structured Prediction for Neural Sequence-to-Sequence Learning

被引:19
|
作者
Kreutzer, Julia [1 ]
Sokolov, Artem [1 ]
Riezler, Stefan [1 ,2 ]
机构
[1] Heidelberg Univ, Computat Linguist, Heidelberg, Germany
[2] Heidelberg Univ, IWR, Heidelberg, Germany
来源
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1 | 2017年
关键词
D O I
10.18653/v1/P17-1138
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Bandit structured prediction describes a stochastic optimization framework where learning is performed from partial feedback. This feedback is received in the form of a task loss evaluation to a predicted output structure, without having access to gold standard structures. We advance this framework by lifting linear bandit learning to neural sequence-to-sequence learning problems using attention-based recurrent neural networks. Furthermore, we show how to incorporate control variates into our learning algorithms for variance reduction and improved generalization. We present an evaluation on a neural machine translation task that shows improvements of up to 5.89 BLEU points for domain adaptation from simulated bandit feedback.
引用
收藏
页码:1503 / 1513
页数:11
相关论文
共 50 条
  • [21] Neural AMR: Sequence-to-Sequence Models for Parsing and Generation
    Konstas, Ioannis
    Iyer, Srinivasan
    Yatskar, Mark
    Choi, Yejin
    Zettlemoyer, Luke
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 146 - 157
  • [22] The impact of memory on learning sequence-to-sequence tasks
    Seif, Alireza
    Loos, Sarah A. M.
    Tucci, Gennaro
    Roldan, Edgar
    Goldt, Sebastian
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (01):
  • [23] Sequence-to-sequence modeling for graph representation learning
    Aynaz Taheri
    Kevin Gimpel
    Tanya Berger-Wolf
    Applied Network Science, 4
  • [24] Optimal approach to sequence-to-sequence prediction: Applications in bioinformatics
    Nguyen, MN
    Rajapakse, JC
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 2254 - 2258
  • [25] Remaining Useful Life Prediction Based on Normalizing Flow Embedded Sequence-to-Sequence Learning
    Yang, Haosen
    Ding, Keqin
    Qiu, Robert C.
    Mi, Tiebin
    IEEE TRANSACTIONS ON RELIABILITY, 2021, 70 (04) : 1342 - 1354
  • [26] Recurrent neural network-based semantic variational autoencoder for Sequence-to-sequence learning
    Jang, Myeongjun
    Seo, Seungwan
    Kang, Pilsung
    INFORMATION SCIENCES, 2019, 490 : 59 - 73
  • [27] Named Entity Transliteration with Sequence-to-Sequence Neural Network
    Li, Zhongwei
    Chng, Eng Siong
    Li, Haizhou
    2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 374 - 378
  • [28] Learning Neural Sequence-to-Sequence Models from Weak Feedback with Bipolar Ramp Loss
    Jehl, Laura
    Lawrence, Carotin
    Riezler, Stefan
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2019, 7 : 233 - 248
  • [29] Neural Abstractive Text Summarization with Sequence-to-Sequence Models
    Shi, Tian
    Keneshloo, Yaser
    Ramakrishnan, Naren
    Reddy, Chandan K.
    ACM/IMS Transactions on Data Science, 2021, 2 (01):
  • [30] Enhancing Sequence-to-Sequence Neural Lemmatization with External Resources
    Milintsevich, Kirill
    Sirts, Kairit
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 3112 - 3122