Bandit Structured Prediction for Neural Sequence-to-Sequence Learning

Cited by: 19
Authors
Kreutzer, Julia [1]
Sokolov, Artem [1]
Riezler, Stefan [1, 2]
Affiliations
[1] Heidelberg Univ, Computat Linguist, Heidelberg, Germany
[2] Heidelberg Univ, IWR, Heidelberg, Germany
Source
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1 | 2017
DOI
10.18653/v1/P17-1138
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Bandit structured prediction describes a stochastic optimization framework where learning is performed from partial feedback. This feedback is received in the form of a task loss evaluation to a predicted output structure, without having access to gold standard structures. We advance this framework by lifting linear bandit learning to neural sequence-to-sequence learning problems using attention-based recurrent neural networks. Furthermore, we show how to incorporate control variates into our learning algorithms for variance reduction and improved generalization. We present an evaluation on a neural machine translation task that shows improvements of up to 5.89 BLEU points for domain adaptation from simulated bandit feedback.
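The idea summarized in the abstract can be illustrated with a toy sketch. This is not the authors' neural model or their exact algorithm: it assumes a categorical distribution over a handful of candidate outputs in place of a sequence-to-sequence network, a score-function (REINFORCE-style) stochastic gradient for expected loss minimization, and a running average of observed losses as the control variate. The names `bandit_train`, `losses`, `lr`, and `use_baseline` are hypothetical.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def bandit_train(losses, steps=2000, lr=0.5, use_baseline=True, seed=0):
    """Minimize expected task loss from bandit (partial) feedback.

    losses[k] is the hidden task loss of candidate output k. The learner
    never sees the whole list at once: at each step it samples one output
    and observes only that output's loss.
    """
    rng = random.Random(seed)
    theta = [0.0] * len(losses)   # one score per candidate output (toy model)
    baseline, seen = 0.0, 0       # running average of observed losses
    for _ in range(steps):
        probs = softmax(theta)
        k = rng.choices(range(len(losses)), weights=probs)[0]
        feedback = losses[k]      # the only signal the learner receives
        # Control variate (assumption): subtract the average loss seen so far.
        signal = feedback - baseline if use_baseline else feedback
        # Score-function gradient: d/d theta_j log p(k) = 1[j == k] - p_j.
        for j in range(len(theta)):
            grad_log = (1.0 if j == k else 0.0) - probs[j]
            theta[j] -= lr * signal * grad_log
        seen += 1
        baseline += (feedback - baseline) / seen
    return softmax(theta)


if __name__ == "__main__":
    # Candidate 1 has the lowest loss; no gold-standard output is ever shown.
    print(bandit_train([0.9, 0.2, 0.7]))
```

Subtracting the baseline leaves the expected gradient unchanged, because the expectation of the score function is zero, while typically shrinking its variance. Setting `use_baseline=False` gives the plain estimator for comparison.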
Pages: 1503-1513
Page count: 11
Related papers
50 in total
  • [31] Sequence-to-Sequence model for Building Energy Consumption Prediction
    Kim, Marie
    Jun, JongAm
    Kim, Nasoo
    Song, YuJin
    Pyo, Cheol Sik
    2018 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2018, : 1243 - 1245
  • [32] EFFECT OF DATA REDUCTION ON SEQUENCE-TO-SEQUENCE NEURAL TTS
    Latorre, Javier
    Lachowicz, Jakub
    Lorenzo-Trueba, Jaime
    Merritt, Thomas
    Drugman, Thomas
    Ronanki, Srikanth
    Klimkov, Viacheslav
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7075 - 7079
  • [33] INVESTIGATION OF AN INPUT SEQUENCE ON THAI NEURAL SEQUENCE-TO-SEQUENCE SPEECH SYNTHESIS
    Janyoi, Pongsathon
    Thangthai, Ausdang
    2021 24TH CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2021, : 218 - 223
  • [34] Sequence-To-Sequence Learning for Online Imputation of Sensory Data
    Kaitai TONG
    Teng LI
    Instrumentation, 2019, 6 (02) : 63 - 70
  • [35] PREDICTION OF VESSEL TRAJECTORIES FROM AIS DATA VIA SEQUENCE-TO-SEQUENCE RECURRENT NEURAL NETWORKS
    Forti, Nicola
    Millefiori, Leonardo M.
    Braca, Paolo
    Willett, Peter
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8936 - 8940
  • [36] Prediction Model Design for Vibration Severity of Rotating Machine Based on Sequence-to-Sequence Neural Network
    Wang, Zhiqiang
    Qian, Hong
    Zhang, Dongliang
    Wei, Yingchen
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2019, 2019
  • [37] SEQUENCE-TO-SEQUENCE ASR OPTIMIZATION VIA REINFORCEMENT LEARNING
    Tjandra, Andros
    Sakti, Sakriani
    Nakamura, Satoshi
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5829 - 5833
  • [38] Sequence-to-Sequence Learning for Human Pose Correction in Videos
    Swetha, Sirnam
    Balasubramanian, Vineeth N.
    Jawahar, C. V.
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 298 - 303
  • [39] Compositional generalization through meta sequence-to-sequence learning
    Lake, Brenden M.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [40] Sequence-to-Sequence Deep Learning for Eye Movement Classification
    Startsev, Mikhail
    Agtzidis, Ioannis
    Dorr, Michael
    PERCEPTION, 2019, 48 : 200 - 200