Meta Learning for Hyperparameter Optimization in Dialogue System

被引:24
作者
Chien, Jen-Tzung [1 ]
Lieow, Wei Xiang [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu, Taiwan
来源
INTERSPEECH 2019 | 2019年
关键词
dialogue system; meta learning; Bayesian optimization; recurrent neural network;
D O I
10.21437/Interspeech.2019-1383
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
The performance of dialogue system based on deep reinforcement learning (DRL) highly depends on the selected hyperparameters in DRL algorithms. Traditionally, Gaussian process (GP) provides a probabilistic approach to Bayesian optimization for sequential search which is beneficial to select optimal hyperparameter. However, GP suffers from the expanding computation when the dimension of hyperparameters and the number of search points are increased. This paper presents a meta learning approach to carry out multifidelity Bayesian optimization where a two-level recurrent neural network (RNN) is developed for sequential learning and optimization. The search space is explored via the first-level RNN with cheap and low fidelity over a global region of hyperparameters. The optimization is then exploited and leveraged by the second-level RNN with a high fidelity on the successively small regions. The experiments on the hyperparameter optimization for dialogue system based on the deep Q network show the effectiveness and efficiency by using the proposed multifidelity Bayesian optimization.
引用
收藏
页码:839 / 843
页数:5
相关论文
共 27 条
[1]  
Andrychowicz M, 2016, ADV NEUR IN, V29
[2]  
[Anonymous], 2015, Bayesian Speech and Language Processing
[3]  
[Anonymous], 2016, ADV NEURAL INFORM PR
[4]  
Chen L., 2015, SIGDIAL, P407, DOI [10.18653/v1/W15-4653, DOI 10.18653/V1/W15-4653]
[5]  
Chen YT, 2017, Arxiv, DOI arXiv:1611.03824
[6]  
Chien J.-T, 2019, PROC ANN M ASS COMPU
[7]  
Chien JT, 2019, INT CONF ACOUST SPEE, P3202, DOI [10.1109/icassp.2019.8683771, 10.1109/ICASSP.2019.8683771]
[8]   Bayesian Recurrent Neural Network for Language Modeling [J].
Chien, Jen-Tzung ;
Ku, Yuan-Chu .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (02) :361-374
[9]   Nonstationary Source Separation Using Sequential and Variational Bayesian Learning [J].
Chien, Jen-Tzung ;
Hsieh, Hsin-Lung .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (05) :681-694
[10]  
Dernoncourt F, 2016, IEEE W SP LANG TECH, P406, DOI 10.1109/SLT.2016.7846296