Incremental Sparse Bayesian Method for Online Dialog Strategy Learning

被引:8
作者
Lee, Sungjin [1 ]
Eskenazi, Maxine [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
关键词
Incremental learning; reinforcement learning; sparse Bayesian modeling; statistical dialog modeling; value function approximation;
D O I
10.1109/JSTSP.2012.2229963
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes an incremental sparse Bayesian learning method to allow continuous dialog strategy learning from the interactions with real users. Since conventional reinforcement learning (RL) methods require a huge number of dialogs to reach convergence, it has been essential to use a simulated user in training dialog policies. The disadvantage of this approach is that the trained dialog policies always lag behind the optimal one for live users. In order to tackle this problem, a few studies applying online RL methods to dialog management have emerged and showed very promising results. However, these methods are limited to learning online the weight parameters of the basis functions in the model and so need batch learning on a fixed data set or some heuristics to find appropriate values for other meta parameters such as sparsity-controlling thresholds, basis function parameters, and noise parameters. The proposed method attempts to overcome this limitation to achieve fully incremental and fast dialog strategy learning by adopting a sparse Bayesian learning method for value function approximation. In order to verify the proposed method, three different experimental conditions have been used: artificial data, a simulated user, and real users. The experiment on the artificial data showed that the proposed method successfully learns all the parameters in an incremental manner. Also, the experiment on training and evaluating dialog policies with a simulated user clearly demonstrated that the proposed method is much faster than conventional RL methods. A live user study showed that the dialog strategy learned from real users performed as good as the best past systems, although it slightly underperformed the one trained on simulated dialogs due to the difficulty of user feedback elicitation.
引用
收藏
页码:903 / 916
页数:14
相关论文
共 50 条
  • [41] Incremental Learning Method for Biological Signal Identification
    Oyama, Tadahiro
    Karungaru, Stephen
    Tsuge, Satoru
    Mitsukura, Yasue
    Fukumi, Minoru
    13TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING, VOLS 1-3, 2009, 23 (1-3): : 302 - +
  • [42] An incremental learning method for spoof fingerprint detection
    Kho, Jun Beom
    Lee, Wonjune
    Choi, Heeseung
    Kim, Jaihie
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 116 : 52 - 64
  • [43] Unsupervised images segmentation via incremental dictionary learning based sparse representation
    Yang, Shuyuan
    Lv, Yuan
    Ren, Yu
    Yang, Lixia
    Jiao, Licheng
    INFORMATION SCIENCES, 2014, 269 : 48 - 59
  • [44] Adaptive Threshold Hierarchical Incremental Learning Method
    Li, Xingyu
    Dong, Shengbo
    Su, Qiya
    Yu, Muyao
    Li, Xinzhi
    IEEE ACCESS, 2023, 11 : 12285 - 12293
  • [45] Traffic Classification Based on Incremental Learning Method
    Sun, Guanglu
    Li, Shaobo
    Chen, Teng
    Su, Yangyang
    Lang, Fei
    ADVANCED HYBRID INFORMATION PROCESSING, 2018, 219 : 341 - 348
  • [46] Incremental Active Learning Method for Supervised ISOMAP
    Zhang, Guopeng
    Huang, Rui
    Chen, Junli
    IMAGE AND GRAPHICS (ICIG 2017), PT II, 2017, 10667 : 395 - 404
  • [47] Generative Pseudorehearsal Strategy for Fault Classification Under an Incremental Learning
    Lee, Subin
    Baek, Jun-Geol
    2019 22ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (IEEE CSE 2019) AND 17TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (IEEE EUC 2019), 2019, : 138 - 140
  • [48] Preliminary Exploration of Data Incremental Learning Method
    Yu Mengzhu
    Ding Mingyue
    Xi Zihan
    Huang Tao
    MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
  • [49] An online incremental orthogonal component analysis method for dimensionality reduction
    Zhu, Tao
    Xu, Ye
    Shen, Furao
    Zhao, Jinxi
    NEURAL NETWORKS, 2017, 85 : 33 - 50
  • [50] Topology Learning Embedding: A Fast and Incremental Method for Manifold Learning
    Zhu, Tao
    Shen, Furao
    Zhao, Jinxi
    Liang, Yu
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 43 - 52