Building Adaptive Dialogue Systems Via Bayes-Adaptive POMDPs

被引:7
|
作者
Png, Shaowei [1 ]
Pineau, Joelle [1 ]
Chaib-draa, Brahim [2 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ H3A 2A7, Canada
[2] Univ Laval, Dept Comp Sci, Quebec City, PQ G1V 0A6, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Dialogue management; reinforcement learning; Markov decision process (MDP); partially observable Markov decision process (POMDP); Bayesian inference; MARKOV-PROCESSES; ALGORITHMS; MODEL;
D O I
10.1109/JSTSP.2012.2229962
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent research has shown that effective dialogue management can be achieved through the Partially Observable Markov Decision Process (POMDP) framework. However past research on POMDP-based dialogue systems usually assumed the parameters of the decision process were known a priori. Themain contribution of this paper is to present a Bayesian reinforcement learning framework for learning the POMDP parameters online from data, in a decision-theoretic manner. We discuss various approximations and assumptions which can be leveraged to ensure computational tractability, and apply these techniques to learning observationmodels for several simulated spoken dialogue domains.
引用
收藏
页码:917 / 927
页数:11
相关论文
共 50 条
  • [1] Bayes-adaptive hierarchical MDPs
    Ngo Anh Vien
    Lee, SeungGwan
    Chung, TaeChoong
    APPLIED INTELLIGENCE, 2016, 45 (01) : 112 - 126
  • [2] Bayes-adaptive hierarchical MDPs
    Ngo Anh Vien
    SeungGwan Lee
    TaeChoong Chung
    Applied Intelligence, 2016, 45 : 112 - 126
  • [3] Scalable and efficient bayes-adaptive reinforcement learning based on Monte-Carlo tree search
    Guez, Arthur
    Silver, David
    Dayan, Peter
    1600, AI Access Foundation (48): : 841 - 883
  • [4] Contrastive Learning-Based Bayes-Adaptive Meta-Reinforcement Learning for Active Pantograph Control in High-Speed Railways
    Wang, Hui
    Han, Zhiwei
    Wang, Xufan
    Wu, Yanbo
    Liu, Zhigang
    IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2024, 10 (01): : 2045 - 2056
  • [5] An adaptive domain knowledge manager for dialogue systems
    Filipe, Porfirio
    Morgado, Luis
    Mamede, Nuno
    ICEIS 2007: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS: HUMAN-COMPUTER INTERACTION, 2007, : 45 - +
  • [6] A framework to develop context-aware adaptive dialogue systems
    Griol, David
    Callejas, Zoraida
    Lopez-Cozar, Ramon
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2012 - 2016
  • [7] Learning Adaptive Referring Expression Generation Policies for Spoken Dialogue Systems
    Janarthanam, Srinivasan
    Lemon, Oliver
    EMPIRICAL METHODS IN NATURAL LANGUAGE GENERATION: DATA-ORIENTED METHODS AND EMPIRICAL EVALUATION, 2010, 5790 : 67 - +
  • [8] Adaptive Naive Bayes method for masquerade detection
    Dash, Subrat Kumar
    Reddy, Krupa Sagar
    Pujari, Arun K.
    SECURITY AND COMMUNICATION NETWORKS, 2011, 4 (04) : 410 - 417
  • [9] A Variational Bayes Approach to Adaptive Radio Tomography
    Lee, Donghoon
    Giannakis, Georgios B.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 (68) : 3779 - 3792
  • [10] A Variational Bayes Framework for Sparse Adaptive Estimation
    Themelis, Konstantinos E.
    Rontogiannis, Athanasios A.
    Koutroumbas, Konstantinos D.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (18) : 4723 - 4736