SIMULTANEOUS FEATURE SELECTION AND PARAMETER OPTIMIZATION FOR TRAINING OF DIALOG POLICY BY REINFORCEMENT LEARNING

被引:0
|
作者
Misu, Teruhisa [1 ]
Kashioka, Hideki [1 ]
机构
[1] Natl Inst Informat & Commun Technol NICT, Kyoto, Japan
来源
2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012) | 2012年
关键词
Spoken dialog systems; Dialog management; Reinforcement learning; Feature selection;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses the problem of feature selection in the reinforcement learning (RL) of the dialog policies of spoken dialog systems. A statistical dialog manager selects the system actions the system should take based on the features derived from the current dialog state and/or the system's belief state. When defining the features used by the system for training the dialog policy, however, finding a set of actually effective features from potentially useful ones is not obvious. In addition, the selection should be done simultaneously with the optimization of the dialog policy. In this paper, we propose an incremental feature selection method for the optimization of a dialog policy by RL, in which improvement of the dialog policy and the feature selection are conducted simultaneously. Experiments in dialog policy optimization by RL with a user simulator demonstrated the following: 1) that the proposed method can find a better dialog policy with fewer policy iterations and 2) the learning speed is comparable with the case where feature selection is conducted in advance.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
  • [1] Reinforcement Learning for Dialog Management using Least-Squares Policy Iteration and Fast Feature Selection
    Li, Lihong
    Williams, Jason D.
    Balakrishnan, Suhrid
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2447 - +
  • [2] Regularized feature selection in reinforcement learning
    Dean S. Wookey
    George D. Konidaris
    Machine Learning, 2015, 100 : 655 - 676
  • [3] Regularized feature selection in reinforcement learning
    Wookey, Dean S.
    Konidaris, George D.
    MACHINE LEARNING, 2015, 100 (2-3) : 655 - 676
  • [4] Reinforcement learning guided auto-select optimization algorithm for feature selection
    Zhang, Hongbo
    Yue, Xiaofeng
    Gao, Xueliang
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 268
  • [5] EMBEDDED INCREMENTAL FEATURE SELECTION FOR REINFORCEMENT LEARNING
    Wright, Robert
    Loscalzo, Steven
    Yu, Lei
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2011, : 263 - 268
  • [6] Object tracking: Feature selection by reinforcement learning
    Deng, Jiali
    Gong, Haigang
    Liu, Minghui
    Liu, Ming
    INTERNATIONAL CONFERENCE ON COMPUTER VISION, APPLICATION, AND DESIGN (CVAD 2021), 2021, 12155
  • [7] Automated Feature Selection: A Reinforcement Learning Perspective
    Liu, Kunpeng
    Fu, Yanjie
    Wu, Le
    Li, Xiaolin
    Aggarwal, Charu
    Xiong, Hui
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (03) : 2272 - 2284
  • [8] Feature Selection and Feature Learning for High-dimensional Batch Reinforcement Learning: A Survey
    Liu, De-Rong
    Li, Hong-Liang
    Wang, Ding
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2015, 12 (03) : 229 - 242
  • [9] Sample Aware Embedded Feature Selection for Reinforcement Learning
    Loscalzo, Steven
    Wright, Robert
    Acunto, Kevin
    Yu, Lei
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2012, : 887 - 894
  • [10] QBSO-FS: A Reinforcement Learning Based Bee Swarm Optimization Metaheuristic for Feature Selection
    Sadeg, Souhila
    Hamdad, Leila
    Remache, Amine Riad
    Karech, Mehdi Nedjmeddine
    Benatchba, Karima
    Habbas, Zineb
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2019, PT II, 2019, 11507 : 785 - 796