SIMULTANEOUS FEATURE SELECTION AND PARAMETER OPTIMIZATION FOR TRAINING OF DIALOG POLICY BY REINFORCEMENT LEARNING

被引:0
作者
Misu, Teruhisa [1 ]
Kashioka, Hideki [1 ]
机构
[1] Natl Inst Informat & Commun Technol NICT, Kyoto, Japan
来源
2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012) | 2012年
关键词
Spoken dialog systems; Dialog management; Reinforcement learning; Feature selection;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses the problem of feature selection in the reinforcement learning (RL) of the dialog policies of spoken dialog systems. A statistical dialog manager selects the system actions the system should take based on the features derived from the current dialog state and/or the system's belief state. When defining the features used by the system for training the dialog policy, however, finding a set of actually effective features from potentially useful ones is not obvious. In addition, the selection should be done simultaneously with the optimization of the dialog policy. In this paper, we propose an incremental feature selection method for the optimization of a dialog policy by RL, in which improvement of the dialog policy and the feature selection are conducted simultaneously. Experiments in dialog policy optimization by RL with a user simulator demonstrated the following: 1) that the proposed method can find a better dialog policy with fewer policy iterations and 2) the learning speed is comparable with the case where feature selection is conducted in advance.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
[41]   Feature Selection and SVM Parameter Synchronous Optimization Based on a Hybrid Intelligent Optimization Algorithm [J].
Wang, Qingjun ;
Mu, Zhendong .
HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2023, 13
[42]   A reinforcement learning approach to parameter selection for distributed optimal power flow [J].
Zeng, Sihan ;
Kody, Alyssa ;
Kim, Youngdae ;
Kim, Kibaek ;
Molzahn, Daniel K. .
ELECTRIC POWER SYSTEMS RESEARCH, 2022, 212
[43]   Reinforcement learning methods for network-based transfer parameter selection [J].
Guo, Yue ;
Wang, Yu ;
Yang, I-Hsuan ;
Sycara, Katia .
INTELLIGENCE & ROBOTICS, 2023, 3 (03) :402-419
[44]   Feature Selection and SVM Parameter Synchronous Optimization Based on a Hybrid Intelligent Optimization Algorithm [J].
Wang, Qingjun ;
Mu, Zhendong .
HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2023, 13
[45]   Interpretable policy derivation for reinforcement learning based on evolutionary feature synthesis [J].
Zhang, Hengzhe ;
Zhou, Aimin ;
Lin, Xin .
COMPLEX & INTELLIGENT SYSTEMS, 2020, 6 (03) :741-753
[46]   Interpretable policy derivation for reinforcement learning based on evolutionary feature synthesis [J].
Hengzhe Zhang ;
Aimin Zhou ;
Xin Lin .
Complex & Intelligent Systems, 2020, 6 :741-753
[47]   Reinforcement learning-based multi-objective differential evolution algorithm for feature selection [J].
Yu, Xiaobing ;
Hu, Zhengpeng ;
Luo, Wenguan ;
Xue, Yu .
INFORMATION SCIENCES, 2024, 661
[48]   Intelligent Feature Selection for ECG-Based Personal Authentication Using Deep Reinforcement Learning [J].
Baek, Suwhan ;
Kim, Juhyeong ;
Yu, Hyunsoo ;
Yang, Geunbo ;
Sohn, Illsoo ;
Cho, Youngho ;
Park, Cheolsoo .
SENSORS, 2023, 23 (03)
[49]   Simultaneous Feature Selection and Support Vector Machine Optimization Using the Grasshopper Optimization Algorithm [J].
Ibrahim Aljarah ;
Ala’ M. Al-Zoubi ;
Hossam Faris ;
Mohammad A. Hassonah ;
Seyedali Mirjalili ;
Heba Saadeh .
Cognitive Computation, 2018, 10 :478-495
[50]   Simultaneous Feature Selection and Support Vector Machine Optimization Using the Grasshopper Optimization Algorithm [J].
Aljarah, Ibrahim ;
Al-Zoubi, Ala M. ;
Faris, Hossam ;
Hassonah, Mohammad A. ;
Mirjalili, Seyedali ;
Saadeh, Heba .
COGNITIVE COMPUTATION, 2018, 10 (03) :478-495