SIMULTANEOUS FEATURE SELECTION AND PARAMETER OPTIMIZATION FOR TRAINING OF DIALOG POLICY BY REINFORCEMENT LEARNING

Cited by: 0

Authors:

Misu, Teruhisa [1]

Kashioka, Hideki [1]

Affiliations:

[1] Natl Inst Informat & Commun Technol NICT, Kyoto, Japan

Source:

2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012) | 2012

Keywords:

Spoken dialog systems; Dialog management; Reinforcement learning; Feature selection;

DOI:

Not available

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification codes:

0808 ; 0809 ;

Abstract:

This paper addresses the problem of feature selection in the reinforcement learning (RL) of dialog policies for spoken dialog systems. A statistical dialog manager selects system actions based on features derived from the current dialog state and/or the system's belief state. When defining the features used to train the dialog policy, however, it is not obvious how to find a set of actually effective features among the potentially useful ones. In addition, the selection should be done simultaneously with the optimization of the dialog policy. We propose an incremental feature selection method for the optimization of a dialog policy by RL, in which improvement of the dialog policy and feature selection are conducted simultaneously. Experiments in dialog policy optimization by RL with a user simulator demonstrated that 1) the proposed method can find a better dialog policy with fewer policy iterations and 2) its learning speed is comparable to the case where feature selection is conducted in advance.


Pages: 1 / 6

Page count: 6


