SIMULTANEOUS FEATURE SELECTION AND PARAMETER OPTIMIZATION FOR TRAINING OF DIALOG POLICY BY REINFORCEMENT LEARNING

被引：0

作者：

Misu, Teruhisa ^{[1
]}

Kashioka, Hideki ^{[1
]}

机构：

[1] Natl Inst Informat & Commun Technol NICT, Kyoto, Japan

来源：

2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012) | 2012年

关键词：

Spoken dialog systems; Dialog management; Reinforcement learning; Feature selection;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper addresses the problem of feature selection in the reinforcement learning (RL) of the dialog policies of spoken dialog systems. A statistical dialog manager selects the system actions the system should take based on the features derived from the current dialog state and/or the system's belief state. When defining the features used by the system for training the dialog policy, however, finding a set of actually effective features from potentially useful ones is not obvious. In addition, the selection should be done simultaneously with the optimization of the dialog policy. In this paper, we propose an incremental feature selection method for the optimization of a dialog policy by RL, in which improvement of the dialog policy and the feature selection are conducted simultaneously. Experiments in dialog policy optimization by RL with a user simulator demonstrated the following: 1) that the proposed method can find a better dialog policy with fewer policy iterations and 2) the learning speed is comparable with the case where feature selection is conducted in advance.

引用

页码：1 / 6

页数：6

共 50 条

[1] Reinforcement Learning for Dialog Management using Least-Squares Policy Iteration and Fast Feature Selection [J].

Li, Lihong ;

Williams, Jason D. ;

Balakrishnan, Suhrid .

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, :2447-+

[2] Regularized feature selection in reinforcement learning [J].

Wookey, Dean S. ;

Konidaris, George D. .

MACHINE LEARNING, 2015, 100 (2-3) :655-676

[3] Regularized feature selection in reinforcement learning [J].

Dean S. Wookey ;

George D. Konidaris .

Machine Learning, 2015, 100 :655-676

[4] Reinforcement learning guided auto-select optimization algorithm for feature selection [J].

Zhang, Hongbo ;

Yue, Xiaofeng ;

Gao, Xueliang .

EXPERT SYSTEMS WITH APPLICATIONS, 2025, 268

[5] Automated Feature Selection: A Reinforcement Learning Perspective [J].

Liu, Kunpeng ;

Fu, Yanjie ;

Wu, Le ;

Li, Xiaolin ;

Aggarwal, Charu ;

Xiong, Hui .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (03) :2272-2284

[6] Object tracking: Feature selection by reinforcement learning [J].

Deng, Jiali ;

Gong, Haigang ;

Liu, Minghui ;

Liu, Ming .

INTERNATIONAL CONFERENCE ON COMPUTER VISION, APPLICATION, AND DESIGN (CVAD 2021), 2021, 12155

[7] EMBEDDED INCREMENTAL FEATURE SELECTION FOR REINFORCEMENT LEARNING [J].

Wright, Robert ;

Loscalzo, Steven ;

Yu, Lei .

ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2011, :263-268

[8] Sample Aware Embedded Feature Selection for Reinforcement Learning [J].

Loscalzo, Steven ;

Wright, Robert ;

Acunto, Kevin ;

Yu, Lei .

PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2012, :887-894

[9] Feature Selection and Feature Learning for High-dimensional Batch Reinforcement Learning: A Survey [J].

Liu, De-Rong ;

Li, Hong-Liang ;

Wang, Ding .

INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2015, 12 (03) :229-242

[10] QBSO-FS: A Reinforcement Learning Based Bee Swarm Optimization Metaheuristic for Feature Selection [J].

Sadeg, Souhila ;

Hamdad, Leila ;

Remache, Amine Riad ;

Karech, Mehdi Nedjmeddine ;

Benatchba, Karima ;

Habbas, Zineb .

ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2019, PT II, 2019, 11507 :785-796

← 1 2 3 4 5 →