Feature Extraction in Q-Learning using Neural Networks

被引：0

作者：

Zhu, Henghui ^{[1
]}

Paschalidis, Ioannis Ch. ^{[2
,3
,4
]}

Hasselmo, Michael E. ^{[5
]}

机构：

[1] Boston Univ, Ctr Informat & Syst Engn, Boston, MA 02215 USA

[2] Boston Univ, Dept Elect & Comp Engn, 8 St Marys St, Boston, MA 02215 USA

[3] Boston Univ, Div Syst Engn, 8 St Marys St, Boston, MA 02215 USA

[4] Boston Univ, Dept Biomed Engn, 8 St Marys St, Boston, MA 02215 USA

[5] Boston Univ, Ctr Syst Neurosci, Boston, MA 02215 USA

来源：

2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC) | 2017年

关键词：

Q-learning; reinforcement learning; Markov decision processes; neural networks;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Integrating deep neural networks with reinforcement learning has exhibited excellent performance in the literature, highlighting the ability of neural networks to extract features. This paper begins with a simple Markov decision process inspired from a cognitive task. We show that Q-learning, and approximate Q-learning using a linear function approximation fail in this task. Instead, we show that Q-learning combined with a neural network-based function approximator can learn the optimal policy. Motivated by this finding, we outline procedures that allow the use of a neural network to extract appropriate features, which can then be used in a Q-learning framework with a linear function approximation, obtaining performance similar to that observed using Q-learning with neural networks. Our work suggests that neural networks can be used as feature extractors in the context of Q-learning.

引用

页数：6

共 50 条

[1] Learning to control an inverted pendulum using Q-learning and neural networks
Jiang, Guofei
Wu, Cangpu
Zidonghua Xuebao/Acta Automatica Sinica, 1998, 24 (05): : 662 - 666
[2] Neural Q-learning
Stephan ten Hagen
Ben Kröse
Neural Computing & Applications, 2003, 12 : 81 - 88
[3] Neural Q-learning
ten Hagen, S
Kröse, B
NEURAL COMPUTING & APPLICATIONS, 2003, 12 (02): : 81 - 88
[4] Hyperparameter optimization of neural networks based on Q-learning
Xin Qi
Bing Xu
Signal, Image and Video Processing, 2023, 17 : 1669 - 1676
[5] Hyperparameter optimization of neural networks based on Q-learning
Qi, Xin
Xu, Bing
SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1669 - 1676
[6] Software Defect Prediction Using Deep Q-Learning Network-Based Feature Extraction
Zhang, Qinhe
Zhang, Jiachen
Feng, Tie
Xue, Jialang
Zhu, Xinxin
Zhu, Ningyang
Li, Zhiheng
IET SOFTWARE, 2024, 2024
[7] Reinforcement Q-Learning and Neural Networks to Acquire Negotiation Behaviors
Chohra, Amine
Madani, Kurosh
Kanzari, Dalel
NEW CHALLENGES IN APPLIED INTELLIGENCE TECHNOLOGIES, 2008, 134 : 23 - 33
[8] QLP: Deep Q-Learning for Pruning Deep Neural Networks
Camci, Efe
Gupta, Manas
Wu, Min
Lin, Jie
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6488 - 6501
[9] Expression of Continuous State and Action Spaces for Q-Learning Using Neural Networks and CMAC
Yamada, Kazuaki
JOURNAL OF ROBOTICS AND MECHATRONICS, 2012, 24 (02) : 330 - 339
[10] Mobile robot navigation using neural Q-learning
Yang, GS
Chen, EK
An, CW
PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 48 - 52

← 1 2 3 4 5 →