Feature Extraction in Q-Learning using Neural Networks

被引:0
|
作者
Zhu, Henghui [1 ]
Paschalidis, Ioannis Ch. [2 ,3 ,4 ]
Hasselmo, Michael E. [5 ]
机构
[1] Boston Univ, Ctr Informat & Syst Engn, Boston, MA 02215 USA
[2] Boston Univ, Dept Elect & Comp Engn, 8 St Marys St, Boston, MA 02215 USA
[3] Boston Univ, Div Syst Engn, 8 St Marys St, Boston, MA 02215 USA
[4] Boston Univ, Dept Biomed Engn, 8 St Marys St, Boston, MA 02215 USA
[5] Boston Univ, Ctr Syst Neurosci, Boston, MA 02215 USA
来源
2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC) | 2017年
关键词
Q-learning; reinforcement learning; Markov decision processes; neural networks;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Integrating deep neural networks with reinforcement learning has exhibited excellent performance in the literature, highlighting the ability of neural networks to extract features. This paper begins with a simple Markov decision process inspired from a cognitive task. We show that Q-learning, and approximate Q-learning using a linear function approximation fail in this task. Instead, we show that Q-learning combined with a neural network-based function approximator can learn the optimal policy. Motivated by this finding, we outline procedures that allow the use of a neural network to extract appropriate features, which can then be used in a Q-learning framework with a linear function approximation, obtaining performance similar to that observed using Q-learning with neural networks. Our work suggests that neural networks can be used as feature extractors in the context of Q-learning.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Learning to control an inverted pendulum using Q-learning and neural networks
    Jiang, Guofei
    Wu, Cangpu
    Zidonghua Xuebao/Acta Automatica Sinica, 1998, 24 (05): : 662 - 666
  • [2] Neural Q-learning
    Stephan ten Hagen
    Ben Kröse
    Neural Computing & Applications, 2003, 12 : 81 - 88
  • [3] Neural Q-learning
    ten Hagen, S
    Kröse, B
    NEURAL COMPUTING & APPLICATIONS, 2003, 12 (02): : 81 - 88
  • [4] Hyperparameter optimization of neural networks based on Q-learning
    Xin Qi
    Bing Xu
    Signal, Image and Video Processing, 2023, 17 : 1669 - 1676
  • [5] Hyperparameter optimization of neural networks based on Q-learning
    Qi, Xin
    Xu, Bing
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1669 - 1676
  • [6] Software Defect Prediction Using Deep Q-Learning Network-Based Feature Extraction
    Zhang, Qinhe
    Zhang, Jiachen
    Feng, Tie
    Xue, Jialang
    Zhu, Xinxin
    Zhu, Ningyang
    Li, Zhiheng
    IET SOFTWARE, 2024, 2024
  • [7] Reinforcement Q-Learning and Neural Networks to Acquire Negotiation Behaviors
    Chohra, Amine
    Madani, Kurosh
    Kanzari, Dalel
    NEW CHALLENGES IN APPLIED INTELLIGENCE TECHNOLOGIES, 2008, 134 : 23 - 33
  • [8] QLP: Deep Q-Learning for Pruning Deep Neural Networks
    Camci, Efe
    Gupta, Manas
    Wu, Min
    Lin, Jie
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (10) : 6488 - 6501
  • [9] Expression of Continuous State and Action Spaces for Q-Learning Using Neural Networks and CMAC
    Yamada, Kazuaki
    JOURNAL OF ROBOTICS AND MECHATRONICS, 2012, 24 (02) : 330 - 339
  • [10] Mobile robot navigation using neural Q-learning
    Yang, GS
    Chen, EK
    An, CW
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 48 - 52