Feature Extraction in Q-Learning using Neural Networks

被引:0
作者
Zhu, Henghui [1 ]
Paschalidis, Ioannis Ch. [2 ,3 ,4 ]
Hasselmo, Michael E. [5 ]
机构
[1] Boston Univ, Ctr Informat & Syst Engn, Boston, MA 02215 USA
[2] Boston Univ, Dept Elect & Comp Engn, 8 St Marys St, Boston, MA 02215 USA
[3] Boston Univ, Div Syst Engn, 8 St Marys St, Boston, MA 02215 USA
[4] Boston Univ, Dept Biomed Engn, 8 St Marys St, Boston, MA 02215 USA
[5] Boston Univ, Ctr Syst Neurosci, Boston, MA 02215 USA
来源
2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC) | 2017年
关键词
Q-learning; reinforcement learning; Markov decision processes; neural networks;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Integrating deep neural networks with reinforcement learning has exhibited excellent performance in the literature, highlighting the ability of neural networks to extract features. This paper begins with a simple Markov decision process inspired from a cognitive task. We show that Q-learning, and approximate Q-learning using a linear function approximation fail in this task. Instead, we show that Q-learning combined with a neural network-based function approximator can learn the optimal policy. Motivated by this finding, we outline procedures that allow the use of a neural network to extract appropriate features, which can then be used in a Q-learning framework with a linear function approximation, obtaining performance similar to that observed using Q-learning with neural networks. Our work suggests that neural networks can be used as feature extractors in the context of Q-learning.
引用
收藏
页数:6
相关论文
共 50 条
[21]   Feature Extraction and Classification of Learners Using Neural Networks [J].
Hayashida, Tomohiro ;
Yamamoto, Toru ;
Wakitani, Shin ;
Kinoshita, Takuya ;
Nishizaki, Ichiro ;
Sekizaki, Shinya ;
Tanimoto, Yusukc .
2019 IEEE FRONTIERS IN EDUCATION CONFERENCE (FIE 2019), 2019,
[22]   Feature extraction and classification of learners using neural networks [J].
Hayashida, Tomohiro ;
Yamamoto, Toru ;
Wakitani, Shin ;
Nishizaki, Ichiro ;
Sekizaki, Shinya ;
Tanimoto, Yusuke .
2018 IEEE FRONTIERS IN EDUCATION CONFERENCE (FIE), 2018,
[23]   Using Q-Learning for OLTC Voltage Regulation in PV-Rich Distribution Networks [J].
Custodio, Guilherme ;
Ochoa, Luis F. ;
Trindade, F. C. L. ;
Alpcan, Tansu .
2020 INTERNATIONAL CONFERENCE ON SMART GRIDS AND ENERGY SYSTEMS (SGES 2020), 2020, :482-487
[24]   Smart home's wireless sensor networks lifetime optimizing using Q-learning [J].
Jrhilifa, Ismael ;
Ouadi, Hamid ;
Jilbab, Abdelilah .
IECON 2021 - 47TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2021,
[25]   New algorithms of the Q-learning type [J].
Bhatnagar, Shalabh ;
Babu, K. Mohan .
AUTOMATICA, 2008, 44 (04) :1111-1119
[26]   CVaR Q-Learning [J].
Stanko, Silvestr ;
Macek, Karel .
COMPUTATIONAL INTELLIGENCE: 11th International Joint Conference, IJCCI 2019, Vienna, Austria, September 17-19, 2019, Revised Selected Papers, 2021, 922 :333-358
[27]   Periodic Q-Learning [J].
Lee, Donghwan ;
He, Niao .
LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 :582-598
[28]   Mobile Robot Navigation: Neural Q-Learning [J].
Yun, Soh Chin ;
Parasuraman, S. ;
Ganapathy, V. .
ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 3, 2013, 178 :259-+
[29]   Q-learning based on neural network in learning action selection of mobile robot [J].
Qiao, Junfei ;
Hou, Zhanjun ;
Ruan, Xiaogang .
2007 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS, VOLS 1-6, 2007, :263-267
[30]   rl4dtn: Q-Learning for Opportunistic Networks [J].
Visca, Jorge ;
Baliosian, Javier .
FUTURE INTERNET, 2022, 14 (12)