Feature Extraction in Q-Learning using Neural Networks

Cited by: 0

Authors:

Zhu, Henghui [1]

Paschalidis, Ioannis Ch. [2,3,4]

Hasselmo, Michael E. [5]

Affiliations:

[1] Boston Univ, Ctr Informat & Syst Engn, Boston, MA 02215 USA

[2] Boston Univ, Dept Elect & Comp Engn, 8 St Marys St, Boston, MA 02215 USA

[3] Boston Univ, Div Syst Engn, 8 St Marys St, Boston, MA 02215 USA

[4] Boston Univ, Dept Biomed Engn, 8 St Marys St, Boston, MA 02215 USA

[5] Boston Univ, Ctr Syst Neurosci, Boston, MA 02215 USA

Source:

2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC) | 2017

Keywords:

Q-learning; reinforcement learning; Markov decision processes; neural networks

DOI:

Not available

Chinese Library Classification number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Integrating deep neural networks with reinforcement learning has exhibited excellent performance in the literature, highlighting the ability of neural networks to extract features. This paper begins with a simple Markov decision process inspired by a cognitive task. We show that both Q-learning and approximate Q-learning with linear function approximation fail in this task. Instead, we show that Q-learning combined with a neural network-based function approximator can learn the optimal policy. Motivated by this finding, we outline procedures that allow the use of a neural network to extract appropriate features, which can then be used in a Q-learning framework with linear function approximation, obtaining performance similar to that observed using Q-learning with neural networks. Our work suggests that neural networks can be used as feature extractors in the context of Q-learning.
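As an illustrative sketch only (not the authors' code), Q-learning with linear function approximation can be written as Q(s, a) = w · φ(s, a), with w updated by the standard semi-gradient Q-learning rule. The feature map `phi`, the step size `alpha`, the discount `gamma`, and the two-state toy transitions below are all assumptions made for illustration. In the setting the abstract describes, the features would come from a trained neural network rather than the one-hot encoding used here.

```python
import numpy as np

def phi(state, action, n_states, n_actions):
    """Hypothetical one-hot feature map over (state, action) pairs."""
    x = np.zeros(n_states * n_actions)
    x[state * n_actions + action] = 1.0
    return x

def q_value(w, state, action, n_states, n_actions):
    """Linear action-value estimate Q(s, a) = w . phi(s, a)."""
    return float(w @ phi(state, action, n_states, n_actions))

def q_update(w, s, a, r, s_next, done, n_states, n_actions,
             alpha=0.5, gamma=0.9):
    """One semi-gradient Q-learning step; returns the updated weights."""
    if done:
        best_next = 0.0  # no bootstrapping from a terminal transition
    else:
        best_next = max(q_value(w, s_next, b, n_states, n_actions)
                        for b in range(n_actions))
    td_error = r + gamma * best_next - q_value(w, s, a, n_states, n_actions)
    return w + alpha * td_error * phi(s, a, n_states, n_actions)

# Toy usage (assumed transitions): 2 states, 2 actions.
w = np.zeros(2 * 2)
w = q_update(w, s=1, a=0, r=1.0, s_next=0, done=True,
             n_states=2, n_actions=2)
w = q_update(w, s=0, a=1, r=0.0, s_next=1, done=False,
             n_states=2, n_actions=2)
```

Replacing `phi` with the activations of a hidden layer of a trained network is one way to read the feature-extraction idea in the abstract; the abstract itself does not specify the procedure.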


Pages: 6

50 records in total

[21] Feature Extraction and Classification of Learners Using Neural Networks [J].

Hayashida, Tomohiro ;

Yamamoto, Toru ;

Wakitani, Shin ;

Kinoshita, Takuya ;

Nishizaki, Ichiro ;

Sekizaki, Shinya ;

Tanimoto, Yusuke .

2019 IEEE FRONTIERS IN EDUCATION CONFERENCE (FIE 2019), 2019,

[22] Feature extraction and classification of learners using neural networks [J].

Hayashida, Tomohiro ;

Yamamoto, Toru ;

Wakitani, Shin ;

Nishizaki, Ichiro ;

Sekizaki, Shinya ;

Tanimoto, Yusuke .

2018 IEEE FRONTIERS IN EDUCATION CONFERENCE (FIE), 2018,

[23] Using Q-Learning for OLTC Voltage Regulation in PV-Rich Distribution Networks [J].

Custodio, Guilherme ;

Ochoa, Luis F. ;

Trindade, F. C. L. ;

Alpcan, Tansu .

2020 INTERNATIONAL CONFERENCE ON SMART GRIDS AND ENERGY SYSTEMS (SGES 2020), 2020, :482-487

[24] Smart home's wireless sensor networks lifetime optimizing using Q-learning [J].

Jrhilifa, Ismael ;

Ouadi, Hamid ;

Jilbab, Abdelilah .

IECON 2021 - 47TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2021,

[25] New algorithms of the Q-learning type [J].

Bhatnagar, Shalabh ;

Babu, K. Mohan .

AUTOMATICA, 2008, 44 (04) :1111-1119

[26] CVaR Q-Learning [J].

Stanko, Silvestr ;

Macek, Karel .

COMPUTATIONAL INTELLIGENCE: 11th International Joint Conference, IJCCI 2019, Vienna, Austria, September 17-19, 2019, Revised Selected Papers, 2021, 922 :333-358

[27] Periodic Q-Learning [J].

Lee, Donghwan ;

He, Niao .

LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 :582-598

[28] Mobile Robot Navigation: Neural Q-Learning [J].

Yun, Soh Chin ;

Parasuraman, S. ;

Ganapathy, V. .

ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 3, 2013, 178 :259-+

[29] Q-learning based on neural network in learning action selection of mobile robot [J].

Qiao, Junfei ;

Hou, Zhanjun ;

Ruan, Xiaogang .

2007 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS, VOLS 1-6, 2007, :263-267

[30] rl4dtn: Q-Learning for Opportunistic Networks [J].

Visca, Jorge ;

Baliosian, Javier .

FUTURE INTERNET, 2022, 14 (12)
