Knowledge of opposite actions for reinforcement learning

被引：18

作者：

Shokri, Maryam ^{[1
]}

机构：

[1] Univ Waterloo Alumni, Waterloo, ON N2L 3G1, Canada

来源：

APPLIED SOFT COMPUTING | 2011年 / 11卷 / 06期

关键词：

Reinforcement learning; Q(lambda); Opposite action; Opposition-based learning (OBL); OQ(lambda) algorithm; NOQ(lambda) algorithm; Opposition weight;

D O I：

10.1016/j.asoc.2011.01.045

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning (RL) is one of the machine intelligence techniques with several characteristics that make it suitable for solving real-world problems. However, RL agents generally face a very large state space in many applications. They must take actions in every state many times to find the optimal policy. In this work, a special type of knowledge about actions is employed to improve the performance of the off-policy, incremental, and model-free reinforcement learning with discrete state and action space. One of the components of RL agent is the action. For each action, its associate opposite action is defined. The actions and opposite actions are implemented in the framework of reinforcement learning to update the value function resulting in a faster convergence. The effects of opposite action on some of the reinforcement learning algorithms are investigated. (C) 2011 Elsevier B.V. All rights reserved.

引用

页码：4097 / 4109

页数：13

共 50 条

[1] Learning to Transform Service Instructions into Actions with Reinforcement Learning and Knowledge Base
Zhang M.-Y.
Tian G.-H.
Li C.-C.
Gong J.
International Journal of Automation and Computing, 2018, 15 (5) : 582 - 592
[2] Reinforcement Learning with Multiple Actions
Nishiyama, Riku
Yamada, Satoshi
PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNOLOGIES AND ENGINEERING SYSTEMS (ICITES2014), 2016, 345 : 207 - 213
[3] Reinforcement learning with factored states and actions
Sallans, B
Hinton, GE
JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 1063 - 1088
[4] Using Combination of Actions in Reinforcement Learning
Karanik, Marcelo J.
Gramajo, Sergio D.
JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2010, 10 (01): : 19 - 23
[5] Learning More Complex Actions with Deep Reinforcement Learning
Wang, Chenxi
Du, Youtian
Xie, Shengyuan
Lu, Yongdi
2021 FIFTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2021), 2021, : 121 - 122
[6] Reusability and Transferability of Macro Actions for Reinforcement Learning
Chang Y.-H.
Chang K.-Y.
Kuo H.
Lee C.-Y.
ACM Transactions on Evolutionary Learning and Optimization, 2022, 2 (01):
[7] Pyramid Representations of the Set of Actions in Reinforcement Learning
Iglesias, R.
Alvarez-Santos, V.
Rodriguez, M. A.
Santos-Saavedra, D.
Regueiro, C. V.
Pardo, X. M.
BIOINSPIRED COMPUTATION IN ARTIFICIAL SYSTEMS, PT II, 2015, 9108 : 203 - 212
[8] Biologically plausible reinforcement learning of continuous actions
Jaldert O Rombouts
Pieter R Roelfsema
Sander M Bohte
BMC Neuroscience, 14 (Suppl 1)
[9] An Efficient Reinforcement Learning Algorithm for Continuous Actions
Fu Bo
Chen Xin
He Yong
Wu Min
2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 80 - 85
[10] Learning Continuous Control Actions for Robotic Grasping with Reinforcement Learning
Shahid, Asad Ali
Roveda, Loris
Piga, Dario
Braghin, Francesco
2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 4066 - 4072

← 1 2 3 4 5 →