Knowledge of opposite actions for reinforcement learning

Cited by: 18

Authors:

Shokri, Maryam [1]

Affiliations:

[1] Univ Waterloo Alumni, Waterloo, ON N2L 3G1, Canada

Source:

APPLIED SOFT COMPUTING | 2011 / Vol. 11 / Issue 06

Keywords:

Reinforcement learning; Q(lambda); Opposite action; Opposition-based learning (OBL); OQ(lambda) algorithm; NOQ(lambda) algorithm; Opposition weight

DOI:

10.1016/j.asoc.2011.01.045

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Reinforcement learning (RL) is a machine intelligence technique with several characteristics that make it suitable for solving real-world problems. However, RL agents face a very large state space in many applications and must take actions in every state many times to find the optimal policy. In this work, a special type of knowledge about actions is employed to improve the performance of off-policy, incremental, model-free reinforcement learning with discrete state and action spaces. One of the components of an RL agent is the action. For each action, an associated opposite action is defined. Actions and opposite actions are used within the reinforcement learning framework to update the value function, resulting in faster convergence. The effects of opposite actions on several reinforcement learning algorithms are investigated. (C) 2011 Elsevier B.V. All rights reserved.
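
The idea summarized in the abstract can be illustrated with a minimal sketch, which is not the paper's OQ(lambda) or NOQ(lambda) algorithm. It shows plain tabular Q-learning in which each step also updates the value of the opposite action. Everything here is a hypothetical illustration: the corridor environment, the `OPPOSITE` map, the `step` and `train` names, and the hyperparameters are assumptions. In particular, the sketch obtains the opposite action's outcome by querying the toy simulator; the abstract does not say how the paper obtains opposite rewards or next states.

```python
import random

# Hypothetical 1-D corridor: states 0..N_STATES-1, goal at the right end.
N_STATES = 6
ACTIONS = (-1, +1)            # move left / move right
OPPOSITE = {-1: +1, +1: -1}   # assumed opposite-action map for this toy task


def step(state, action):
    """Deterministic toy environment (an assumption, not from the paper)."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    reward = 1.0 if nxt == N_STATES - 1 else -0.1
    return nxt, reward


def train(episodes=200, alpha=0.5, gamma=0.9, epsilon=0.1,
          use_opposite=True, seed=0):
    """Tabular Q-learning with an optional extra update for the opposite action."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    for _ in range(episodes):
        s = 0
        while s != N_STATES - 1:
            # Epsilon-greedy action selection.
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda act: q[(s, act)])
            s2, r = step(s, a)
            # Standard off-policy Q-learning update for the action taken.
            target = r + gamma * max(q[(s2, b)] for b in ACTIONS)
            q[(s, a)] += alpha * (target - q[(s, a)])
            if use_opposite:
                # Extra update for the opposite action. ASSUMPTION: the toy
                # simulator supplies the opposite reward and next state.
                oa = OPPOSITE[a]
                os2, o_r = step(s, oa)
                o_target = o_r + gamma * max(q[(os2, b)] for b in ACTIONS)
                q[(s, oa)] += alpha * (o_target - q[(s, oa)])
            s = s2
    return q


if __name__ == "__main__":
    q = train()
    greedy = [max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(N_STATES - 1)]
    print(greedy)
```

With `use_opposite=True`, every visit to a state refreshes both action values, which is one way to read the abstract's claim that opposite actions speed up value-function updates. Setting `use_opposite=False` gives ordinary Q-learning for comparison.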


Pages: 4097-4109

Number of pages: 13
