Knowledge of opposite actions for reinforcement learning

Cited by: 18
Author
Shokri, Maryam [1]
Affiliation
[1] Univ Waterloo Alumni, Waterloo, ON N2L 3G1, Canada
Keywords
Reinforcement learning; Q(lambda); Opposite action; Opposition-based learning (OBL); OQ(lambda) algorithm; NOQ(lambda) algorithm; Opposition weight
DOI
10.1016/j.asoc.2011.01.045
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Reinforcement learning (RL) is a machine intelligence technique with several characteristics that make it suitable for solving real-world problems. However, RL agents face a very large state space in many applications and must take actions in every state many times to find the optimal policy. In this work, a special type of knowledge about actions is employed to improve the performance of off-policy, incremental, model-free reinforcement learning with discrete state and action spaces. One of the components of an RL agent is the action. For each action, its associated opposite action is defined. Actions and opposite actions are incorporated into the reinforcement learning framework to update the value function, resulting in faster convergence. The effects of opposite actions on several reinforcement learning algorithms are investigated. (C) 2011 Elsevier B.V. All rights reserved.
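The idea in the abstract, updating the value function for both an action and its opposite, can be illustrated with a minimal tabular Q-learning sketch. This is not the paper's OQ(lambda) or NOQ(lambda) algorithm, which this record does not reproduce. The corridor environment, the `OPPOSITE` map, the reward scheme, and the use of `step` to evaluate the opposite action without executing it are all assumptions made only for illustration.

```python
import random

# Hypothetical 1-D corridor: states 0..N_STATES-1, goal at the right end.
N_STATES = 6
ACTIONS = ("left", "right")
# Assumed opposite-action map; the paper defines opposites per action.
OPPOSITE = {"left": "right", "right": "left"}


def step(state, action):
    """Deterministic move; +1 for stepping right (toward the goal), -1 for left."""
    if action == "right":
        nxt = min(state + 1, N_STATES - 1)
    else:
        nxt = max(state - 1, 0)
    reward = 1.0 if action == "right" else -1.0
    return nxt, reward


def train(episodes=50, alpha=0.5, gamma=0.9, epsilon=0.2,
          use_opposite=True, seed=0):
    """Tabular Q-learning with an optional extra update for the opposite action."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    for _ in range(episodes):
        s = 0
        while s != N_STATES - 1:
            # Epsilon-greedy action selection.
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)
            else:
                a = max(ACTIONS, key=lambda x: q[(s, x)])
            s_next, r = step(s, a)
            # Standard off-policy (Q-learning) update for the action taken.
            target = r + gamma * max(q[(s_next, b)] for b in ACTIONS)
            q[(s, a)] += alpha * (target - q[(s, a)])
            if use_opposite:
                # Assumption: the opposite action's outcome is known without
                # executing it, so a second Q-value is updated in the same step.
                opp = OPPOSITE[a]
                s_opp, r_opp = step(s, opp)
                opp_target = r_opp + gamma * max(q[(s_opp, b)] for b in ACTIONS)
                q[(s, opp)] += alpha * (opp_target - q[(s, opp)])
            s = s_next
    return q


q_values = train()
greedy = [max(ACTIONS, key=lambda a: q_values[(s, a)]) for s in range(N_STATES - 1)]
print(greedy)
```

In this sketch each interaction updates two entries of the Q-table instead of one, which is the intuition behind the faster convergence the abstract reports; how the opposite reward and next state are obtained in practice is the part the simplification above glosses over.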
Pages: 4097 - 4109
Page count: 13
Related papers
50 in total
  • [31] Continuous control actions learning and adaptation for robotic manipulation through reinforcement learning
    Asad Ali Shahid
    Dario Piga
    Francesco Braghin
    Loris Roveda
    Autonomous Robots, 2022, 46 : 483 - 498
  • [32] Application of Reinforcement Learning in Controlling Quadrotor UAV Flight Actions
    Shen, Shang-En
    Huang, Yi-Cheng
    DRONES, 2024, 8 (11)
  • [33] Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning
    Chunduri, Pramod
    Bang, Jaeho
    Lu, Yao
    Arulraj, Joy
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 545 - 558
  • [34] Generalized Reinforcement Learning with Concept-Driven Abstract Actions
    Chiu, Po-Hsiang
    Huber, Manfred
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 2575 - 2582
  • [35] Reinforcement learning for high-dimensional problems with symmetrical actions
    Kamal, MAS
    Murata, J
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 6192 - 6197
  • [36] Intrinsically-motivated reinforcement learning for control with continuous actions
    de Abril, Ildefons Magrans
    Kanai, Ryota
    2017 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATICS AND BIOMEDICAL SCIENCES (ICIIBMS), 2017, : 212 - 213
  • [37] Rethinking Stochasticity in Neural Networks for Reinforcement Learning with Continuous Actions
    Shah, Syed Naveed Hussain
    Hougen, Dean Frederick
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 488 - 496
  • [38] Learning path recommendation based on knowledge tracing and reinforcement learning
    Wan, Han
    Che, Baoliang
    Luo, Hongzhen
    Luo, Xiaoyan
    2023 IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, ICALT, 2023, : 55 - 57
  • [39] Combining Deep Reinforcement Learning with Prior Knowledge and Reasoning
    Bougie, Nicolas
    Cheng, Li Kai
    Ichise, Ryutaro
    APPLIED COMPUTING REVIEW, 2018, 18 (02): : 33 - 45
  • [40] Correcting flawed expert knowledge through reinforcement learning
    Aihe, David O.
    Gonzalez, Avelino J.
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (17-18) : 6457 - 6471