Knowledge of opposite actions for reinforcement learning

被引:18
|
作者
Shokri, Maryam [1 ]
机构
[1] Univ Waterloo Alumni, Waterloo, ON N2L 3G1, Canada
关键词
Reinforcement learning; Q(lambda); Opposite action; Opposition-based learning (OBL); OQ(lambda) algorithm; NOQ(lambda) algorithm; Opposition weight;
D O I
10.1016/j.asoc.2011.01.045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) is one of the machine intelligence techniques with several characteristics that make it suitable for solving real-world problems. However, RL agents generally face a very large state space in many applications. They must take actions in every state many times to find the optimal policy. In this work, a special type of knowledge about actions is employed to improve the performance of the off-policy, incremental, and model-free reinforcement learning with discrete state and action space. One of the components of RL agent is the action. For each action, its associate opposite action is defined. The actions and opposite actions are implemented in the framework of reinforcement learning to update the value function resulting in a faster convergence. The effects of opposite action on some of the reinforcement learning algorithms are investigated. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:4097 / 4109
页数:13
相关论文
共 50 条
  • [1] Learning to Transform Service Instructions into Actions with Reinforcement Learning and Knowledge Base
    Zhang M.-Y.
    Tian G.-H.
    Li C.-C.
    Gong J.
    International Journal of Automation and Computing, 2018, 15 (5) : 582 - 592
  • [2] Reinforcement Learning with Multiple Actions
    Nishiyama, Riku
    Yamada, Satoshi
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNOLOGIES AND ENGINEERING SYSTEMS (ICITES2014), 2016, 345 : 207 - 213
  • [3] Reinforcement learning with factored states and actions
    Sallans, B
    Hinton, GE
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 1063 - 1088
  • [4] Using Combination of Actions in Reinforcement Learning
    Karanik, Marcelo J.
    Gramajo, Sergio D.
    JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2010, 10 (01): : 19 - 23
  • [5] Learning More Complex Actions with Deep Reinforcement Learning
    Wang, Chenxi
    Du, Youtian
    Xie, Shengyuan
    Lu, Yongdi
    2021 FIFTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2021), 2021, : 121 - 122
  • [6] Reusability and Transferability of Macro Actions for Reinforcement Learning
    Chang Y.-H.
    Chang K.-Y.
    Kuo H.
    Lee C.-Y.
    ACM Transactions on Evolutionary Learning and Optimization, 2022, 2 (01):
  • [7] Pyramid Representations of the Set of Actions in Reinforcement Learning
    Iglesias, R.
    Alvarez-Santos, V.
    Rodriguez, M. A.
    Santos-Saavedra, D.
    Regueiro, C. V.
    Pardo, X. M.
    BIOINSPIRED COMPUTATION IN ARTIFICIAL SYSTEMS, PT II, 2015, 9108 : 203 - 212
  • [8] Biologically plausible reinforcement learning of continuous actions
    Jaldert O Rombouts
    Pieter R Roelfsema
    Sander M Bohte
    BMC Neuroscience, 14 (Suppl 1)
  • [9] An Efficient Reinforcement Learning Algorithm for Continuous Actions
    Fu Bo
    Chen Xin
    He Yong
    Wu Min
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 80 - 85
  • [10] Learning Continuous Control Actions for Robotic Grasping with Reinforcement Learning
    Shahid, Asad Ali
    Roveda, Loris
    Piga, Dario
    Braghin, Francesco
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 4066 - 4072