A Multi-Step Reinforcement Learning Algorithm

被引:1
|
作者
Zhang, Zhicong [1 ]
Hu, Kaishun [1 ]
Huang, Huiyu [1 ]
Li, Shuai [1 ]
Zhao, Shaoyong [1 ]
机构
[1] Dongguan Univ Technol, Dept Ind Engn, Dongguan 523808, Guangdong, Peoples R China
关键词
Reinforcement learning; Sarsa(lambda; k); Sarsa; Sarsa(lambda);
D O I
10.4028/www.scientific.net/AMM.44-47.3611
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Reinforcement learning (RL) is a state or action value based machine learning method which approximately solves large-scale Markov Decision Process (MDP) or Semi-Markov Decision Process (SMDP). A multi-step RL algorithm called Sarsa(lambda,k) is proposed, which is a compromised variation of Sarsa and Sarsa(lambda). It is equivalent to Sarsa if k is 1 and is equivalent to Sarsa(lambda) if k is infinite. Sarsa(lambda,k) adjust its performance by setting k value. Two forms of Sarsa(lambda,k), forward view Sarsa(lambda,k) and backward view Sarsa(lambda,k), are constructed and proved equivalent in off-line updating.
引用
收藏
页码:3611 / 3615
页数:5
相关论文
共 50 条
  • [1] Multi-Step Reinforcement Learning: A Unifying Algorithm
    De Asis, Kristopher
    Hernandez-Garcia, J. Fernando
    Holland, G. Zacharias
    Sutton, Richard S.
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2902 - 2909
  • [2] Reinforcement Learning for Multi-Step Expert Advice
    Philipp, Patrick
    Rettinger, Achim
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 962 - 971
  • [3] Multi-step Prediction for Learning Invariant Representations in Reinforcement Learning
    Xu, Xinyue
    Lv, Kai
    Dong, Xingye
    Han, Sheng
    Lin, Youfang
    2021 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE BIG DATA AND INTELLIGENT SYSTEMS (HPBD&IS), 2021, : 202 - 206
  • [4] Multi-step truncated Q learning algorithm
    Chen, SL
    Wu, HZ
    Han, XL
    Xiao, L
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 194 - 198
  • [5] Multi-Agent Reinforcement Learning with Multi-Step Generative Models
    Krupnik, Orr
    Mordatch, Igor
    Tamar, Aviv
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [6] A multi-step predictive deep reinforcement learning algorithm for HVAC control systems in smart buildings
    Liu, Xiangfei
    Ren, Mifeng
    Yang, Zhile
    Yan, Gaowei
    Guo, Yuanjun
    Cheng, Lan
    Wu, Chengke
    ENERGY, 2022, 259
  • [7] Multi-step medical image segmentation based on reinforcement learning
    Zhiqiang Tian
    Xiangyu Si
    Yaoyue Zheng
    Zhang Chen
    Xiaojian Li
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 5011 - 5022
  • [8] Speeding-up reinforcement learning with multi-step actions
    Schoknecht, R
    Riedmiller, M
    ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 813 - 818
  • [9] Multi-step medical image segmentation based on reinforcement learning
    Tian, Zhiqiang
    Si, Xiangyu
    Zheng, Yaoyue
    Chen, Zhang
    Li, Xiaojian
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 13 (11) : 5011 - 5022
  • [10] The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
    Meng, Lingheng
    Gorbet, Rob
    Kulic, Dana
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 347 - 353