Competitive reinforcement learning in continuous control tasks

被引:0
|
作者
Abramson, M [1 ]
Pachowicz, P [1 ]
Wechsler, H [1 ]
机构
[1] George Mason Univ, Fairfax, VA 22030 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a novel hybrid reinforcement learning algorithm, Sarsa Learning Vector Quantization (SLVQ), that leaves the reinforcement part intact but employs a more effective representation of the policy function using,a piecewise constant function based upon "policy prototypes." The prototypes correspond to the pattern classes induced by the Voronoi tessellation generated by self-organizing methods like Learning Vector Quantization (LVQ). The determination of the optimal policy function can be now viewed as a pattern recognition problem in the sense that the assignment of an action to a point in the phase space is similar to the assignment of a pattern class to a point in phase space. The distributed LVQ representation of the policy function automatically generates a piecewise constant tessellation of the state space and yields in a major simplification of the learning task relative to the standard reinforcement learning algorithms for whom a discontinuous table look function, has to be learned. The feasibility and comparative advantages of the new algorithm is shown on the cart centering and mountain car problems, two control problems of increased difficulty.
引用
收藏
页码:1909 / 1914
页数:6
相关论文
共 50 条
  • [21] Augmented Memory Replay in Reinforcement Learning With Continuous Control
    Ramicic, Mirza
    Bonarini, Andrea
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 485 - 496
  • [22] Continuous Control in Car Simulator with Deep Reinforcement Learning
    Yang, Fan
    Wang, Ping
    Wang, XinHong
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 566 - 570
  • [23] A Tour of Reinforcement Learning: The View from Continuous Control
    Recht, Benjamin
    ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 2, 2019, 2 : 253 - 279
  • [24] Continuous control of a polymerization system with deep reinforcement learning
    Ma, Yan
    Zhu, Wenbo
    Benton, Michael G.
    Romagnoli, Jose
    JOURNAL OF PROCESS CONTROL, 2019, 75 : 40 - 47
  • [25] Hierarchical Deep Reinforcement Learning for Continuous Action Control
    Yang, Zhaoyang
    Merrick, Kathryn
    Jin, Lianwen
    Abbass, Hussein A.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) : 5174 - 5184
  • [26] Autoregressive Policies for Continuous Control Deep Reinforcement Learning
    Korenkevych, Dmytro
    Mahmood, A. Rupam
    Vasan, Gautham
    Bergstra, James
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2754 - 2762
  • [27] Deep Reinforcement Learning for Continuous Control of Material Thickness
    Dippel, Oliver
    Lisitsa, Alexei
    Peng, Bei
    ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 321 - 334
  • [28] Action Robust Reinforcement Learning and Applications in Continuous Control
    Tessler, Chen
    Efroni, Yonathan
    Mannor, Shie
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [29] End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks
    Cheng, Richard
    Orosz, Gabor
    Murray, Richard M.
    Burdick, Joel W.
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3387 - 3395
  • [30] Hierarchical reinforcement learning for kinematic control tasks with parameterized action spaces
    Cao, Jingyu
    Dong, Lu
    Sun, Changyin
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (01): : 323 - 336