Competitive reinforcement learning in continuous control tasks

被引：0

作者：

Abramson, M ^{[1
]}

Pachowicz, P ^{[1
]}

Wechsler, H ^{[1
]}

机构：

[1] George Mason Univ, Fairfax, VA 22030 USA

来源：

PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4 | 2003年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes a novel hybrid reinforcement learning algorithm, Sarsa Learning Vector Quantization (SLVQ), that leaves the reinforcement part intact but employs a more effective representation of the policy function using,a piecewise constant function based upon "policy prototypes." The prototypes correspond to the pattern classes induced by the Voronoi tessellation generated by self-organizing methods like Learning Vector Quantization (LVQ). The determination of the optimal policy function can be now viewed as a pattern recognition problem in the sense that the assignment of an action to a point in the phase space is similar to the assignment of a pattern class to a point in phase space. The distributed LVQ representation of the policy function automatically generates a piecewise constant tessellation of the state space and yields in a major simplification of the learning task relative to the standard reinforcement learning algorithms for whom a discontinuous table look function, has to be learned. The feasibility and comparative advantages of the new algorithm is shown on the cart centering and mountain car problems, two control problems of increased difficulty.

引用

页码：1909 / 1914

页数：6

共 50 条

[21] Augmented Memory Replay in Reinforcement Learning With Continuous Control
Ramicic, Mirza
Bonarini, Andrea
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 485 - 496
[22] Continuous Control in Car Simulator with Deep Reinforcement Learning
Yang, Fan
Wang, Ping
Wang, XinHong
PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 566 - 570
[23] A Tour of Reinforcement Learning: The View from Continuous Control
Recht, Benjamin
ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 2, 2019, 2 : 253 - 279
[24] Continuous control of a polymerization system with deep reinforcement learning
Ma, Yan
Zhu, Wenbo
Benton, Michael G.
Romagnoli, Jose
JOURNAL OF PROCESS CONTROL, 2019, 75 : 40 - 47
[25] Hierarchical Deep Reinforcement Learning for Continuous Action Control
Yang, Zhaoyang
Merrick, Kathryn
Jin, Lianwen
Abbass, Hussein A.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) : 5174 - 5184
[26] Autoregressive Policies for Continuous Control Deep Reinforcement Learning
Korenkevych, Dmytro
Mahmood, A. Rupam
Vasan, Gautham
Bergstra, James
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2754 - 2762
[27] Deep Reinforcement Learning for Continuous Control of Material Thickness
Dippel, Oliver
Lisitsa, Alexei
Peng, Bei
ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 321 - 334
[28] Action Robust Reinforcement Learning and Applications in Continuous Control
Tessler, Chen
Efroni, Yonathan
Mannor, Shie
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[29] End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks
Cheng, Richard
Orosz, Gabor
Murray, Richard M.
Burdick, Joel W.
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3387 - 3395
[30] Hierarchical reinforcement learning for kinematic control tasks with parameterized action spaces
Cao, Jingyu
Dong, Lu
Sun, Changyin
NEURAL COMPUTING & APPLICATIONS, 2024, 36 (01): : 323 - 336

← 1 2 3 4 5 →