Reinforcement learning with supervision by a stable controller

Cited by: 0

Authors:

Rosenstein, MT [1]

Barto, AG [1]

Affiliation:

[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA

Source:

PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6 | 2004

Keywords:

DOI:

Not available

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Reinforcement learning (RL) methods provide a means for solving optimal control problems when accurate models are unavailable. For many such problems, however, RL alone is impractical and the associated learning problem must be structured somehow to take advantage of prior knowledge. In this paper we examine the use of such knowledge in the form of a stable controller that generates control inputs in parallel with an RL system. The controller acts as a supervisor that not only teaches the RL system about favorable control actions but also protects the learning system from risky behavior. We demonstrate the approach with a simulated robotic arm and a real seven-DOF manipulator.
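The parallel arrangement described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the PD supervisor and its gains, the blending weight `k`, the `safe_limit` threshold, and all function names are hypothetical. It shows one plausible way a supervisor could both contribute to the applied control input and override the learner when the state looks risky.

```python
def supervisor_action(state, kp=2.0, kd=0.5):
    """Hypothetical stable PD controller driving a 1-D point mass to the origin."""
    position, velocity = state
    return -kp * position - kd * velocity


def composite_action(a_rl, a_sup, k):
    """Blend the RL and supervisor inputs; k=1 means full supervisor control."""
    k = min(max(k, 0.0), 1.0)  # keep the blending weight in [0, 1]
    return k * a_sup + (1.0 - k) * a_rl


def guarded_action(state, a_rl, k, safe_limit=1.0):
    """Use the supervisor alone when the position leaves an assumed safe region."""
    a_sup = supervisor_action(state)
    if abs(state[0]) > safe_limit:
        return a_sup  # protective override (illustrative rule, not from the paper)
    return composite_action(a_rl, a_sup, k)
```

In this sketch, lowering `k` over time would hand more authority to the learner, while the override keeps the supervisor in charge outside the assumed safe region.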


Pages: 4517-4522

Number of pages: 6

50 items in total

[21] Reinforcement learning based fuzzy adaptive controller
Ma, Li
Cai, Zixing
Zhongnan Gongye Daxue Xuebao/Journal of Central South University of Technology, 29 (02) : 172 - 175
[22] A Velocity Controller for Quadrotors Based on Reinforcement Learning
Hu, Yu
Luo, Jie
Dong, Zhiyan
Zhang, Lihua
ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 411 - 421
[23] Structure and algorithm of reinforcement learning based controller
Moshi Shibie yu Rengong Zhineng, 1 (96-100):
[24] Safe Reinforcement Learning: Learning with Supervision Using a Constraint-Admissible Set
Li, Zhaojian
Kalabic, Uros
Chu, Tianshu
2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 6390 - 6395
[25] Improving Spatiotemporal Self-supervision by Deep Reinforcement Learning
Buechler, Uta
Brattoli, Biagio
Ommer, Bjoern
COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 797 - 814
[26] Reinforcement learning with supervision by combining multiple learnings and expert advices
Chang, Hyeong Soo
2006 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2006, 1-12 : 4159 - 4164
[27] Weak Supervision for Fake News Detection via Reinforcement Learning
Wang, Yaqing
Yang, Weifeng
Ma, Fenglong
Jin Xu
Bin Zhong
Deng, Qiang
Gao, Jing
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 516 - 523
[28] A Reinforcement Learning Adaptive Fuzzy Controller for Differential Games
Sidney N. Givigi
Howard M. Schwartz
Xiaosong Lu
Journal of Intelligent and Robotic Systems, 2010, 59 : 3 - 30
[29] Controller Optimization for Multirate Systems Based on Reinforcement Learning
Zhan Li
Sheng-Ri Xue
Xing-Hu Yu
Hui-Jun Gao
International Journal of Automation and Computing, 2020, 17 : 417 - 427
[30] A Reinforcement Learning Adaptive Fuzzy Controller for Differential Games
Givigi, Sidney N., Jr.
Schwartz, Howard M.
Lu, Xiaosong
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2010, 59 (01) : 3 - 30
