Reinforcement learning with supervision by a stable controller

被引:0
作者
Rosenstein, MT [1 ]
Barto, AG [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
来源
PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6 | 2004年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) methods provide a means for solving optimal control problems when accurate models are unavailable. For many such problems, however, RL alone is impractical and the associated learning problem must be structured somehow to take advantage of prior knowledge. In this paper we examine the use of such knowledge in the form of a stable controller that generates control inputs in parallel with an RL system. The controller acts as a supervisor that not only teaches the RL system about favorable control actions but also protects the learning system from risky behavior. We demonstrate the approach with a simulated robotic arm and a real seven-DOF manipulator.
引用
收藏
页码:4517 / 4522
页数:6
相关论文
共 50 条
[31]   A Reinforcement Learning Adaptive Fuzzy Controller for Differential Games [J].
Givigi, Sidney N., Jr. ;
Schwartz, Howard M. ;
Lu, Xiaosong .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2010, 59 (01) :3-30
[32]   Controller Optimization for Multirate Systems Based on Reinforcement Learning [J].
Zhan Li ;
Sheng-Ri Xue ;
Xing-Hu Yu ;
Hui-Jun Gao .
International Journal of Automation and Computing, 2020, 17 :417-427
[33]   Intelligent PID Controller Based on Deep Reinforcement Learning [J].
Zhai, Yinhe ;
Zhao, Qiang ;
Han, Yinghua ;
Wang, Jinkuan ;
Zeng, Wenying .
2024 8TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION, ICRCA 2024, 2024, :343-348
[34]   Controller Optimization for Multirate Systems Based on Reinforcement Learning [J].
Zhan Li ;
ShengRi Xue ;
XingHu Yu ;
HuiJun Gao .
International Journal of Automation and Computing, 2020, 17 (03) :417-427
[35]   Function Approximation for Reinforcement Learning Controller for Energy from [J].
Sarkar, Soumyendu ;
Gundecha, Vineet ;
Ghorbanpour, Sahand ;
Shmakov, Alexander ;
Babu, Ashwin Ramesh ;
Naug, Avisek ;
Pichard, Alexandre ;
Cocho, Mathieu .
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, :6201-6209
[36]   A Proposal of Adaptive PID Controller Based on Reinforcement Learning [J].
WANG, Xue-song ;
CHENG, Yu-hu ;
SUN, Wei .
Journal of China University of Mining and Technology, 2007, 17 (01) :40-44
[37]   Fuzzy PID Controller for UAV Based on Reinforcement Learning [J].
Zhang, Benyi ;
Zhang, Weiping ;
Mou, Jiawang ;
Yang, Runmin ;
Zhang, Yichen .
PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 :1724-1732
[38]   Controller Design for Quadrotor UAVs using Reinforcement Learning [J].
Bou-Ammar, Haitham ;
Voos, Holger ;
Ertel, Wolfgang .
2010 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS, 2010, :2130-2135
[39]   Reinforcement Learning Integrated Nonlinear Controller for Guaranteed Stability [J].
Nan, Shiqi ;
Qian, Chunjiang .
2024 19TH ANNUAL SYSTEM OF SYSTEMS ENGINEERING CONFERENCE, SOSE 2024, 2024, :172-177
[40]   Stable Training of Bellman Error in Reinforcement Learning [J].
Gong, Chen ;
Bai, Yunpeng ;
Hou, Xinwen ;
Li, Xiaohui .
NEURAL INFORMATION PROCESSING, ICONIP 2020, PT V, 2021, 1333 :439-448