Reinforcement learning with supervision by a stable controller

被引:0
作者
Rosenstein, MT [1 ]
Barto, AG [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
来源
PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6 | 2004年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) methods provide a means for solving optimal control problems when accurate models are unavailable. For many such problems, however, RL alone is impractical and the associated learning problem must be structured somehow to take advantage of prior knowledge. In this paper we examine the use of such knowledge in the form of a stable controller that generates control inputs in parallel with an RL system. The controller acts as a supervisor that not only teaches the RL system about favorable control actions but also protects the learning system from risky behavior. We demonstrate the approach with a simulated robotic arm and a real seven-DOF manipulator.
引用
收藏
页码:4517 / 4522
页数:6
相关论文
共 50 条
[41]   A Stable Deep Reinforcement Learning Framework for Recommendation [J].
Liu, Ruochen ;
Jiang, Dawei ;
Zhang, Xilong .
IEEE INTELLIGENT SYSTEMS, 2022, 37 (03) :76-84
[42]   Stable reinforcement learning with recurrent neural networks [J].
Knight J.N. ;
Anderson C. .
Journal of Control Theory and Applications, 2011, 9 (3) :410-420
[43]   SDN Controller Load Balancing Based on Reinforcement Learning [J].
Li, Zhuo ;
Zhou, Xu ;
Gao, Junruo ;
Qin, Yifang .
PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, :1120-1126
[44]   Advanced UAV Flight Controller with Deep Reinforcement Learning [J].
Lee, SeungGi ;
Cho, Deun-Sol ;
Kim, Won-Tae .
BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 124 :180-181
[45]   Controller Optimization for Multirate Systems Based on Reinforcement Learning [J].
Li, Zhan ;
Xue, Sheng-Ri ;
Yu, Xing-Hu ;
Gao, Hui-Jun .
INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2020, 17 (03) :417-427
[46]   Hybrid Reinforcement Learning based controller for autonomous navigation [J].
Joglekar, Ajinkya ;
Krovi, Venkat ;
Brudnak, Mark ;
Smereka, Jonathon M. .
2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
[47]   Online reinforcement learning of controller parameters adaptation law [J].
Alhazmi, Khalid ;
Sarathy, S. Mani .
2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, :2975-2980
[48]   Reinforcement learning congestion controller for multimedia surveillance system [J].
Hsiao, MC ;
Hwang, KS ;
Tan, SW ;
Wu, CS .
2003 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-3, PROCEEDINGS, 2003, :4403-4407
[49]   Supplementary Reinforcement Learning Controller Designed for Quadrotor UAVs [J].
Lin, Xiaobo ;
Yu, Yao ;
Sun, Chang-Yin .
IEEE ACCESS, 2019, 7 :26422-26431
[50]   Self scaling reinforcement learning for fuzzy logic controller [J].
Fukuda, T ;
Hasegawa, Y ;
Shimojima, K ;
Saito, F .
1996 IEEE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION (ICEC '96), PROCEEDINGS OF, 1996, :247-252