Reinforcement learning with supervision by a stable controller

Cited by: 0
Authors
Rosenstein, MT [1]
Barto, AG [1]
Affiliation
[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
Source
PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6 | 2004
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Reinforcement learning (RL) methods provide a means for solving optimal control problems when accurate models are unavailable. For many such problems, however, RL alone is impractical and the associated learning problem must be structured somehow to take advantage of prior knowledge. In this paper we examine the use of such knowledge in the form of a stable controller that generates control inputs in parallel with an RL system. The controller acts as a supervisor that not only teaches the RL system about favorable control actions but also protects the learning system from risky behavior. We demonstrate the approach with a simulated robotic arm and a real seven-DOF manipulator.
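The arrangement described in the abstract, a stable controller generating control inputs in parallel with a learner, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the 1-D unit point mass, the PD gains, the linear actor, the blending parameter `k`, and the imitation-style update. The critic and reward signal of a full RL system are omitted.

```python
import random

def supervisor(x, v, kp=2.0, kd=1.5):
    """Hypothetical stable PD controller driving a 1-D unit mass to the origin."""
    return -kp * x - kd * v

def blend(a_rl, a_sup, k):
    """Composite action (assumed rule): k=0 is pure supervisor, k=1 is pure learner."""
    return k * a_rl + (1.0 - k) * a_sup

def run_episode(weights, k, steps=200, dt=0.05, lr=0.01, noise=0.1, seed=0):
    """Simulate one episode; the linear actor is nudged toward the supervisor's action."""
    rng = random.Random(seed)
    x, v = 1.0, 0.0
    for _ in range(steps):
        a_sup = supervisor(x, v)
        a_actor = weights[0] * x + weights[1] * v
        a_rl = a_actor + rng.gauss(0.0, noise)  # exploratory learner action
        a = blend(a_rl, a_sup, k)               # action actually applied
        # Supervised (imitation) update of the actor toward the supervisor.
        err = a_sup - a_actor
        weights[0] += lr * err * x
        weights[1] += lr * err * v
        # Semi-implicit Euler step for a unit point mass.
        v += a * dt
        x += v * dt
    return x, v

if __name__ == "__main__":
    w = [0.0, 0.0]
    print(run_episode(w, k=0.0), w)
```

With `k=0.0` the supervisor has full authority, so the learner's exploration noise never reaches the plant; raising `k` gradually would hand control to the learner as its actions become trustworthy.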
Pages: 4517-4522
Page count: 6
Related papers
50 in total
  • [21] Reinforcement learning based fuzzy adaptive controller
    Ma, Li
    Cai, Zixing
    Zhongnan Gongye Daxue Xuebao/Journal of Central South University of Technology, 29 (02): : 172 - 175
  • [22] A Velocity Controller for Quadrotors Based on Reinforcement Learning
    Hu, Yu
    Luo, Jie
    Dong, Zhiyan
    Zhang, Lihua
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 411 - 421
  • [23] Structure and algorithm of reinforcement learning based controller
    Moshi Shibie yu Rengong Zhineng, 1 (96-100):
  • [24] Safe Reinforcement Learning: Learning with Supervision Using a Constraint-Admissible Set
    Li, Zhaojian
    Kalabic, Uros
    Chu, Tianshu
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 6390 - 6395
  • [25] Improving Spatiotemporal Self-supervision by Deep Reinforcement Learning
    Buechler, Uta
    Brattoli, Biagio
    Ommer, Bjoern
    COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 : 797 - 814
  • [26] Reinforcement learning with supervision by combining multiple learnings and expert advices
    Chang, Hyeong Soo
    2006 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2006, 1-12 : 4159 - 4164
  • [27] Weak Supervision for Fake News Detection via Reinforcement Learning
    Wang, Yaqing
    Yang, Weifeng
    Ma, Fenglong
    Jin Xu
    Bin Zhong
    Deng, Qiang
    Gao, Jing
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 516 - 523
  • [28] A Reinforcement Learning Adaptive Fuzzy Controller for Differential Games
    Sidney N. Givigi
    Howard M. Schwartz
    Xiaosong Lu
    Journal of Intelligent and Robotic Systems, 2010, 59 : 3 - 30
  • [29] Controller Optimization for Multirate Systems Based on Reinforcement Learning
    Zhan Li
    Sheng-Ri Xue
    Xing-Hu Yu
    Hui-Jun Gao
    International Journal of Automation and Computing, 2020, 17 : 417 - 427
  • [30] A Reinforcement Learning Adaptive Fuzzy Controller for Differential Games
    Givigi, Sidney N., Jr.
    Schwartz, Howard M.
    Lu, Xiaosong
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2010, 59 (01) : 3 - 30