Reinforcement learning with supervision by a stable controller

被引:0
|
作者
Rosenstein, MT [1 ]
Barto, AG [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
来源
PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6 | 2004年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) methods provide a means for solving optimal control problems when accurate models are unavailable. For many such problems, however, RL alone is impractical and the associated learning problem must be structured somehow to take advantage of prior knowledge. In this paper we examine the use of such knowledge in the form of a stable controller that generates control inputs in parallel with an RL system. The controller acts as a supervisor that not only teaches the RL system about favorable control actions but also protects the learning system from risky behavior. We demonstrate the approach with a simulated robotic arm and a real seven-DOF manipulator.
引用
收藏
页码:4517 / 4522
页数:6
相关论文
共 50 条
  • [1] Reinforcement Learning Intersection Controller
    Tolebi, Gulnur
    Dairbekov, Nurlan S.
    Kurmankhojayev, Daniyar
    Mussabayev, Ravil
    2018 14TH INTERNATIONAL CONFERENCE ON ELECTRONICS COMPUTER AND COMPUTATION (ICECCO), 2018,
  • [2] A Stable Lyapunov Constrained Reinforcement Learning based Neural Controller for Non Linear Systems
    Kumar, Abhishek
    Sharma, Rajneesh
    2015 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION & AUTOMATION (ICCCA), 2015, : 185 - 189
  • [3] Data-Efficient Reinforcement Learning from Controller Guidance with Integrated Self-Supervision for Process Control
    Bougie, Nicolas
    Onishi, Takashi
    Tsuruoka, Yoshimasa
    IFAC PAPERSONLINE, 2022, 55 (07): : 863 - 868
  • [4] Stable fitted reinforcement learning
    Gordon, GJ
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8 : 1052 - 1058
  • [5] A reinforcement learning approach to STATCOM controller
    Guo, HX
    Liu, YQ
    Wu, J
    Yang, JM
    PROCEEDINGS OF THE 2004 IEEE INTERNATIONAL CONFERENCE ON ELECTRIC UTILITY DEREGULATION, RESTRUCTURING AND POWER TECHNOLOGIES, VOLS 1 AND 2, 2004, : 638 - 642
  • [6] Design of a reinforcement learning PID controller
    Guan, Zhe
    Yamamoto, Toru
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2021, 16 (10) : 1354 - 1360
  • [7] Redirection Controller Using Reinforcement Learning
    Chang, Yuchen
    Matsumoto, Keigo
    Narumi, Takuji
    Tanikawa, Tomohiro
    Hirose, Michitaka
    IEEE ACCESS, 2021, 9 : 145083 - 145097
  • [8] Design of a Reinforcement Learning PID controller
    Guan, Zhe
    Yamamoto, Tom
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [9] Consistent Action for Stable Training in Reinforcement Learning–based Gain Tuning of Linear Feedback Controller
    ASRI, Department of Electrical and Computer Engineering, Seoul National University, Korea, Republic of
    J. Inst. Control Rob. Syst., 9 (965-972): : 965 - 972
  • [10] Weak Human Preference Supervision for Deep Reinforcement Learning
    Cao, Zehong
    Wong, KaiChiu
    Lin, Chin-Teng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (12) : 5369 - 5378