Safe model-based reinforcement learning for nonlinear optimal control with state and input constraints

Cited by: 17
Authors
Kim, Yeonsoo [1]
Kim, Jong Woo [2]
Affiliations
[1] Kwangwoon Univ, Dept Chem Engn, 20 Kwangwoon Ro, Seoul 01897, South Korea
[2] Tech Univ Berlin, Chair Bioproc Engn, Berlin, Germany
Funding
National Research Foundation of Singapore;
Keywords
approximate dynamic programming; barrier function; control Lyapunov function; reinforcement learning; Sontag's formula; PROGRAMS; SYSTEMS;
DOI
10.1002/aic.17601
Chinese Library Classification number
TQ [Chemical Industry];
Discipline classification code
0817 ;
Abstract
Safety is a critical factor in reinforcement learning (RL) in chemical processes. In our previous work, we proposed a new stability-guaranteed RL method for unconstrained nonlinear control-affine systems. In the approximate policy iteration algorithm, a Lyapunov neural network (LNN) was updated while being restricted to the control Lyapunov function, and a policy was updated using a variation of Sontag's formula. In this study, we additionally consider state and input constraints by introducing a barrier function, and we extend the applicable class of systems to general nonlinear systems. We augment the constraints into the objective function and use the LNN augmented with a Lyapunov barrier function to approximate the augmented value function. The input given by Sontag's formula with this approximate function brings the states into its lower level set, thereby guaranteeing constraint satisfaction and stability. We prove practical asymptotic stability and forward invariance. The effectiveness is validated using four-tank system simulations.
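For readers unfamiliar with Sontag's formula, the following is a minimal sketch of its classic form for a control-affine system dx/dt = f(x) + g(x)u with a control Lyapunov function V. It is not the variation used in the paper, and it includes neither the barrier function nor the LNN. The function name `sontag_control` and the scalar example system (f(x) = x, g(x) = 1, V(x) = x^2/2) are assumptions made purely for illustration.

```python
import numpy as np

def sontag_control(a, b, eps=1e-12):
    """Classic Sontag formula (illustrative helper, not from the paper).

    a : scalar Lie derivative L_f V(x) = grad V(x) . f(x)
    b : vector Lie derivative L_g V(x) = grad V(x) . g(x)
    Returns u such that dV/dt = a + b.u = -sqrt(a^2 + |b|^4) when b != 0.
    """
    b = np.atleast_1d(np.asarray(b, dtype=float))
    bb = float(b @ b)  # |b|^2
    if bb < eps:
        # When L_g V vanishes, the formula sets the input to zero.
        return np.zeros_like(b)
    return -((a + np.sqrt(a**2 + bb**2)) / bb) * b

# Hypothetical scalar example: dx/dt = x + u, V(x) = x^2 / 2.
x = 2.0
a = x * x                  # grad V * f = x * x
b = np.array([x * 1.0])    # grad V * g = x * 1
u = sontag_control(a, b)
v_dot = a + b @ u          # time derivative of V along the closed loop
print(u, v_dot)
```

In this example the input reduces to u = -(1 + sqrt(2)) x, so V decreases along the closed-loop trajectory, which is the mechanism the abstract relies on to bring the states into a lower level set.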
Pages: 18
Related papers
50 in total
  • [21] Offline Model-Based Reinforcement Learning for Tokamak Control
    Char, Ian
    Abbate, Joseph
    Bardoczi, Laszlo
    Boyer, Mark D.
    Chung, Youngseog
    Conlin, Rory
    Erickson, Keith
    Mehta, Viraj
    Richner, Nathan
    Kolemen, Egemen
    Schneider, Jeff
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [22] Optimal tracking control based on reinforcement learning value iteration algorithm for time-delayed nonlinear systems with external disturbances and input constraints
    Mohammadi, Mehdi
    Arefi, Mohammad Mehdi
    Setoodeh, Peyman
    Kaynak, Okyay
    INFORMATION SCIENCES, 2021, 554 : 84 - 98
  • [23] Machine learning based-distributed optimal control algorithm for multiple nonlinear agents with input constraints
    Nguyen Tan Luy
    Nguyen Thanh Dang
    Dang Quang Minh
    Tran Hong Vinh
    PROCEEDINGS OF 2018 5TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS 2018), 2018, : 276 - 281
  • [24] Combining hybrid metaheuristic algorithms and reinforcement learning to improve the optimal control of nonlinear continuous-time systems with input constraints
    Amirabadi, Roya Khalili
    Fard, Omid Solaymani
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 116
  • [25] Safe Stabilization Control for Interconnected Virtual-Real Systems via Model-based Reinforcement Learning
    Tan, Junkai
    Xue, Shuangsi
    Li, Huan
    Cao, Hui
    Li, Dongyu
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 605 - 610
  • [26] Uncertainty-Aware Contact-Safe Model-Based Reinforcement Learning
    Kuo, Cheng-Yu
    Schaarschmidt, Andreas
    Cui, Yunduan
    Asfour, Tamim
    Matsubara, Takamitsu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 3918 - 3925
  • [27] Nonlinear stochastic optimal control with input saturation constraints based on path integrals
    Satoh, Satoshi
    Kappen, Hilbert J.
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2020, 15 (08) : 1169 - 1175
  • [28] Cognitive Control Predicts Use of Model-based Reinforcement Learning
    Otto, A. Ross
    Skatova, Anya
    Madlon-Kay, Seth
    Daw, Nathaniel D.
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2015, 27 (02) : 319 - 333
  • [29] Model-based hierarchical reinforcement learning and human action control
    Botvinick, Matthew
    Weinstein, Ari
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2014, 369 (1655)
  • [30] Reinforcement learning-based event-triggered optimal control for unknown nonlinear systems with input delay
    Chen, Xiangyu
    Sun, Weiwei
    Gao, Xinci
    Li, Yongshu
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (07) : 4844 - 4863