Safe model-based reinforcement learning for nonlinear optimal control with state and input constraints

Cited by: 17

Authors:

Kim, Yeonsoo [1]

Kim, Jong Woo [2]

Affiliations:

[1] Kwangwoon Univ, Dept Chem Engn, 20 Kwangwoon Ro, Seoul 01897, South Korea

[2] Tech Univ Berlin, Chair Bioproc Engn, Berlin, Germany

Source:

AICHE JOURNAL | 2022 / Vol. 68 / Issue 05

Funding:

National Research Foundation, Singapore;

Keywords:

approximate dynamic programming; barrier function; control Lyapunov function; reinforcement learning; Sontag's formula; PROGRAMS; SYSTEMS;

DOI:

10.1002/aic.17601

Chinese Library Classification (CLC) number:

TQ [Chemical Industry];

Discipline classification code:

0817;

Abstract:

Safety is a critical factor in reinforcement learning (RL) in chemical processes. In our previous work, we proposed a stability-guaranteed RL method for unconstrained nonlinear control-affine systems. In the approximate policy iteration algorithm, a Lyapunov neural network (LNN) was updated while being restricted to the class of control Lyapunov functions, and a policy was updated using a variation of Sontag's formula. In this study, we additionally consider state and input constraints by introducing a barrier function, and we extend the applicable class of systems to general nonlinear systems. We augment the objective function with the constraints and use the LNN combined with a Lyapunov barrier function to approximate the augmented value function. The input given by Sontag's formula with this approximate function brings the states into its lower level set, thereby guaranteeing constraint satisfaction and stability. We prove practical asymptotic stability and forward invariance. The effectiveness is validated using four-tank system simulations.
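
As a rough illustration of the Sontag's formula mentioned in the abstract, the sketch below applies the classical (textbook) form of the formula to a toy scalar control-affine system. It is not the paper's algorithm: the paper uses a variation of the formula together with a Lyapunov neural network and a barrier function, none of which appear here. The system x' = x^3 + u, the quadratic Lyapunov function V(x) = 0.5 x^2, and the names `sontag_control` and `simulate` are all assumptions made for illustration only.

```python
import math


def sontag_control(a, b, eps=1e-12):
    """Classical Sontag formula for a scalar input.

    a = L_f V(x) and b = L_g V(x) are the Lie derivatives of a control
    Lyapunov function V along f and g. Returns 0 when b is (numerically) zero.
    """
    if abs(b) < eps:
        return 0.0
    # u = -(a + sqrt(a^2 + b^4)) / b  gives  V' = a + b*u = -sqrt(a^2 + b^4) < 0
    return -(a + math.sqrt(a * a + b ** 4)) / b


# Toy system (assumed for illustration): x' = f(x) + g(x) * u
def f(x):
    return x ** 3


def g(x):
    return 1.0


def grad_V(x):
    # Gradient of the assumed Lyapunov function V(x) = 0.5 * x^2
    return x


def simulate(x0, dt=1e-3, steps=5000):
    """Forward-Euler simulation of the closed loop under Sontag's formula."""
    x = x0
    traj = [x]
    for _ in range(steps):
        a = grad_V(x) * f(x)  # L_f V
        b = grad_V(x) * g(x)  # L_g V
        u = sontag_control(a, b)
        x = x + dt * (f(x) + g(x) * u)
        traj.append(x)
    return traj


if __name__ == "__main__":
    traj = simulate(1.5)
    print("final state:", traj[-1])
```

Without control, this toy system diverges from any nonzero initial state; with the formula, V decreases along the trajectory, so the state stays inside the level set of V that it starts in. That level-set argument is the same mechanism the abstract relies on for stability and constraint satisfaction, there with a learned, barrier-augmented function in place of the fixed quadratic V assumed here.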


Pages: 18

50 items in total

[21] Offline Model-Based Reinforcement Learning for Tokamak Control
Char, Ian
Abbate, Joseph
Bardoczi, Laszlo
Boyer, Mark D.
Chung, Youngseog
Conlin, Rory
Erickson, Keith
Mehta, Viraj
Richner, Nathan
Kolemen, Egemen
Schneider, Jeff
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
[22] Optimal tracking control based on reinforcement learning value iteration algorithm for time-delayed nonlinear systems with external disturbances and input constraints
Mohammadi, Mehdi
Arefi, Mohammad Mehdi
Setoodeh, Peyman
Kaynak, Okyay
INFORMATION SCIENCES, 2021, 554 : 84 - 98
[23] Machine learning based-distributed optimal control algorithm for multiple nonlinear agents with input constraints
Nguyen Tan Luy
Nguyen Thanh Dang
Dang Quang Minh
Tran Hong Vinh
PROCEEDINGS OF 2018 5TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS 2018), 2018, : 276 - 281
[24] Combining hybrid metaheuristic algorithms and reinforcement learning to improve the optimal control of nonlinear continuous-time systems with input constraints
Amirabadi, Roya Khalili
Fard, Omid Solaymani
COMPUTERS & ELECTRICAL ENGINEERING, 2024, 116
[25] Safe Stabilization Control for Interconnected Virtual-Real Systems via Model-based Reinforcement Learning
Tan, Junkai
Xue, Shuangsi
Li, Huan
Cao, Hui
Li, Dongyu
2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 605 - 610
[26] Uncertainty-Aware Contact-Safe Model-Based Reinforcement Learning
Kuo, Cheng-Yu
Schaarschmidt, Andreas
Cui, Yunduan
Asfour, Tamim
Matsubara, Takamitsu
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 3918 - 3925
[27] Nonlinear stochastic optimal control with input saturation constraints based on path integrals
Satoh, Satoshi
Kappen, Hilbert J.
IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2020, 15 (08) : 1169 - 1175
[28] Cognitive Control Predicts Use of Model-based Reinforcement Learning
Otto, A. Ross
Skatova, Anya
Madlon-Kay, Seth
Daw, Nathaniel D.
JOURNAL OF COGNITIVE NEUROSCIENCE, 2015, 27 (02) : 319 - 333
[29] Model-based hierarchical reinforcement learning and human action control
Botvinick, Matthew
Weinstein, Ari
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2014, 369 (1655)
[30] Reinforcement learning-based event-triggered optimal control for unknown nonlinear systems with input delay
Chen, Xiangyu
Sun, Weiwei
Gao, Xinci
Li, Yongshu
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (07) : 4844 - 4863
