Safe model-based reinforcement learning for nonlinear optimal control with state and input constraints

Cited by: 17

Authors:

Kim, Yeonsoo [1]

Kim, Jong Woo [2]

Affiliations:

[1] Kwangwoon Univ, Dept Chem Engn, 20 Kwangwoon Ro, Seoul 01897, South Korea

[2] Tech Univ Berlin, Chair Bioproc Engn, Berlin, Germany

Source:

AICHE JOURNAL | 2022 / Vol. 68 / Issue 05

Funding:

National Research Foundation, Singapore;

Keywords:

approximate dynamic programming; barrier function; control Lyapunov function; reinforcement learning; Sontag's formula; PROGRAMS; SYSTEMS;

DOI:

10.1002/aic.17601

Chinese Library Classification (CLC) number:

TQ [Chemical Industry];

Discipline classification code:

0817;

Abstract:

Safety is a critical factor in reinforcement learning (RL) in chemical processes. In our previous work, we proposed a new stability-guaranteed RL for unconstrained nonlinear control-affine systems. In the approximate policy iteration algorithm, a Lyapunov neural network (LNN) was updated while being restricted to the control Lyapunov function, and a policy was updated using a variation of Sontag's formula. In this study, we additionally consider state and input constraints by introducing a barrier function, and we extend the applicable class of systems to general nonlinear systems. We augment the constraints into the objective function and use the LNN augmented with a Lyapunov barrier function to approximate the augmented value function. The input given by Sontag's formula with this approximate function brings the states into its lower level set, thereby guaranteeing constraint satisfaction and stability. We prove the practical asymptotic stability and forward invariance. The effectiveness is validated using four-tank system simulations.
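As a rough illustration of the policy update the abstract refers to, the sketch below implements the classical Sontag universal formula for a control-affine system dx/dt = f(x) + g(x)u with a given control Lyapunov function V. It is not the variation used in the paper, and it includes neither the barrier function nor the LNN. The names `sontag_control` and `lyapunov_rate`, and the toy system dx/dt = x + u with V(x) = 0.5 x^2, are assumptions made only for this example.

```python
import math
from typing import List, Sequence


def sontag_control(a: float, b: Sequence[float], eps: float = 1e-12) -> List[float]:
    """Classical Sontag universal formula (hypothetical helper, not the paper's variant).

    a = L_f V(x) is the Lie derivative of V along the drift f.
    b = L_g V(x) holds one entry per control input.
    """
    b_sq = sum(bi * bi for bi in b)  # squared norm |b|^2
    if b_sq < eps:
        # The formula defines u = 0 where L_g V vanishes.
        return [0.0 for _ in b]
    gain = (a + math.sqrt(a * a + b_sq * b_sq)) / b_sq
    return [-gain * bi for bi in b]


def lyapunov_rate(a: float, b: Sequence[float], u: Sequence[float]) -> float:
    """dV/dt = L_f V + L_g V * u along the closed-loop trajectory."""
    return a + sum(bi * ui for bi, ui in zip(b, u))


# Toy example (assumed): dx/dt = x + u, V(x) = 0.5 * x^2, evaluated at x = 2.
x = 2.0
a = x * x  # L_f V = (dV/dx) * f(x) = x * x
b = [x]    # L_g V = (dV/dx) * g(x) = x * 1
u = sontag_control(a, b)
v_dot = lyapunov_rate(a, b, u)
print(u, v_dot)
```

With this input, dV/dt equals -sqrt(a^2 + |b|^4) wherever L_g V is nonzero, so V decreases along trajectories; this is the mechanism by which a Sontag-type input moves the state toward lower level sets of V.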


Pages: 18

50 items in total

[31] Integral reinforcement learning-based guaranteed cost control for unknown nonlinear systems subject to input constraints and uncertainties
Liang, Yuling
Zhang, Huaguang
Zhang, Juan
Luo, Yanhong
APPLIED MATHEMATICS AND COMPUTATION, 2021, 408
[32] Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints
He, Pingan
Jagannathan, S.
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02) : 425 - 436
[33] Model-Free Optimal Vibration Control of a Nonlinear System Based on Deep Reinforcement Learning
Jiang, Jiyuan
Tang, Jie
Zhao, Kun
Li, Meng
Li, Yinghui
Cao, Dengqing
INTERNATIONAL JOURNAL OF STRUCTURAL STABILITY AND DYNAMICS, 2024
[34] Reinforcement learning-based optimal control of uncertain nonlinear systems
Garcia, Miguel
Dong, Wenjie
INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (12) : 2839 - 2850
[35] A survey on model-based reinforcement learning
Luo, Fan-Ming
Xu, Tian
Lai, Hang
Chen, Xiong-Hui
Zhang, Weinan
Yu, Yang
SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (02)
[36] Efficient state synchronisation in model-based testing through reinforcement learning
Turker, Uraz Cengiz
Hierons, Robert M.
Mousavi, Mohammad Reza
Tyukin, Ivan Y.
2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING ASE 2021, 2021 : 368 - 380
[37] Model-Based Reinforcement Learning Exploiting State-Action Equivalence
Asadi, Mahsa
Talebi, Mohammad Sadegh
Bourel, Hippolyte
Maillard, Odalric-Ambrym
ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 204 - 219
[38] Model-based Reinforcement Learning: A Survey
Moerland, Thomas M.
Broekens, Joost
Plaat, Aske
Jonker, Catholijn M.
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2023, 16 (01) : 1 - 118
[39] Model-based reinforcement learning and predictive control for two-stage optimal control of fed-batch bioreactor
Kim, Jong Woo
Park, Byung Jun
Oh, Tae Hoon
Lee, Jong Min
COMPUTERS & CHEMICAL ENGINEERING, 2021, 154 (154)
[40] Reinforcement learning-based optimal control of unknown constrained-input nonlinear systems using simulated experience
Hamed Jabbari Asl
Eiji Uchibe
Nonlinear Dynamics, 2023, 111 : 16093 - 16110
