Receding Horizon Actor-Critic Learning Control for Nonlinear Time-Delay Systems With Unknown Dynamics

被引：8

作者：

Liu, Jiahang ^{[1
,2
]}

Zhang, Xinglong ^{[1
]}

Xu, Xin ^{[1
]}

Xiong, Quan ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China

[2] Beijing Inst Biotechnol, Beijing 100071, Peoples R China

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2023年 / 53卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Delay effects; Optimal control; Control systems; Stability criteria; Simulation; Predictive control; Costs; Discrete-time nonlinear systems; Koopman operator; receding horizon control; reinforcement learning (RL); time-delay systems; MODEL-PREDICTIVE CONTROL; KOOPMAN OPERATOR; STABILITY; DESIGN;

D O I：

10.1109/TSMC.2023.3254911

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the development of modern mechatronics and networked systems, the controller design of time-delay systems has received notable attention. Time delays can greatly influence the stability and performance of the systems, especially for optimal control design. In this article, we propose a receding horizon actor-critic learning control approach for near-optimal control of nonlinear time-delay systems (RACL-TD) with unknown dynamics. In the proposed approach, a data-driven predictor for nonlinear time-delay systems is first learned based on the Koopman theory using precollected samples. Then, a receding horizon actor-critic architecture is designed to learn a near-optimal control policy. In RACL-TD, the terminal cost is determined by using the Lyapunov-Krasovskii approach so that the influences of the delayed states and control inputs can be well addressed. Furthermore, a relaxed terminal condition is present to reduce the computational cost. The convergence and optimality of RACL-TD in each prediction interval as well as the closed-loop property of the system are discussed and analyzed. Simulation results on a two-stage time-delayed chemical reactor illustrate that RACL-TD can achieve better control performance than nonlinear model predictive control (MPC) and infinite-horizon adaptive dynamic programming. Moreover, RACL-TD can have less computational cost than nonlinear MPC.

引用

页码：4980 / 4993

页数：14

共 47 条

[1] Lyapunov-Krasovskii Characterizations of Integral Input-to-State Stability of Delay Systems With Nonstrict Dissipation Rates
Chaillet, Antoine
Goksu, Gokhan
Pepe, Pierdomenico
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (07) : 3259 - 3272
[2] A Converse Lyapunov-Krasovskii Theorem for the Global Asymptotic Local Exponential Stability of Nonlinear Time-Delay Systems
Di Ferdinando, M.
Pepe, P.
Gennaro, S. Di
[J]. IEEE CONTROL SYSTEMS LETTERS, 2021, 5 (01): : 7 - 12
[3] Functional Nonlinear Model Predictive Control Based on Adaptive Dynamic Programming
Dong, Lu
Yan, Jun
Yuan, Xin
He, Haibo
Sun, Changyin
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (12) : 4206 - 4218
[4] Stabilising predictive control of non-linear time-delay systems using control Lyapunov-Krasovskii functionals
Esfanjani, R. Mahboobi
Nikravesh, S. K. Y.
[J]. IET CONTROL THEORY AND APPLICATIONS, 2009, 3 (10) : 1395 - 1400
[5] Fridman E., INTRO TIME DELAY SYS
[6] Tutorial on Lyapunov-based methods for time-delay systems
Fridman, Emilia
[J]. EUROPEAN JOURNAL OF CONTROL, 2014, 20 (06) : 271 - 283
[7] A New Design of Robust H∞ Sliding Mode Control for Uncertain Stochastic T-S Fuzzy Time-Delay Systems
Gao, Qing
Feng, Gang
Xi, Zhiyu
Wang, Yong
Qiu, Jianbin
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (09) : 1556 - 1566
[8] CORRECTION
GROTSCHEL, M
[J]. COMBINATORICA, 1984, 4 (04) : 291 - 295
[9] Robust Finite-Time Bounded Controller Design of Time- Delay Conic Nonlinear Systems Using Sliding Mode Control Strategy
He, Shuping
Song, Jun
Liu, Fei
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 48 (11): : 1863 - 1873
[10] Backstepping Control for Nonlinear Systems With Time Delays and Applications to Chemical Reactor Systems
Hua, Changchun
Liu, Peter X.
Guan, Xinping
[J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2009, 56 (09) : 3723 - 3732

← 1 2 3 4 5 →