Receding Horizon Actor-Critic Learning Control for Nonlinear Time-Delay Systems With Unknown Dynamics

Cited by: 8
Authors
Liu, Jiahang [1, 2]
Zhang, Xinglong [1 ]
Xu, Xin [1 ]
Xiong, Quan [1 ]
Affiliations
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
[2] Beijing Inst Biotechnol, Beijing 100071, Peoples R China
Source
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2023 / Vol. 53 / No. 08
Funding
National Natural Science Foundation of China;
Keywords
Delay effects; Optimal control; Control systems; Stability criteria; Simulation; Predictive control; Costs; Discrete-time nonlinear systems; Koopman operator; receding horizon control; reinforcement learning (RL); time-delay systems; MODEL-PREDICTIVE CONTROL; KOOPMAN OPERATOR; STABILITY; DESIGN;
DOI
10.1109/TSMC.2023.3254911
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
With the development of modern mechatronics and networked systems, controller design for time-delay systems has received notable attention. Time delays can greatly influence the stability and performance of such systems, especially in optimal control design. In this article, we propose a receding horizon actor-critic learning control approach for near-optimal control of nonlinear time-delay systems (RACL-TD) with unknown dynamics. In the proposed approach, a data-driven predictor for nonlinear time-delay systems is first learned from precollected samples based on Koopman theory. Then, a receding horizon actor-critic architecture is designed to learn a near-optimal control policy. In RACL-TD, the terminal cost is determined using the Lyapunov-Krasovskii approach so that the influence of the delayed states and control inputs is properly accounted for. Furthermore, a relaxed terminal condition is presented to reduce the computational cost. The convergence and optimality of RACL-TD in each prediction interval, as well as the closed-loop properties of the system, are analyzed. Simulation results on a two-stage time-delayed chemical reactor illustrate that RACL-TD can achieve better control performance than nonlinear model predictive control (MPC) and infinite-horizon adaptive dynamic programming. Moreover, RACL-TD can incur a lower computational cost than nonlinear MPC.
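The first step named in the abstract, learning a data-driven predictor from precollected samples, can be illustrated with a generic least-squares fit on lifted delay coordinates (extended dynamic mode decomposition with control). This is a minimal sketch, not the RACL-TD algorithm from the article: the toy delayed system, the choice of observables (delayed states and their squares), and every function name below are assumptions made for illustration only.

```python
import numpy as np


def lift(window):
    """Hypothetical observables: the delayed states and their squares."""
    w = np.asarray(window, dtype=float)
    return np.concatenate([w, w ** 2])


def simulate(n_steps, delay, rng):
    """Toy scalar delayed system (an assumption, not the paper's reactor)."""
    x = np.zeros(n_steps + 1)
    x[:delay + 1] = rng.uniform(-1.0, 1.0, delay + 1)
    u = rng.uniform(-1.0, 1.0, n_steps)
    for k in range(delay, n_steps):
        x[k + 1] = 0.5 * x[k] + 0.2 * x[k - delay] + 0.1 * u[k]
    return x, u


def fit_predictor(x, u, delay):
    """Least-squares fit of z[k+1] ~ A z[k] + B u[k] on lifted delay windows."""
    # The window at step k is (x[k], x[k-1], ..., x[k-delay]).
    Z = np.array([lift(x[k - delay:k + 1][::-1]) for k in range(delay, len(x))])
    Z_now, Z_next = Z[:-1], Z[1:]
    U_now = np.asarray(u[delay:len(x) - 1], dtype=float).reshape(-1, 1)
    G, *_ = np.linalg.lstsq(np.hstack([Z_now, U_now]), Z_next, rcond=None)
    n = Z.shape[1]
    return G[:n].T, G[n:].T  # A is (n, n), B is (n, 1)


rng = np.random.default_rng(0)
delay = 2
x, u = simulate(200, delay, rng)
A, B = fit_predictor(x, u, delay)

# One-step prediction: the first lifted coordinate is the next state.
k = 100
z = lift(x[k - delay:k + 1][::-1])
x_pred = (A @ z + B[:, 0] * u[k])[0]
one_step_error = abs(x_pred - x[k + 1])
```

Because the toy system is linear in the delayed states and the input, the state row of the fitted model reproduces it almost exactly. For a genuinely nonlinear plant, the quality of such a predictor would depend on the observables chosen.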
Pages: 4980 - 4993
Number of pages: 14