Learning-Based Adaptive Optimal Control of Linear Time-Delay Systems: A Policy Iteration Approach

被引:10
作者
Cui, Leilei [1 ]
Pang, Bo [1 ]
Jiang, Zhong-Ping [1 ]
机构
[1] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, Control & Networks Lab, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
Optimal control; Aerospace electronics; Mathematical models; Heuristic algorithms; Delays; Trajectory; Stability criteria; Adaptive dynamic programming (ADP); linear time-delay systems; optimal control; policy iteration (PI); MULTIAGENT SYSTEMS; RICCATI-EQUATIONS; REGULATOR; CONSENSUS;
D O I
10.1109/TAC.2023.3273786
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article studies the adaptive optimal control problem for a class of linear time-delay systems described by delay differential equations. A crucial strategy is to take advantage of recent developments in reinforcement learning and adaptive dynamic programming and develop novel methods to learn adaptive optimal controllers from finite samples of input and state data. In this article, the data-driven policy iteration (PI) is proposed to solve the infinite-dimensional algebraic Riccati equation iteratively in the absence of exact model knowledge. Interestingly, the proposed recursive PI algorithm is new in the present context of continuous-time time-delay systems, even when the model knowledge is assumed known. The efficacy of the proposed learning-based control methods is validated by means of practical applications arising from metal cutting and autonomous driving.
引用
收藏
页码:629 / 636
页数:8
相关论文
共 48 条
[1]  
Astrom K.J., 1997, Adaptive Control, V2
[2]   A SPLINE BASED TECHNIQUE FOR COMPUTING RICCATI OPERATORS AND FEEDBACK CONTROLS IN REGULATOR PROBLEMS FOR DELAY EQUATIONS [J].
BANKS, HT ;
ROSEN, IG ;
ITO, K .
SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1984, 5 (04) :830-855
[3]   MESH INDEPENDENCE OF KLEINMAN-NEWTON ITERATIONS FOR RICCATI EQUATIONS IN HILBERT SPACE [J].
Burns, J. A. ;
Sachs, E. W. ;
Zietsman, L. .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2008, 47 (05) :2663-2692
[4]  
Cui L., 2022, arXiv
[5]  
Cui LL, 2023, Arxiv, DOI arXiv:2212.02072
[6]  
Curtain R. F., 2012, Introduction to Infinite-Dimensional Systems Theory: A State-Space Approach, V21, DOI [10.1007/978-1-4612-4224-6, DOI 10.1007/978-1-4612-4224-6]
[7]  
Eidelman Y., 2004, FUNCTIONAL ANAL INTR
[8]   OPTIMAL CONTROL OF LINEAR TIME-DELAY SYSTEMS [J].
ELLER, DH ;
AGGARWAL, JK ;
BANKS, HT .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1969, AC14 (06) :678-&
[9]  
Fridman E, 2014, 2014 EUROPEAN CONTROL CONFERENCE (ECC), P1428, DOI 10.1109/ECC.2014.6862628
[10]   New results on stability of discrete-time systems with time-varying state delay [J].
Gao, Huijun ;
Chen, Tongwen .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2007, 52 (02) :328-334