Sum-of-squares-based policy iteration for suboptimal control of polynomial time-varying systems

被引:7
作者
Pakkhesal, Sajjad [1 ]
Shamaghdari, Saeed [1 ]
机构
[1] Iran Univ Sci & Technol, Dept Elect Engn, Tehran 1684613114, Iran
关键词
adaptive dynamic programming (ADP); optimal control; policy iteration; sum-of-squares (SOS) programming; time-varying systems; NONLINEAR-SYSTEMS;
D O I
10.1002/asjc.2689
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates the suboptimal control design for polynomial time-varying systems. It is known that the solution to this problem relies on the solution of the Hamilton-Jacobi-Bellman (HJB) equation, which is a nonlinear partial differential equation (PDE). A policy iteration (PI) algorithm is developed to solve the HJB equation. The policy evaluation step of this algorithm consists of a sum-of-squares (SOS) program, which is computationally tractable. This algorithm distinguishes from previously known SOS-based adaptive dynamic programming (ADP) algorithms in that it is developed for time-varying systems. The convergence of the iterative algorithm and the global stability of the closed-loop system are proved. At the end, the effectiveness of the proposed algorithm is illustrated through two simulation examples.
引用
收藏
页码:3022 / 3031
页数:10
相关论文
共 24 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   Policy iterations on the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation [J].
Abu-Khalaf, Murad ;
Lewis, Frank L. ;
Huang, Jie .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (12) :1989-1995
[3]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[4]  
Bertsekas D. P., 1996, Neuro-Dynamic Programming
[5]   Reinforcement Learning for Linear Continuous-time Systems: an Incremental Learning Approach [J].
Bian, Tao ;
Jiang, Zhong-Ping .
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (02) :433-440
[6]   VISCOSITY SOLUTIONS OF HAMILTON-JACOBI EQUATIONS [J].
CRANDALL, MG ;
LIONS, PL .
TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1983, 277 (01) :1-42
[7]   Extended adaptive optimal control of linear systems with unknown dynamics using adaptive dynamic programming [J].
Gan, Minggang ;
Zhao, Jingang ;
Zhang, Chi .
ASIAN JOURNAL OF CONTROL, 2021, 23 (02) :1097-1106
[8]  
Hall G., 2019, ENG BUSINESS APPL SU
[9]  
Henrion D., 2005, Positive Polynomials in Control, V312
[10]   Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems [J].
Jiang, Yu ;
Jiang, Zhong-Ping .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (11) :2917-2929