Adaptive Dynamic Programming for Model-Free Global Stabilization of Control Constrained Continuous-Time Systems

被引:29
作者
Rizvi, Syed Ali Asad [1 ]
Lin, Zongli [1 ]
机构
[1] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, Charlottesville, VA 22904 USA
关键词
Actuators; Feedback control; System dynamics; Mathematical model; Heuristic algorithms; Stability analysis; Asymptotic stability; Actuator saturation; adaptive dynamic programming (ADP); constrained control; iterative learning; SEMIGLOBAL EXPONENTIAL STABILIZATION; LINEAR-SYSTEMS; NONLINEAR-SYSTEMS; INPUT SATURATION; VALUE-ITERATION; SUBJECT; DESIGN;
D O I
10.1109/TCYB.2020.2989419
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article addresses the problem of global stabilization of continuous-time linear systems subject to control constraints using a model-free approach. We propose a gain-scheduled low-gain feedback scheme that prevents saturation from occurring and achieves global stabilization. The framework of parameterized algebraic Riccati equations (AREs) is employed to design the low-gain feedback control laws. An adaptive dynamic programming (ADP) method is presented to find the solution of the parameterized ARE without requiring the knowledge of the system dynamics. In particular, we present an iterative ADP algorithm that searches for an appropriate value of the low-gain parameter and iteratively solves the parameterized ADP Bellman equation. We present both state feedback and output feedback algorithms. The closed-loop stability and the convergence of the algorithm to the nominal solution of the parameterized ARE are shown. The simulation results validate the effectiveness of the proposed scheme.
引用
收藏
页码:1048 / 1060
页数:13
相关论文
共 31 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]  
Bertsekas Dimitri P., 2011, Dynamic Programming and Optimal Control, VII
[3]   Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design [J].
Bian, Tao ;
Jiang, Zhong-Ping .
AUTOMATICA, 2016, 71 :348-360
[4]   Event-Triggered Adaptive Dynamic Programming for Continuous-Time Systems With Control Constraints [J].
Dong, Lu ;
Zhong, Xiangnan ;
Sun, Changyin ;
He, Haibo .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (08) :1941-1952
[5]   IN-LARGE STABILITY OF RELAY AND SATURATING CONTROL SYSTEMS WITH LINEAR CONTROLLERS [J].
FULLER, AT .
INTERNATIONAL JOURNAL OF CONTROL, 1969, 10 (04) :457-&
[6]   Data-Driven Adaptive Optimal Control of Connected Vehicles [J].
Gao, Weinan ;
Jiang, Zhong-Ping ;
Ozbay, Kaan .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (05) :1122-1133
[7]   Adaptive Dynamic Programming and Adaptive Optimal Output Regulation of Linear Systems [J].
Gao, Weinan ;
Jiang, Zhong-Ping .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2016, 61 (12) :4164-4169
[8]   Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics [J].
Jiang, Yu ;
Jiang, Zhong-Ping .
AUTOMATICA, 2012, 48 (10) :2699-2704
[9]   Reinforcement Learning and Feedback Control USING NATURAL DECISION METHODS TO DESIGN OPTIMAL ADAPTIVE CONTROLLERS [J].
Lewis, Frank L. ;
Vrabie, Draguna ;
Vamvoudakis, Kyriakos G. .
IEEE CONTROL SYSTEMS MAGAZINE, 2012, 32 (06) :76-105
[10]  
Lin X., 2012, P IEEE UKACC INT C C, P58