An iterative Q-learning scheme for the global stabilization of discrete-time linear systems subject to actuator saturation

被引：16

作者：

Rizvi, Syed Ali Asad ^{[1
]}

Lin, Zongli ^{[1
]}

机构：

[1] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, Charlottesville, VA 22904 USA

来源：

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL | 2019年 / 29卷 / 09期

关键词：

actuator saturation; constrained control; Q-learning; reinforcement learning; Riccati equation; SEMIGLOBAL EXPONENTIAL STABILIZATION; INPUT SATURATION; OPTIMAL TRACKING;

D O I：

10.1002/rnc.4514

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we propose a model-free algorithm for global stabilization of linear systems subject to actuator saturation. The idea of gain-scheduled low gain feedback is applied to develop control laws that avoid saturation and achieve global stabilization. To design these control laws, we employ the framework of parameterized algebraic Riccati equations (AREs). Reinforcement learning techniques are developed to find the solution of the parameterized ARE without requiring any knowledge of the system dynamics. In particular, we present an iterative Q-learning scheme that searches for a low gain parameter and iteratively solves the parameterized ARE using the Bellman equation. Both state feedback and output feedback algorithms are developed. It is shown that the proposed scheme achieves model-free global stabilization under bounded controls and convergence to the optimal solution of the ARE is achieved. Simulation results are presented that confirm the effectiveness of the proposed method.

引用

页码：2660 / 2672

页数：13

共 50 条

[1] Finite gain stabilization of discrete-time linear systems subject to actuator saturation
Bao, XY
Lin, ZL
Sontag, ED
AUTOMATICA, 2000, 36 (02) : 269 - 277
[2] Improved Q-Learning Method for Linear Discrete-Time Systems
Chen, Jian
Wang, Jinhua
Huang, Jie
PROCESSES, 2020, 8 (03)
[3] Model-Free Global Stabilization of Discrete-time Linear Systems with Saturating Actuators Using Reinforcement Learning
Rizvi, Syed Ali Asad
Lin, Zongli
2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 5276 - 5281
[4] Stabilization with decay rate analysis for discrete-time linear systems subject to actuator saturation
Ma, Yong-Mei
Yang, Guang-Hong
2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 1887 - 1892
[5] Simultaneous global external and internal stabilization of linear time-invariant discrete-time systems subject to actuator saturation
Wang, Xu
Saberi, Ali
Stoorvogel, Anton A.
Sannuti, Peddapullaiah
AUTOMATICA, 2012, 48 (05) : 699 - 711
[6] Simultaneous global external and internal stabilization of linear time-invariant discrete-time systems subject to actuator saturation"
Wang, Xu
Saberi, Ali
Stoorvogel, Anton A.
Sannuti, Peddapullaiah
2011 AMERICAN CONTROL CONFERENCE, 2011, : 3808 - 3812
[7] Analysis and design for discrete-time linear systems subject to actuator saturation
Hu, TS
Lin, ZL
Chen, BM
SYSTEMS & CONTROL LETTERS, 2002, 45 (02) : 97 - 112
[8] Performance analysis for linear discrete-time systems subject to actuator saturation
Ma, Yong-Mei
Yang, Guang-Hong
2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 3608 - 3613
[9] Stability analysis for linear discrete-time systems subject to actuator saturation
Yongmei MA 1
2.College of Information Science and Engineering
3.Key Laboratory of Integrated Automation of Process Industry (Ministry of Education)
Control Theory and Technology, 2010, 8 (02) : 245 - 248
[10] Analysis and design for discrete-time linear systems subject to actuator saturation
Hu, TS
Lin, ZL
Chen, BM
PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 4675 - 4680

← 1 2 3 4 5 →