An iterative Q-learning scheme for the global stabilization of discrete-time linear systems subject to actuator saturation

被引:16
作者
Rizvi, Syed Ali Asad [1 ]
Lin, Zongli [1 ]
机构
[1] Univ Virginia, Charles L Brown Dept Elect & Comp Engn, Charlottesville, VA 22904 USA
关键词
actuator saturation; constrained control; Q-learning; reinforcement learning; Riccati equation; SEMIGLOBAL EXPONENTIAL STABILIZATION; INPUT SATURATION; OPTIMAL TRACKING;
D O I
10.1002/rnc.4514
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a model-free algorithm for global stabilization of linear systems subject to actuator saturation. The idea of gain-scheduled low gain feedback is applied to develop control laws that avoid saturation and achieve global stabilization. To design these control laws, we employ the framework of parameterized algebraic Riccati equations (AREs). Reinforcement learning techniques are developed to find the solution of the parameterized ARE without requiring any knowledge of the system dynamics. In particular, we present an iterative Q-learning scheme that searches for a low gain parameter and iteratively solves the parameterized ARE using the Bellman equation. Both state feedback and output feedback algorithms are developed. It is shown that the proposed scheme achieves model-free global stabilization under bounded controls and convergence to the optimal solution of the ARE is achieved. Simulation results are presented that confirm the effectiveness of the proposed method.
引用
收藏
页码:2660 / 2672
页数:13
相关论文
共 50 条
  • [1] Finite gain stabilization of discrete-time linear systems subject to actuator saturation
    Bao, XY
    Lin, ZL
    Sontag, ED
    AUTOMATICA, 2000, 36 (02) : 269 - 277
  • [2] Improved Q-Learning Method for Linear Discrete-Time Systems
    Chen, Jian
    Wang, Jinhua
    Huang, Jie
    PROCESSES, 2020, 8 (03)
  • [3] Model-Free Global Stabilization of Discrete-time Linear Systems with Saturating Actuators Using Reinforcement Learning
    Rizvi, Syed Ali Asad
    Lin, Zongli
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 5276 - 5281
  • [4] Stabilization with decay rate analysis for discrete-time linear systems subject to actuator saturation
    Ma, Yong-Mei
    Yang, Guang-Hong
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 1887 - 1892
  • [5] Simultaneous global external and internal stabilization of linear time-invariant discrete-time systems subject to actuator saturation
    Wang, Xu
    Saberi, Ali
    Stoorvogel, Anton A.
    Sannuti, Peddapullaiah
    AUTOMATICA, 2012, 48 (05) : 699 - 711
  • [6] Simultaneous global external and internal stabilization of linear time-invariant discrete-time systems subject to actuator saturation"
    Wang, Xu
    Saberi, Ali
    Stoorvogel, Anton A.
    Sannuti, Peddapullaiah
    2011 AMERICAN CONTROL CONFERENCE, 2011, : 3808 - 3812
  • [7] Analysis and design for discrete-time linear systems subject to actuator saturation
    Hu, TS
    Lin, ZL
    Chen, BM
    SYSTEMS & CONTROL LETTERS, 2002, 45 (02) : 97 - 112
  • [8] Performance analysis for linear discrete-time systems subject to actuator saturation
    Ma, Yong-Mei
    Yang, Guang-Hong
    2008 AMERICAN CONTROL CONFERENCE, VOLS 1-12, 2008, : 3608 - 3613
  • [9] Stability analysis for linear discrete-time systems subject to actuator saturation
    Yongmei MA 1
    2.College of Information Science and Engineering
    3.Key Laboratory of Integrated Automation of Process Industry (Ministry of Education)
    Control Theory and Technology, 2010, 8 (02) : 245 - 248
  • [10] Analysis and design for discrete-time linear systems subject to actuator saturation
    Hu, TS
    Lin, ZL
    Chen, BM
    PROCEEDINGS OF THE 40TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2001, : 4675 - 4680