Safety-Critical Optimal Control of Discrete-Time Non-Linear Systems via Policy Iteration-Based Q-Learning

被引:0
作者
Long, Lijun [1 ,2 ]
Liu, Xiaomei [1 ,2 ]
Huang, Xiaomin [1 ,2 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China
[2] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang, Peoples R China
关键词
control barrier functions; discrete-time systems; neural networks; Q-learning; safety-critical control;
D O I
10.1002/rnc.7809
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates the problem of safety-critical optimal control for discrete-time non-linear systems. A safety-critical control algorithm is developed based on Q-learning and an iterative adaptive dynamic programming, that is, policy iteration. Discrete-time control barrier functions (CBFs) are introduced into the utility function for guaranteeing safety, in which a novel definition of the safe set and its boundary with multiple discrete-time CBFs are given. Also, for discrete-time systems, by using multiple discrete-time CBFs, the safety-critical optimal control problem of multiple safety objectives is addressed. Meanwhile, safety, convergence, and stability of the developed algorithm are rigorously demonstrated. An effective method to obtain an initial safety-admissible control law is established. Also, the developed algorithm is implemented by building an actor-critic structure with neural networks. Finally, the effectiveness of the proposed algorithm is illustrated by three simulation examples.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Adaptive optimal output feedback tracking control for unknown discrete-time linear systems using a combined reinforcement Q-learning and internal model method
    Sun, Weijie
    Zhao, Guangyue
    Peng, Yunjian
    IET CONTROL THEORY AND APPLICATIONS, 2019, 13 (18) : 3075 - 3086
  • [42] An iterative Q-learning scheme for the global stabilization of discrete-time linear systems subject to actuator saturation
    Rizvi, Syed Ali Asad
    Lin, Zongli
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2019, 29 (09) : 2660 - 2672
  • [43] Model-Free Optimal Tracking Control of Nonlinear Input-Affine Discrete-Time Systems via an Iterative Deterministic Q-Learning Algorithm
    Song, Shijie
    Zhu, Minglei
    Dai, Xiaolin
    Gong, Dawei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 999 - 1012
  • [44] Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems
    Lopez, Victor G.
    Alsalti, Mohammad
    Mueller, Matthias A.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (05) : 2922 - 2933
  • [45] Event-based model predictive control of discrete-time non-linear systems with external disturbances
    Wang, Mengzhi
    Sun, Jian
    Chen, Jie
    IET CONTROL THEORY AND APPLICATIONS, 2019, 13 (01) : 27 - 35
  • [46] An indirect adaptive control for fully feedback linearizable discrete-time non-linear systems
    Rokui, MR
    Khorasani, K
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 1997, 11 (08) : 665 - 680
  • [47] Modified general policy iteration based adaptive dynamic programming for unknown discrete-time linear systems
    Jiang, Huaiyuan
    Zhou, Bin
    Duan, Guang-Ren
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (12) : 7149 - 7173
  • [48] Optimal event-triggered sliding mode control for discrete-time non-linear systems against actuator saturation
    Xiong, Yongyang
    Yang, Liu
    Wu, Chengwei
    Wu, Ligang
    IET CONTROL THEORY AND APPLICATIONS, 2019, 13 (16) : 2638 - 2647
  • [49] Learning-Based Adaptive Optimal Output Regulation of Discrete-Time Linear Systems
    Chakraborty, Sayan
    Gao, Weinan
    Cui, Leilei
    Lewis, Frank L.
    Jiang, Zhong-Ping
    IFAC PAPERSONLINE, 2023, 56 (02): : 10283 - 10288
  • [50] An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems
    Alsalti, Mohammad
    Lopez, Victor G.
    Mueller, Matthias A.
    6TH ANNUAL LEARNING FOR DYNAMICS & CONTROL CONFERENCE, 2024, 242 : 312 - 323