Safety-Critical Optimal Control of Discrete-Time Non-Linear Systems via Policy Iteration-Based Q-Learning

被引：0

作者：

Long, Lijun ^{[1
,2
]}

Liu, Xiaomei ^{[1
,2
]}

Huang, Xiaomin ^{[1
,2
]}

机构：

[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China

[2] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL | 2025年

关键词：

control barrier functions; discrete-time systems; neural networks; Q-learning; safety-critical control;

D O I：

10.1002/rnc.7809

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates the problem of safety-critical optimal control for discrete-time non-linear systems. A safety-critical control algorithm is developed based on Q-learning and an iterative adaptive dynamic programming, that is, policy iteration. Discrete-time control barrier functions (CBFs) are introduced into the utility function for guaranteeing safety, in which a novel definition of the safe set and its boundary with multiple discrete-time CBFs are given. Also, for discrete-time systems, by using multiple discrete-time CBFs, the safety-critical optimal control problem of multiple safety objectives is addressed. Meanwhile, safety, convergence, and stability of the developed algorithm are rigorously demonstrated. An effective method to obtain an initial safety-admissible control law is established. Also, the developed algorithm is implemented by building an actor-critic structure with neural networks. Finally, the effectiveness of the proposed algorithm is illustrated by three simulation examples.

引用

页数：19

共 50 条

[41] Adaptive optimal output feedback tracking control for unknown discrete-time linear systems using a combined reinforcement Q-learning and internal model method
Sun, Weijie
Zhao, Guangyue
Peng, Yunjian
IET CONTROL THEORY AND APPLICATIONS, 2019, 13 (18) : 3075 - 3086
[42] An iterative Q-learning scheme for the global stabilization of discrete-time linear systems subject to actuator saturation
Rizvi, Syed Ali Asad
Lin, Zongli
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2019, 29 (09) : 2660 - 2672
[43] Model-Free Optimal Tracking Control of Nonlinear Input-Affine Discrete-Time Systems via an Iterative Deterministic Q-Learning Algorithm
Song, Shijie
Zhu, Minglei
Dai, Xiaolin
Gong, Dawei
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 999 - 1012
[44] Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems
Lopez, Victor G.
Alsalti, Mohammad
Mueller, Matthias A.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (05) : 2922 - 2933
[45] Event-based model predictive control of discrete-time non-linear systems with external disturbances
Wang, Mengzhi
Sun, Jian
Chen, Jie
IET CONTROL THEORY AND APPLICATIONS, 2019, 13 (01) : 27 - 35
[46] An indirect adaptive control for fully feedback linearizable discrete-time non-linear systems
Rokui, MR
Khorasani, K
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 1997, 11 (08) : 665 - 680
[47] Modified general policy iteration based adaptive dynamic programming for unknown discrete-time linear systems
Jiang, Huaiyuan
Zhou, Bin
Duan, Guang-Ren
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (12) : 7149 - 7173
[48] Optimal event-triggered sliding mode control for discrete-time non-linear systems against actuator saturation
Xiong, Yongyang
Yang, Liu
Wu, Chengwei
Wu, Ligang
IET CONTROL THEORY AND APPLICATIONS, 2019, 13 (16) : 2638 - 2647
[49] Learning-Based Adaptive Optimal Output Regulation of Discrete-Time Linear Systems
Chakraborty, Sayan
Gao, Weinan
Cui, Leilei
Lewis, Frank L.
Jiang, Zhong-Ping
IFAC PAPERSONLINE, 2023, 56 (02): : 10283 - 10288
[50] An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems
Alsalti, Mohammad
Lopez, Victor G.
Mueller, Matthias A.
6TH ANNUAL LEARNING FOR DYNAMICS & CONTROL CONFERENCE, 2024, 242 : 312 - 323

← 1 2 3 4 5 →