Discounted Stable Adaptive Critic Design for Zero-Sum Games With Application Verifications

Cited by: 0

Authors:

Ren, Jin [1,2]

Wang, Ding [1,2]

Li, Menghua [1,2]

Qiao, Junfei [1,2]

Affiliations:

[1] Beijing Univ Technol, Sch Informat Sci & Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China

[2] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China

Source:

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2025, Vol. 22

Funding:

Beijing Natural Science Foundation; National Natural Science Foundation of China

Keywords:

Games; Game theory; Stability criteria; Asymptotic stability; Optimal control; Power system stability; Dynamic programming; Heuristic algorithms; Cost function; Nonlinear systems; Adaptive critic designs; adaptive dynamic programming; discounted value iteration; stability; zero-sum games; STABILITY ANALYSIS; ITERATION; SYSTEMS;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

This paper develops an adaptive critic design with a performance guarantee, based on the discounted value iteration algorithm, to solve the optimal regulation problem for discrete-time zero-sum games. Value iteration is used to obtain approximate optimal solutions of the Hamilton-Jacobi-Isaacs equation for nonlinear systems and of the game algebraic Riccati equation for linear systems. The analysis then focuses on how the discount factor affects system stability and on the admissibility of the policy pairs generated during value iteration. An appropriate selection range for the discount factor and criteria for ensuring system stability are established to help obtain a stabilizing optimal policy pair, which not only makes the cost function converge to the optimal value but also guarantees asymptotic stability of the closed-loop system. Finally, practical examples on a power system and a ball-beam system demonstrate the effectiveness of the presented method.
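For the linear case mentioned in the abstract, the idea of discounted value iteration on a game Riccati recursion can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the stage cost x'Qx + u'Ru - theta2 * w'w, the function name `discounted_game_value_iteration`, the scalar system, and the discount factor 0.95. The final spectral-radius check is only a numerical test of closed-loop stability for this one example, not the stability criteria the authors derive.

```python
import numpy as np

def discounted_game_value_iteration(A, B, D, Q, R, theta2, gamma,
                                    iters=200, tol=1e-10):
    """Value iteration for x_{k+1} = A x + B u + D w with discounted cost
    sum_k gamma^k (x'Qx + u'Ru - theta2 * w'w), u minimizing, w maximizing.
    The cost structure is an assumed, illustrative choice."""
    n, m, q = A.shape[0], B.shape[1], D.shape[1]
    G = np.hstack([B, D])                      # stacked input matrix [B D]
    Lam = np.zeros((m + q, m + q))             # blkdiag(R, -theta2 * I)
    Lam[:m, :m] = R
    Lam[m:, m:] = -theta2 * np.eye(q)

    def gain(P):
        # Stationary point of the quadratic in z = [u; w]: z = K x
        M = Lam + gamma * G.T @ P @ G
        return -gamma * np.linalg.solve(M, G.T @ P @ A)

    P = np.zeros((n, n))                       # zero initial value function
    for _ in range(iters):
        K = gain(P)
        P_next = Q + gamma * A.T @ P @ A + gamma * A.T @ P @ G @ K
        if np.max(np.abs(P_next - P)) < tol:
            P = P_next
            break
        P = P_next
    return P, gain(P), G

# Hypothetical scalar example (not from the paper).
A = np.array([[0.9]]); B = np.array([[1.0]]); D = np.array([[0.5]])
Q = np.array([[1.0]]); R = np.array([[1.0]])
P, K, G = discounted_game_value_iteration(A, B, D, Q, R, theta2=5.0, gamma=0.95)

# Numerical stability check of the undiscounted closed loop x_{k+1} = (A + G K) x.
rho = np.max(np.abs(np.linalg.eigvals(A + G @ K)))
print(P, rho)
```

In this sketch the discount factor enters both the recursion and the gain, so a different `gamma` yields a different policy pair; whether that pair stabilizes the undiscounted closed loop has to be checked separately, which is the kind of question the abstract says the paper addresses.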

Pages: 11706-11716

Page count: 11

