Discounted Stable Adaptive Critic Design for Zero-Sum Games With Application Verifications

被引:0
作者
Ren, Jin [1 ,2 ]
Wang, Ding [1 ,2 ]
Li, Menghua [1 ,2 ]
Qiao, Junfei [1 ,2 ]
机构
[1] Beijing Univ Technol, Sch Informat Sci & Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Games; Game theory; Stability criteria; Asymptotic stability; Optimal control; Power system stability; Dynamic programming; Heuristic algorithms; Cost function; Nonlinear systems; Adaptive critic designs; adaptive dynamic programming; discounted value iteration; stability; zero-sum games; STABILITY ANALYSIS; ITERATION; SYSTEMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an adaptive critic design with performance guarantee is established based on the discounted value iteration algorithm to settle with the optimal regulation problem for discrete-time zero-sum games. Value iteration is implemented to obtain the approximate optimal solutions to the Hamilton-Jacobi-Isaacs equation for nonlinear systems and the game algebraic Riccati equation for linear systems. Then, we focus on system stability affected by the introduction of the discount factor and the admissibility of the policy pairs in the value iteration process. The appropriate selection range of the discount factor and the criteria for ensuring system stability are established to assist in obtaining the stabilized optimal policy pair, which not only makes the cost function converge to the optimal value, but also guarantees the asymptotic stability of the closed-loop system. Finally, practical examples for the power system and the ball-beam system are conducted to demonstrate the effectiveness of the presented method.
引用
收藏
页码:11706 / 11716
页数:11
相关论文
共 43 条
  • [1] Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
    Al-Tamimi, Asma
    Lewis, Frank L.
    Abu-Khalaf, Murad
    [J]. AUTOMATICA, 2007, 43 (03) : 473 - 481
  • [2] Value and Policy Iterations in Optimal Control and Adaptive Dynamic Programming
    Bertsekas, Dimitri P.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (03) : 500 - 509
  • [3] Stability and almost disturbance decoupling analysis of nonlinear system subject to feedback linearization and feedforward neural network controller
    Chien, Ting-Li
    Chen, Chung-Cheng
    Huang, Yi-Chieh
    Lin, Wen-Jiun
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (07): : 1220 - 1230
  • [4] An Optimal Synchronization Control Method of PLL Utilizing Adaptive Dynamic Programming to Synchronize Inverter-Based Resources With Unbalanced, Low-Inertia, and Very Weak Grids
    Davari, Masoud
    Gao, Weinan
    Aghazadeh, Amir
    Blaabjerg, Frede
    Lewis, Frank L.
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 24 - 42
  • [5] Novel Discounted Adaptive Critic Control Designs With Accelerated Learning Formulation
    Ha, Mingming
    Wang, Ding
    Liu, Derong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (05) : 3003 - 3016
  • [6] Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control
    Ha, Mingming
    Wang, Ding
    Liu, Derong
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (07) : 1262 - 1272
  • [7] A Novel Value Iteration Scheme With Adjustable Convergence Rate
    Ha, Mingming
    Wang, Ding
    Liu, Derong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 7430 - 7442
  • [8] Offline and Online Adaptive Critic Control Designs With Stability Guarantee Through Value Iteration
    Ha, Mingming
    Wang, Ding
    Liu, Derong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (12) : 13262 - 13274
  • [9] Generalized value iteration for discounted optimal control with stability analysis
    Ha, Mingming
    Wang, Ding
    Liu, Derong
    [J]. SYSTEMS & CONTROL LETTERS, 2021, 147 (147)
  • [10] Stability Analysis of Optimal Adaptive Control Using Value Iteration With Approximation Errors
    Heydari, Ali
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2018, 63 (09) : 3119 - 3126