Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games

被引:19
|
作者
Zhao, Jun [1 ]
Lv, Yongfeng [2 ]
Zhao, Ziliang [3 ]
机构
[1] Shandong Univ Sci & Technol, Coll Mech & Elect Engn, Qingdao 266590, Peoples R China
[2] Taiyuan Univ Technol, Coll Elect & Power Engn, Taiyuan 030024, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Transportat, Qingdao 266590, Peoples R China
基金
中国国家自然科学基金;
关键词
Games; Optimal control; Adaptive learning; Game theory; Cost function; Observers; Estimation error; Output-feedback optimal control; adaptive learning; zero-sum games; SYSTEMS;
D O I
10.1109/TCSII.2021.3112050
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Although optimal control with full state-feedback has been well studied, online solving output-feedback optimal control problem is difficult, in particular for learning online Nash equilibrium solution of the continuous-time (CT) two-player zero-sum differential games. For this purpose, we propose an adaptive learning algorithm to address this trick problem. A modified game algebraic Riccati equation (MGARE) is derived by tailoring its state-feedback control counterpart. An adaptive online learning method is proposed to approximate the solution to the MGARE through online data, where two operations (i.e., vectorization and Kronecker's product) can be adopted to reconstruct the MGARE. Only system output information is needed to implement developed learning algorithm. Simulation results are carried out to exemplify the proposed control and learning method.
引用
收藏
页码:1437 / 1441
页数:5
相关论文
共 50 条
  • [31] Neural Network Adaptive Output-Feedback Optimal Control for Active Suspension Systems
    Li, Yongming
    Wang, Tiechao
    Liu, Wei
    Tong, Shaocheng
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (06): : 4021 - 4032
  • [32] Adaptive critic design for nonlinear multi-player zero-sum games with unknown dynamics and control constraints
    Huo, Yu
    Wang, Ding
    Qiao, Junfei
    Li, Menghua
    NONLINEAR DYNAMICS, 2023, 111 (12) : 11671 - 11683
  • [33] Finite-Horizon Near Optimal Design of Nonlinear Two-Player Zero-Sum Game in Presence of Completely Unknown Dynamics
    Xu, Hao
    JOURNAL OF CONTROL AUTOMATION AND ELECTRICAL SYSTEMS, 2015, 26 (04) : 361 - 370
  • [34] Output-feedback Robust Tracking Control of Uncertain Systems via Adaptive Learning
    Jun Zhao
    Yongfeng Lv
    International Journal of Control, Automation and Systems, 2023, 21 : 1108 - 1118
  • [35] Output-feedback Robust Tracking Control of Uncertain Systems via Adaptive Learning
    Zhao, Jun
    Lv, Yongfeng
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (04) : 1108 - 1118
  • [36] OPTIMAL CONTROL AND ZERO-SUM GAMES FOR MARKOV CHAINS OF MEAN-FIELD TYPE
    Choutri, Salah Eddine
    Djehiche, Boualem
    Tembine, Hamidou
    MATHEMATICAL CONTROL AND RELATED FIELDS, 2019, 9 (03) : 571 - 605
  • [37] Output feedback Q-learning for discrete-time linear zero-sum games with application to the H-infinity control
    Rizvi, Syed Ali Asad
    Lin, Zongli
    AUTOMATICA, 2018, 95 : 213 - 221
  • [38] Nearly Optimal Control for Mixed Zero-Sum Game Based on Off-Policy Integral Reinforcement Learning
    Song, Ruizhuo
    Yang, Gaofu
    Lewis, Frank L.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 2793 - 2804
  • [39] The Design of ϵ-Optimal Strategy for Two-Person Zero-Sum Markov Games
    Xie, Kaiyun
    Xiong, Junlin
    IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 2349 - 2354
  • [40] Data-based discrete-time two-player zero-sum delayed game via policy iteration Q-learning Method
    Jiang, Zongyang
    Zhang, Haiying
    Xiao, Yu
    NEUROCOMPUTING, 2025, 631