Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games

被引:19
|
作者
Zhao, Jun [1 ]
Lv, Yongfeng [2 ]
Zhao, Ziliang [3 ]
机构
[1] Shandong Univ Sci & Technol, Coll Mech & Elect Engn, Qingdao 266590, Peoples R China
[2] Taiyuan Univ Technol, Coll Elect & Power Engn, Taiyuan 030024, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Transportat, Qingdao 266590, Peoples R China
基金
中国国家自然科学基金;
关键词
Games; Optimal control; Adaptive learning; Game theory; Cost function; Observers; Estimation error; Output-feedback optimal control; adaptive learning; zero-sum games; SYSTEMS;
D O I
10.1109/TCSII.2021.3112050
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Although optimal control with full state-feedback has been well studied, online solving output-feedback optimal control problem is difficult, in particular for learning online Nash equilibrium solution of the continuous-time (CT) two-player zero-sum differential games. For this purpose, we propose an adaptive learning algorithm to address this trick problem. A modified game algebraic Riccati equation (MGARE) is derived by tailoring its state-feedback control counterpart. An adaptive online learning method is proposed to approximate the solution to the MGARE through online data, where two operations (i.e., vectorization and Kronecker's product) can be adopted to reconstruct the MGARE. Only system output information is needed to implement developed learning algorithm. Simulation results are carried out to exemplify the proposed control and learning method.
引用
收藏
页码:1437 / 1441
页数:5
相关论文
共 50 条
  • [41] Multiplayer zero-sum games optimal control for modular robot manipulators with interconnected dynamic couplings
    Zhu, Xinye
    An, Tianjiao
    Dong, Bo
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2022, 36 (12) : 3254 - 3270
  • [42] Dual Event-Triggered Constrained Control Through Adaptive Critic for Discrete-Time Zero-Sum Games
    Wang, Ding
    Hu, Lingzhi
    Zhao, Mingming
    Qiao, Junfei
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (03): : 1584 - 1595
  • [43] An Approximate Control Algorithm for Zero-sum Differential Games Using Adaptive Critic Technique
    Wang, Ke
    Mu, Chaoxu
    Zhang, Yong
    Liu, Weiqiang
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 2812 - 2817
  • [44] Event-triggered optimal control scheme for discrete-time nonlinear zero-sum games
    Zhang X.
    Bo Y.-C.
    Cui L.-L.
    Zhang, Xin (zhangxin@upc.edu.cn), 2018, South China University of Technology (35): : 619 - 626
  • [45] Event-Triggered Adaptive Control for Discrete-Time Zero-Sum Games
    Wang, Ziyang
    Wei, Qinglai
    Liu, Derong
    Luo, Yanhong
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [46] Safe adaptive output-feedback optimal control of a class of linear systems
    Mahmud, S. M. Nahid
    Abudia, Moad
    Nivison, Scott A.
    Bell, Zachary I.
    Kamalapurkar, Rushikesh
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (11) : 7082 - 7095
  • [47] Iterative adaptive dynamic programming methods with neural network implementation for multi-player zero-sum games
    Jiang, He
    Zhang, Huaguang
    Han, Ji
    Zhang, Kun
    NEUROCOMPUTING, 2018, 307 : 54 - 60
  • [48] Solving two-player zero sum games with fuzzy payoffs when players have different risk attitudes
    Koca, Yesim
    Testik, Ozlem Muge
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2018, 34 (07) : 1461 - 1474
  • [49] Event-triggered adaptive dynamic programming for multi-player zero-sum games with unknown dynamics
    Zhang, Yongwei
    Zhao, Bo
    Liu, Derong
    SOFT COMPUTING, 2021, 25 (03) : 2237 - 2251
  • [50] An Improved Trajectory Tracking Mechanism With Adaptive Critic for Event-Based Multiplayer Zero-Sum Games
    Li, Menghua
    Wang, Ding
    Qiao, Junfei
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 7193 - 7204