Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games

被引:19
|
作者
Zhao, Jun [1 ]
Lv, Yongfeng [2 ]
Zhao, Ziliang [3 ]
机构
[1] Shandong Univ Sci & Technol, Coll Mech & Elect Engn, Qingdao 266590, Peoples R China
[2] Taiyuan Univ Technol, Coll Elect & Power Engn, Taiyuan 030024, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Transportat, Qingdao 266590, Peoples R China
基金
中国国家自然科学基金;
关键词
Games; Optimal control; Adaptive learning; Game theory; Cost function; Observers; Estimation error; Output-feedback optimal control; adaptive learning; zero-sum games; SYSTEMS;
D O I
10.1109/TCSII.2021.3112050
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Although optimal control with full state-feedback has been well studied, online solving output-feedback optimal control problem is difficult, in particular for learning online Nash equilibrium solution of the continuous-time (CT) two-player zero-sum differential games. For this purpose, we propose an adaptive learning algorithm to address this trick problem. A modified game algebraic Riccati equation (MGARE) is derived by tailoring its state-feedback control counterpart. An adaptive online learning method is proposed to approximate the solution to the MGARE through online data, where two operations (i.e., vectorization and Kronecker's product) can be adopted to reconstruct the MGARE. Only system output information is needed to implement developed learning algorithm. Simulation results are carried out to exemplify the proposed control and learning method.
引用
收藏
页码:1437 / 1441
页数:5
相关论文
共 50 条
  • [21] Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
    Vamvoudakis, Kyriakos G.
    Lewis, F. L.
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2012, 22 (13) : 1460 - 1483
  • [22] Primal-Dual Reinforcement Learning for Zero-Sum Games in the Optimal Tracking Control
    Que, Xuejie
    Wang, Zhenlei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (06) : 3146 - 3150
  • [23] Model-free policy iteration optimal control of fuzzy systems via a two-player zero-sum game
    Deng, Yifan
    Wu, Wei
    Tong, Shaocheng
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2025,
  • [24] Learning nonlinear robust control as a data-driven zero-sum two-player game for an active suspension system
    Radac, Mircea-Bogdan
    Lala, Timotei
    IFAC PAPERSONLINE, 2020, 53 (02): : 8057 - 8062
  • [25] Robust Optimal Control for Disturbed Nonlinear Zero-Sum Differential Games Based on Single NN and Least Squares
    Song, Ruizhuo
    Li, Junsong
    Lewis, Frank L.
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4009 - 4019
  • [26] Adaptive Dynamic Programming Algorithm for Finding Online the Equilibrium Solution of the Two-Player Zero-Sum Differential Game
    Vrabie, Draguna
    Lewis, Frank
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [27] A PROBABILISTIC VERIFICATION THEOREM FOR THE FINITE HORIZON TWO-PLAYER ZERO-SUM OPTIMAL SWITCHING GAME IN CONTINUOUS TIME
    Hamadene, S.
    Martyr, R.
    Moriarty, J.
    ADVANCES IN APPLIED PROBABILITY, 2019, 51 (02) : 425 - 442
  • [28] Output feedback Q-learning for discrete-time finite-horizon zero-sum games with application to the H? control
    Liu, Mingxiang
    Cai, Qianqian
    Li, Dandan
    Meng, Wei
    Fu, Minyue
    NEUROCOMPUTING, 2023, 529 : 48 - 55
  • [29] Adaptive critic control with multi-step policy evaluation for nonlinear zero-sum games
    Li, Xin
    Wang, Ding
    Wang, Jiangyu
    Qiao, Junfei
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (01) : 551 - 566
  • [30] Nonlinear Two-Player Zero-Sum Game Approximate Solution Using a Policy Iteration Algorithm
    Johnson, M.
    Bhasin, S.
    Dixon, W. E.
    2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 142 - 147