Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games

被引：19

作者：

Zhao, Jun ^{[1
]}

Lv, Yongfeng ^{[2
]}

Zhao, Ziliang ^{[3
]}

机构：

[1] Shandong Univ Sci & Technol, Coll Mech & Elect Engn, Qingdao 266590, Peoples R China

[2] Taiyuan Univ Technol, Coll Elect & Power Engn, Taiyuan 030024, Peoples R China

[3] Shandong Univ Sci & Technol, Coll Transportat, Qingdao 266590, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS | 2022年 / 69卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Games; Optimal control; Adaptive learning; Game theory; Cost function; Observers; Estimation error; Output-feedback optimal control; adaptive learning; zero-sum games; SYSTEMS;

D O I：

10.1109/TCSII.2021.3112050

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Although optimal control with full state-feedback has been well studied, online solving output-feedback optimal control problem is difficult, in particular for learning online Nash equilibrium solution of the continuous-time (CT) two-player zero-sum differential games. For this purpose, we propose an adaptive learning algorithm to address this trick problem. A modified game algebraic Riccati equation (MGARE) is derived by tailoring its state-feedback control counterpart. An adaptive online learning method is proposed to approximate the solution to the MGARE through online data, where two operations (i.e., vectorization and Kronecker's product) can be adopted to reconstruct the MGARE. Only system output information is needed to implement developed learning algorithm. Simulation results are carried out to exemplify the proposed control and learning method.

引用

页码：1437 / 1441

页数：5

共 50 条

[41] Multiplayer zero-sum games optimal control for modular robot manipulators with interconnected dynamic couplings
Zhu, Xinye
An, Tianjiao
Dong, Bo
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2022, 36 (12) : 3254 - 3270
[42] Dual Event-Triggered Constrained Control Through Adaptive Critic for Discrete-Time Zero-Sum Games
Wang, Ding
Hu, Lingzhi
Zhao, Mingming
Qiao, Junfei
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (03): : 1584 - 1595
[43] An Approximate Control Algorithm for Zero-sum Differential Games Using Adaptive Critic Technique
Wang, Ke
Mu, Chaoxu
Zhang, Yong
Liu, Weiqiang
2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 2812 - 2817
[44] Event-triggered optimal control scheme for discrete-time nonlinear zero-sum games
Zhang X.
Bo Y.-C.
Cui L.-L.
Zhang, Xin (zhangxin@upc.edu.cn), 2018, South China University of Technology (35): : 619 - 626
[45] Event-Triggered Adaptive Control for Discrete-Time Zero-Sum Games
Wang, Ziyang
Wei, Qinglai
Liu, Derong
Luo, Yanhong
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[46] Safe adaptive output-feedback optimal control of a class of linear systems
Mahmud, S. M. Nahid
Abudia, Moad
Nivison, Scott A.
Bell, Zachary I.
Kamalapurkar, Rushikesh
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (11) : 7082 - 7095
[47] Iterative adaptive dynamic programming methods with neural network implementation for multi-player zero-sum games
Jiang, He
Zhang, Huaguang
Han, Ji
Zhang, Kun
NEUROCOMPUTING, 2018, 307 : 54 - 60
[48] Solving two-player zero sum games with fuzzy payoffs when players have different risk attitudes
Koca, Yesim
Testik, Ozlem Muge
QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2018, 34 (07) : 1461 - 1474
[49] Event-triggered adaptive dynamic programming for multi-player zero-sum games with unknown dynamics
Zhang, Yongwei
Zhao, Bo
Liu, Derong
SOFT COMPUTING, 2021, 25 (03) : 2237 - 2251
[50] An Improved Trajectory Tracking Mechanism With Adaptive Critic for Event-Based Multiplayer Zero-Sum Games
Li, Menghua
Wang, Ding
Qiao, Junfei
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 7193 - 7204

← 1 2 3 4 5 →