Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games

被引：19

作者：

Zhao, Jun ^{[1
]}

Lv, Yongfeng ^{[2
]}

Zhao, Ziliang ^{[3
]}

机构：

[1] Shandong Univ Sci & Technol, Coll Mech & Elect Engn, Qingdao 266590, Peoples R China

[2] Taiyuan Univ Technol, Coll Elect & Power Engn, Taiyuan 030024, Peoples R China

[3] Shandong Univ Sci & Technol, Coll Transportat, Qingdao 266590, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS | 2022年 / 69卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Games; Optimal control; Adaptive learning; Game theory; Cost function; Observers; Estimation error; Output-feedback optimal control; adaptive learning; zero-sum games; SYSTEMS;

D O I：

10.1109/TCSII.2021.3112050

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Although optimal control with full state-feedback has been well studied, online solving output-feedback optimal control problem is difficult, in particular for learning online Nash equilibrium solution of the continuous-time (CT) two-player zero-sum differential games. For this purpose, we propose an adaptive learning algorithm to address this trick problem. A modified game algebraic Riccati equation (MGARE) is derived by tailoring its state-feedback control counterpart. An adaptive online learning method is proposed to approximate the solution to the MGARE through online data, where two operations (i.e., vectorization and Kronecker's product) can be adopted to reconstruct the MGARE. Only system output information is needed to implement developed learning algorithm. Simulation results are carried out to exemplify the proposed control and learning method.

引用

页码：1437 / 1441

页数：5

共 50 条

[31] Neural Network Adaptive Output-Feedback Optimal Control for Active Suspension Systems
Li, Yongming
Wang, Tiechao
Liu, Wei
Tong, Shaocheng
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (06): : 4021 - 4032
[32] Adaptive critic design for nonlinear multi-player zero-sum games with unknown dynamics and control constraints
Huo, Yu
Wang, Ding
Qiao, Junfei
Li, Menghua
NONLINEAR DYNAMICS, 2023, 111 (12) : 11671 - 11683
[33] Finite-Horizon Near Optimal Design of Nonlinear Two-Player Zero-Sum Game in Presence of Completely Unknown Dynamics
Xu, Hao
JOURNAL OF CONTROL AUTOMATION AND ELECTRICAL SYSTEMS, 2015, 26 (04) : 361 - 370
[34] Output-feedback Robust Tracking Control of Uncertain Systems via Adaptive Learning
Jun Zhao
Yongfeng Lv
International Journal of Control, Automation and Systems, 2023, 21 : 1108 - 1118
[35] Output-feedback Robust Tracking Control of Uncertain Systems via Adaptive Learning
Zhao, Jun
Lv, Yongfeng
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (04) : 1108 - 1118
[36] OPTIMAL CONTROL AND ZERO-SUM GAMES FOR MARKOV CHAINS OF MEAN-FIELD TYPE
Choutri, Salah Eddine
Djehiche, Boualem
Tembine, Hamidou
MATHEMATICAL CONTROL AND RELATED FIELDS, 2019, 9 (03) : 571 - 605
[37] Output feedback Q-learning for discrete-time linear zero-sum games with application to the H-infinity control
Rizvi, Syed Ali Asad
Lin, Zongli
AUTOMATICA, 2018, 95 : 213 - 221
[38] Nearly Optimal Control for Mixed Zero-Sum Game Based on Off-Policy Integral Reinforcement Learning
Song, Ruizhuo
Yang, Gaofu
Lewis, Frank L.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 2793 - 2804
[39] The Design of ϵ-Optimal Strategy for Two-Person Zero-Sum Markov Games
Xie, Kaiyun
Xiong, Junlin
IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 2349 - 2354
[40] Data-based discrete-time two-player zero-sum delayed game via policy iteration Q-learning Method
Jiang, Zongyang
Zhang, Haiying
Xiao, Yu
NEUROCOMPUTING, 2025, 631

← 1 2 3 4 5 →