Model-free finite-horizon optimal control of discrete-time two-player zero-sum games

被引:3
|
作者
Wang, Wei [1 ,2 ,3 ]
Chen, Xin [2 ,3 ,4 ]
Du, Jianhua [1 ]
机构
[1] Zhongnan Univ Econ & Law, Sch Informat & Safety Engn, Wuhan, Peoples R China
[2] Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan, Peoples R China
[3] Minist Educ, Engn Res Ctr Intelligent Technol Geoexplorat, Wuhan, Peoples R China
[4] China Univ Geosci, Sch Automat, Wuhan, Peoples R China
关键词
Q-function; finite horizon; optimal control; zero-sum game; H-INFINITY-CONTROL; DIFFERENTIAL-GAMES; SYSTEMS;
D O I
10.1080/00207721.2022.2111236
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventionally, as the system dynamics is known, the finite-horizon optimal control of zero-sum games relies on solving the time-varying Riccati equations. In this paper, with unknown system dynamics being considered, a Q-function-based finite-horizon control method is introduced to approximate the solutions of the time-varying Riccati equations. First, a time-varying Q-function explicitly dependent on the time-varying control and disturbance is defined. Then the defined time-varying Q-function is utilised to represent the time-varying control and disturbance which are equivalent to the solutions of the time-varying Riccati equations by relaxing the system dynamics. Finally, a model-free method is introduced to approximate the defined time-varying Q-function. Simulation studies are conducted to demonstrate the validity of the developed method.
引用
收藏
页码:167 / 179
页数:13
相关论文
共 50 条
  • [41] Structure in the Value Function of Two-Player Zero-Sum Games of Incomplete Information
    Wiggers, Auke J.
    Oliehoek, Frans A.
    Roijers, Diederik M.
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1628 - 1629
  • [42] Stability analysis of discrete-time finite-horizon discounted optimal control
    Granzotto, Mathieu
    Postoyan, Romain
    Busoniu, Lucian
    Nesic, Dragan
    Daafouz, Jamal
    2018 IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2018, : 2322 - 2327
  • [43] Finite-Horizon Optimal Control of Discrete-Time Switched Linear Systems
    Zhu, Qixin
    Xie, Guangming
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2012, 2012
  • [44] Zero-sum Two-player Game Theoretic Formulation of Affine Nonlinear Discrete-time Systems Using Neural Networks
    Mehraeen, S.
    Dierks, T.
    Jagannathan, S.
    Crow, M. L.
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [45] Zero-sum two-player game theoretic formulation of affine nonlinear discrete-time systems using neural networks
    Department of Electrical and Computer Engineering, Missouri University of Science and Technology, 1870 Miner Circle, Rolla, MO 65409, United States
    Proc Int Jt Conf Neural Networks, 2010,
  • [46] Zero-Sum Two-Player Game Theoretic Formulation of Affine Nonlinear Discrete-Time Systems Using Neural Networks
    Mehraeen, Shahab
    Dierks, Travis
    Jagannathan, S.
    Crow, Mariesa L.
    IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (06) : 1641 - 1655
  • [47] Finite-horizon inverse optimal control for discrete-time nonlinear systems
    Molloy, Timothy L.
    Ford, Jason J.
    Perez, Tristan
    AUTOMATICA, 2018, 87 : 442 - 446
  • [48] Two-player nonzero-sum and zero-sum games subject to stochastic noncausal systems
    Chen, Xin
    Zhang, Zeyu
    Zhang, Yijia
    Yuan, Dongmei
    INTERNATIONAL JOURNAL OF CONTROL, 2025,
  • [49] Event-Triggered Adaptive Control for Discrete-Time Zero-Sum Games
    Wang, Ziyang
    Wei, Qinglai
    Liu, Derong
    Luo, Yanhong
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [50] Differential dynamic programming for finite-horizon zero-sum differential games of nonlinear systems
    Zhang, Bin
    Jia, Yingmin
    Zhang, Yuqi
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (18) : 11062 - 11084