Model-free finite-horizon optimal control of discrete-time two-player zero-sum games

被引:3
|
作者
Wang, Wei [1 ,2 ,3 ]
Chen, Xin [2 ,3 ,4 ]
Du, Jianhua [1 ]
机构
[1] Zhongnan Univ Econ & Law, Sch Informat & Safety Engn, Wuhan, Peoples R China
[2] Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan, Peoples R China
[3] Minist Educ, Engn Res Ctr Intelligent Technol Geoexplorat, Wuhan, Peoples R China
[4] China Univ Geosci, Sch Automat, Wuhan, Peoples R China
关键词
Q-function; finite horizon; optimal control; zero-sum game; H-INFINITY-CONTROL; DIFFERENTIAL-GAMES; SYSTEMS;
D O I
10.1080/00207721.2022.2111236
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventionally, as the system dynamics is known, the finite-horizon optimal control of zero-sum games relies on solving the time-varying Riccati equations. In this paper, with unknown system dynamics being considered, a Q-function-based finite-horizon control method is introduced to approximate the solutions of the time-varying Riccati equations. First, a time-varying Q-function explicitly dependent on the time-varying control and disturbance is defined. Then the defined time-varying Q-function is utilised to represent the time-varying control and disturbance which are equivalent to the solutions of the time-varying Riccati equations by relaxing the system dynamics. Finally, a model-free method is introduced to approximate the defined time-varying Q-function. Simulation studies are conducted to demonstrate the validity of the developed method.
引用
收藏
页码:167 / 179
页数:13
相关论文
共 50 条
  • [21] Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov Games
    Chen, Zixiang
    Zhou, Dongruo
    Gu, Quanquan
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [22] Pure strategy equilibria in symmetric two-player zero-sum games
    Peter Duersch
    Jörg Oechssler
    Burkhard C. Schipper
    International Journal of Game Theory, 2012, 41 : 553 - 564
  • [23] Optimality and Asymptotic Stability in Two-Player Zero-Sum Hybrid Games
    Leudo, Santiago J.
    Sanfelice, Ricardo G.
    HSCC 2022: PROCEEDINGS OF THE 25TH ACM INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (PART OF CPS-IOT WEEK 2022), 2022,
  • [24] Generating Dominant Strategies for Continuous Two-Player Zero-Sum Games
    Vazquez-Chanlatte, Marcell J.
    Ghosh, Shromona
    Raman, Vasumathi
    Sangiovanni-Vincentelli, Alberto
    Seshia, Sanjit A.
    IFAC PAPERSONLINE, 2018, 51 (16): : 7 - 12
  • [25] Approximate Dynamic Programming for Two-Player Zero-Sum Markov Games
    Perolat, Julien
    Scherrer, Bruno
    Piot, Bilal
    Pietquin, Olivier
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1321 - 1329
  • [26] When are Offline Two-Player Zero-Sum Markov Games Solvable?
    Cui, Qiwen
    Du, Simon S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [27] Two-player zero-sum stochastic differential games with regime switching
    Lv, Siyu
    AUTOMATICA, 2020, 114
  • [28] Pure strategy equilibria in symmetric two-player zero-sum games
    Duersch, Peter
    Oechssler, Joerg
    Schipper, Burkhard C.
    INTERNATIONAL JOURNAL OF GAME THEORY, 2012, 41 (03) : 553 - 564
  • [29] Policy Gradient Algorithm in Two-Player Zero-Sum Markov Games
    Li Y.
    Zhou J.
    Feng Y.
    Feng Y.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (01): : 81 - 91
  • [30] Algorithms for uniform optimal strategies in two-player zero-sum stochastic games with perfect information
    Avrachenkov, Konstantin
    Cottatellucci, Laura
    Maggi, Lorenzo
    OPERATIONS RESEARCH LETTERS, 2012, 40 (01) : 56 - 60