Approximate Dynamic Programming for Two-player Zero-sum Game Related to H∞ Control of Unknown Nonlinear Continuous-time Systems

Cited by: 34
Authors
Yasini, Sholeh [1]
Sistani, Mohammad Bagher Naghibi [1]
Karimpour, Ali [1]
Affiliations
[1] Ferdowsi Univ Mashhad, Dept Elect Engn, Mashhad 917551111, Iran
Keywords
Approximate dynamic programming; concurrent learning; H-infinity control; neural networks; two-player zero-sum game; unknown dynamics; S FUZZY-SYSTEMS; DESIGN;
DOI
10.1007/s12555-014-0085-5
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper develops a concurrent learning-based approximate dynamic programming (ADP) algorithm for solving the two-player zero-sum (ZS) game arising in H-infinity control of continuous-time (CT) systems with unknown nonlinear dynamics. First, the H-infinity control problem is formulated as a ZS game. An online algorithm is then developed that learns the solution to the Hamilton-Jacobi-Isaacs (HJI) equation without any knowledge of the system dynamics. This is achieved by using a neural network (NN) identifier to approximate the uncertain system dynamics. The algorithm is implemented on an actor-critic-disturbance NN structure, together with the NN identifier, to approximate the optimal value function and the corresponding Nash solution of the game. All NNs are tuned simultaneously. By using concurrent learning, the need to check the persistency of excitation condition is relaxed to a simplified condition. The stability of the overall system is guaranteed, and convergence to the Nash solution of the game is shown. Simulation results show the effectiveness of the algorithm.
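To make the link between H-infinity control, the zero-sum game, and the HJI equation concrete, here is a minimal sketch for a scalar linear system with known dynamics. It is not the paper's algorithm, which learns the solution online with neural networks and does not assume the dynamics are known. The system x' = a*x + b*u + k*d, the cost integrand q*x^2 + r*u^2 - gamma^2*d^2, and all function names are assumptions chosen for illustration. With V(x) = p*x^2, the HJI equation reduces to a quadratic in p that can be solved in closed form.

```python
import math


def solve_scalar_hji(a, b, k, q, r, gamma):
    """Return p > 0 such that V(x) = p*x**2 solves the scalar HJI equation.

    Assumed (illustrative) setup: x' = a*x + b*u + k*d with cost integrand
    q*x**2 + r*u**2 - gamma**2 * d**2. Substituting V(x) = p*x**2 gives
        q + 2*a*p - p**2 * (b**2 / r - k**2 / gamma**2) = 0.
    """
    s = b**2 / r - k**2 / gamma**2
    if s <= 0:
        # This sketch only handles the case where the control authority
        # outweighs the disturbance channel at the chosen attenuation level.
        raise ValueError("this closed-form sketch requires b^2/r > k^2/gamma^2")
    # Positive root of s*p**2 - 2*a*p - q = 0.
    return (a + math.sqrt(a**2 + s * q)) / s


def hji_residual(p, a, b, k, q, r, gamma):
    """Left-hand side of the scalar HJI equation divided by x**2."""
    return q + 2 * a * p - p**2 * b**2 / r + p**2 * k**2 / gamma**2


def saddle_point_gains(p, b, k, r, gamma):
    """Linear feedback gains of the saddle point: u* = ku*x, d* = kd*x."""
    ku = -b * p / r          # minimizing player (control)
    kd = k * p / gamma**2    # maximizing player (worst-case disturbance)
    return ku, kd


if __name__ == "__main__":
    a, b, k, q, r, gamma = 1.0, 1.0, 0.5, 1.0, 1.0, 2.0
    p = solve_scalar_hji(a, b, k, q, r, gamma)
    ku, kd = saddle_point_gains(p, b, k, r, gamma)
    print("p =", p)
    print("HJI residual =", hji_residual(p, a, b, k, q, r, gamma))
    print("closed-loop pole =", a + b * ku + k * kd)
```

In this special case the residual is zero up to rounding and the closed-loop pole under both saddle-point policies is negative. For nonlinear, unknown dynamics no such closed form exists in general, which is the setting the abstract addresses with an identifier and actor-critic-disturbance networks.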
Pages: 99 - 109
Number of pages: 11
Related papers
50 in total
  • [1] Approximate dynamic programming for two-player zero-sum game related to H∞ control of unknown nonlinear continuous-time systems
    Sholeh Yasini
    Mohammad Bagher Naghibi Sistani
    Ali Karimpour
    International Journal of Control, Automation and Systems, 2015, 13 : 99 - 109
  • [2] Event-Triggered Adaptive Dynamic Programming for Continuous-Time Nonlinear Two-Player Zero-Sum Game
    Xue, Shan
    Luo, Biao
    Liu, Derong
    Li, Yueheng
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT VII, 2018, 11307 : 15 - 25
  • [3] Robust Adaptive Dynamic Programming of Two-Player Zero-Sum Games for Continuous-Time Linear Systems
    Fu, Yue
    Fu, Jun
    Chai, Tianyou
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (12) : 3314 - 3319
  • [4] Online Solution of Two-Player Zero-Sum Games for Continuous-Time Nonlinear Systems With Completely Unknown Dynamics
    Fu, Yue
    Chai, Tianyou
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (12) : 2577 - 2587
  • [5] Approximate Dynamic Programming for Two-Player Zero-Sum Markov Games
    Perolat, Julien
    Scherrer, Bruno
    Piot, Bilal
    Pietquin, Olivier
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1321 - 1329
  • [6] Event-Triggered Adaptive Dynamic Programming for Zero-Sum Game of Partially Unknown Continuous-Time Nonlinear Systems
    Xue, Shan
    Luo, Biao
    Liu, Derong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (09) : 3189 - 3199
  • [7] Online concurrent reinforcement learning algorithm to solve two-player zero-sum games for partially unknown nonlinear continuous-time systems
    Yasini, Sholeh
    Karimpour, Ali
    Sistani, Mohammad-Bagher Naghibi
    Modares, Hamidreza
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2015, 29 (04) : 473 - 493
  • [8] Stable value iteration for two-player zero-sum game of discrete-time nonlinear systems based on adaptive dynamic programming
    Song, Ruizhuo
    Zhu, Liao
    NEUROCOMPUTING, 2019, 340 : 180 - 195
  • [9] Nonlinear Two-Player Zero-Sum Game Approximate Solution Using a Policy Iteration Algorithm
    Johnson, M.
    Bhasin, S.
    Dixon, W. E.
    2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011 : 142 - 147
  • [10] Discrete-Time Two-Player Zero-Sum Games for Nonlinear Systems Using Iterative Adaptive Dynamic Programming
    Wei, Qinglai
    Liu, Derong
    ADVANCES IN NEURAL NETWORKS - ISNN 2016, 2016, 9719 : 269 - 276