Model-free adaptive optimal control of continuous-time nonlinear non-zero-sum games based on reinforcement learning

被引:5
|
作者
Guo, Lei [1 ]
Zhao, Han [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
来源
IET CONTROL THEORY AND APPLICATIONS | 2023年 / 17卷 / 02期
基金
中国国家自然科学基金;
关键词
APPROXIMATE OPTIMAL-CONTROL; LINEAR-SYSTEMS;
D O I
10.1049/cth2.12376
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, two novel algorithms to find the Nash equilibrium solution of the non-zero-sum games for continuous-time input-affine nonlinear systems are presented. Based on integral reinforcement learning method, the integral-exploration-coupled Hamilton-Jacobi (HJ) equations are derived, which does not contain any information of the system dynamics. Then, based on neural networks approximation, two different adaptive tuning law of weights are given to estimate the approximate solution of the coupled HJ equations. Both two algorithms can estimate the value function and the policy without knowing or identifying the system dynamics. The closed-loop system stability and the convergence of weights are guaranteed based on Lyapunov analysis. Finally, the simulation results of a two-player non-zero-sum game demonstrate the effectiveness of our algorithms.
引用
收藏
页码:223 / 239
页数:17
相关论文
共 50 条
  • [31] Model-free continuation of periodic orbits in certain nonlinear systems using continuous-time adaptive control
    Li, Yang
    Dankowicz, Harry
    NONLINEAR DYNAMICS, 2023, 111 (06) : 4945 - 4957
  • [32] Model-free continuation of periodic orbits in certain nonlinear systems using continuous-time adaptive control
    Yang Li
    Harry Dankowicz
    Nonlinear Dynamics, 2023, 111 : 4945 - 4957
  • [33] Neural networks-based optimal tracking control for nonzero-sum games of multi-player continuous-time nonlinear systems via reinforcement learning
    Zhao, Jingang
    NEUROCOMPUTING, 2020, 412 : 167 - 176
  • [34] MODEL-FREE PREDICTIVE CONTROL OF NONLINEAR PROCESSES BASED ON REINFORCEMENT LEARNING
    Shah, Hitesh
    Gopal, M.
    IFAC PAPERSONLINE, 2016, 49 (01): : 89 - 94
  • [35] Model-free adaptive control design for nonlinear discrete-time processes with reinforcement learning techniques
    Liu, Dong
    Yang, Guang-Hong
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2018, 49 (11) : 2298 - 2308
  • [36] Adaptive sampling artificial-actual control for non-zero-sum games of constrained systems
    Liu, Lu
    Song, Ruizhuo
    NEURAL NETWORKS, 2024, 178
  • [37] Adaptive critic control for multi-player non-zero-sum games with asymmetric constraints
    Li, Meng-Hua
    Wang, Ding
    Qiao, Jun-Fei
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (09): : 1562 - 1568
  • [38] Neural-network-based safe learning control for non-zero-sum differential games of nonlinear systems with asymmetric input constraints
    Qin, Chunbin
    Zhu, Tianzeng
    Jiang, Kaijun
    Wu, Yinliang
    Zhang, Jishi
    APPLIED INTELLIGENCE, 2024, : 7810 - 7828
  • [39] Model-free PAC Time-Optimal Control Synthesis with Reinforcement Learning
    Liu, Mengyu
    Lu, Pengyuan
    Chen, Xin
    Sokolsky, Oleg
    Lee, Insup
    Kong, Fanxin
    2024 22ND ACM-IEEE INTERNATIONAL SYMPOSIUM ON FORMAL METHODS AND MODELS FOR SYSTEM DESIGN, MEMOCODE 2024, 2024, : 34 - 45
  • [40] Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game
    Wei, Qinglai
    Zhu, Liao
    Song, Ruizhuo
    Zhang, Pinjia
    Liu, Derong
    Xiao, Jun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (02) : 879 - 892