An efficient model-free adaptive optimal control of continuous-time nonlinear non-zero-sum games based on integral reinforcement learning with exploration

被引:0
|
作者
Guo, Lei [1 ]
Xiong, Wenbo [1 ]
Song, Yuan [1 ]
Gan, Dongming [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China
[2] Purdue Univ, Sch Engn Technol, W Lafayette, IN USA
来源
IET CONTROL THEORY AND APPLICATIONS | 2024年 / 18卷 / 06期
基金
中国国家自然科学基金;
关键词
adaptive control; dynamic programming; game theory; optimal control; OPTIMAL TRACKING CONTROL; SYSTEMS;
D O I
10.1049/cth2.12610
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To reduce the learning time and space occupation, this study presents a novel model-free algorithm for obtaining the Nash equilibrium solution of continuous-time nonlinear non-zero-sum games. Based on the integral reinforcement learning method, a new integral HJ equation that can quickly and cooperatively determine the Nash equilibrium strategies of all players is proposed. By leveraging the neural network approximation and gradient descent method, simultaneous continuous-time adaptive tuning laws are provided for both critic and actor neural network weights. These laws facilitate the estimation of the optimal value function and optimal policy without requiring knowledge or identification of the system's dynamics. The closed-loop system stability and convergence of weights are guaranteed through the Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary NNs used in the critic. The simulation results for a two-player non-zero-sum game validate the effectiveness of the proposed algorithm.
引用
收藏
页码:748 / 763
页数:16
相关论文
共 50 条
  • [31] Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games
    Song, Ruizhuo
    Lewis, Frank L.
    Wei, Qinglai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (03) : 704 - 713
  • [32] Model-free adaptive optimal control for nonlinear multiplayer games with input disturbances
    Shi, Jing
    Peng, Chen
    Zhang, Jin
    Zhang, Zhihao
    Xie, Xiangpeng
    NEUROCOMPUTING, 2024, 580
  • [33] Model-Free Adaptive Control for Unknown Nonlinear Zero-Sum Differential Game
    Zhong, Xiangnan
    He, Haibo
    Wang, Ding
    Ni, Zhen
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (05) : 1633 - 1646
  • [34] Reinforcement learning for adaptive optimal control of continuous-time linear periodic systems
    Pang, Bo
    Jiang, Zhong-Ping
    Mareels, Iven
    AUTOMATICA, 2020, 118
  • [35] Model-free continuation of periodic orbits in certain nonlinear systems using continuous-time adaptive control
    Li, Yang
    Dankowicz, Harry
    NONLINEAR DYNAMICS, 2023, 111 (06) : 4945 - 4957
  • [36] Model-free continuation of periodic orbits in certain nonlinear systems using continuous-time adaptive control
    Yang Li
    Harry Dankowicz
    Nonlinear Dynamics, 2023, 111 : 4945 - 4957
  • [37] Neural networks-based optimal tracking control for nonzero-sum games of multi-player continuous-time nonlinear systems via reinforcement learning
    Zhao, Jingang
    NEUROCOMPUTING, 2020, 412 : 167 - 176
  • [38] MODEL-FREE PREDICTIVE CONTROL OF NONLINEAR PROCESSES BASED ON REINFORCEMENT LEARNING
    Shah, Hitesh
    Gopal, M.
    IFAC PAPERSONLINE, 2016, 49 (01): : 89 - 94
  • [39] Model-free adaptive control design for nonlinear discrete-time processes with reinforcement learning techniques
    Liu, Dong
    Yang, Guang-Hong
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2018, 49 (11) : 2298 - 2308
  • [40] Neural-network-based safe learning control for non-zero-sum differential games of nonlinear systems with asymmetric input constraints
    Qin, Chunbin
    Zhu, Tianzeng
    Jiang, Kaijun
    Wu, Yinliang
    Zhang, Jishi
    APPLIED INTELLIGENCE, 2024, : 7810 - 7828