An efficient model-free adaptive optimal control of continuous-time nonlinear non-zero-sum games based on integral reinforcement learning with exploration

被引:0
|
作者
Guo, Lei [1 ]
Xiong, Wenbo [1 ]
Song, Yuan [1 ]
Gan, Dongming [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China
[2] Purdue Univ, Sch Engn Technol, W Lafayette, IN USA
来源
IET CONTROL THEORY AND APPLICATIONS | 2024年 / 18卷 / 06期
基金
中国国家自然科学基金;
关键词
adaptive control; dynamic programming; game theory; optimal control; OPTIMAL TRACKING CONTROL; SYSTEMS;
D O I
10.1049/cth2.12610
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To reduce the learning time and space occupation, this study presents a novel model-free algorithm for obtaining the Nash equilibrium solution of continuous-time nonlinear non-zero-sum games. Based on the integral reinforcement learning method, a new integral HJ equation that can quickly and cooperatively determine the Nash equilibrium strategies of all players is proposed. By leveraging the neural network approximation and gradient descent method, simultaneous continuous-time adaptive tuning laws are provided for both critic and actor neural network weights. These laws facilitate the estimation of the optimal value function and optimal policy without requiring knowledge or identification of the system's dynamics. The closed-loop system stability and convergence of weights are guaranteed through the Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary NNs used in the critic. The simulation results for a two-player non-zero-sum game validate the effectiveness of the proposed algorithm.
引用
收藏
页码:748 / 763
页数:16
相关论文
共 50 条
  • [21] Reinforcement Q-Learning and Non-Zero-Sum Games Optimal Tracking Control for Discrete-Time Linear Multi-Input Systems
    Zhao, Jin-Gang
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 277 - 282
  • [22] Stackelberg games for model-free continuous-time stochastic systems based on adaptive dynamic programming
    Liu, Xikui
    Ge, Yingying
    Li, Yan
    APPLIED MATHEMATICS AND COMPUTATION, 2019, 363
  • [23] Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints
    Yang, Xiong
    Liu, Derong
    Wang, Ding
    INTERNATIONAL JOURNAL OF CONTROL, 2014, 87 (03) : 553 - 566
  • [24] Reinforcement Learning and Adaptive Optimal Control for Continuous-Time Nonlinear Systems: A Value Iteration Approach
    Bian, Tao
    Jiang, Zhong-Ping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) : 2781 - 2790
  • [25] Event-Triggered Optimal Tracking Control for Multiplayer Non-Zero-Sum Games of Nonlinear Systems via Concurrent Learning
    Qin, Yi
    Wang, Lijie
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 479 - 484
  • [26] Online reinforcement learning multiplayer non-zero sum games of continuous-time Markov jump linear systems
    Xin, Xilin
    Tu, Yidong
    Stojanovic, Vladimir
    Wang, Hai
    Shi, Kaibo
    He, Shuping
    Pan, Tianhong
    APPLIED MATHEMATICS AND COMPUTATION, 2022, 412
  • [27] Model-free distributed optimal control for continuous-time linear systems
    Feng, Xinjun
    Zhao, Zhiyun
    IET CONTROL THEORY AND APPLICATIONS, 2022, 16 (16): : 1685 - 1695
  • [28] Integral Reinforcement Learning-Based Adaptive NN Control for Continuous-Time Nonlinear MIMO Systems With Unknown Control Directions
    Guo, Xinxin
    Yan, Weisheng
    Cui, Rongxin
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11): : 4068 - 4077
  • [29] Model-Free Optimal Vibration Control of a Nonlinear System Based on Deep Reinforcement Learning
    Jiang, Jiyuan
    Tang, Jie
    Zhao, Kun
    Li, Meng
    Li, Yinghui
    Cao, Dengqing
    INTERNATIONAL JOURNAL OF STRUCTURAL STABILITY AND DYNAMICS, 2024,
  • [30] Integral reinforcement learning-based event-triggered optimal tracking control for modular robot manipulators via non-zero-sum game
    Dong, Bo
    Ding, Zhendong
    An, Tianjiao
    Cui, Yiming
    Zhu, Xinye
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (09)