Model-free adaptive optimal control of continuous-time nonlinear non-zero-sum games based on reinforcement learning

Cited by: 5
Authors
Guo, Lei [1]
Zhao, Han [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
Source
IET CONTROL THEORY AND APPLICATIONS | 2023 / Vol. 17 / Issue 02
Funding
National Natural Science Foundation of China;
Keywords
APPROXIMATE OPTIMAL-CONTROL; LINEAR-SYSTEMS;
DOI
10.1049/cth2.12376
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, two novel algorithms are presented for finding the Nash equilibrium solution of non-zero-sum games for continuous-time input-affine nonlinear systems. Based on the integral reinforcement learning method, the integral-exploration-coupled Hamilton-Jacobi (HJ) equations are derived, which do not contain any information about the system dynamics. Then, based on neural network approximation, two different adaptive weight-tuning laws are given to estimate the approximate solution of the coupled HJ equations. Both algorithms can estimate the value function and the policy without knowing or identifying the system dynamics. The closed-loop system stability and the convergence of the weights are guaranteed by Lyapunov analysis. Finally, the simulation results of a two-player non-zero-sum game demonstrate the effectiveness of the algorithms.
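For orientation, the problem class named in the abstract can be sketched in its standard textbook form. This is an illustrative formulation only: the symbols f, g_j, Q_i, R_ij, N and the interval length T are assumptions introduced here, and the paper's own integral-exploration-coupled HJ equations are not reproduced.

```latex
% Illustrative textbook formulation of an N-player non-zero-sum game.
% All symbols (f, g_j, Q_i, R_{ij}, N, T) are assumptions, not taken from the paper.
\begin{align}
% Input-affine dynamics driven by N players
\dot{x} &= f(x) + \sum_{j=1}^{N} g_j(x)\,u_j, \\
% Value function of player i
V_i(x(t)) &= \int_t^{\infty} \Big( Q_i(x) + \sum_{j=1}^{N} u_j^{\top} R_{ij}\,u_j \Big)\, d\tau,
  \qquad i = 1,\dots,N, \\
% Coupled Hamilton-Jacobi equations at the Nash equilibrium
0 &= Q_i(x) + \sum_{j=1}^{N} u_j^{*\top} R_{ij}\,u_j^{*}
  + \big(\nabla V_i^{*}(x)\big)^{\top} \Big( f(x) + \sum_{j=1}^{N} g_j(x)\,u_j^{*} \Big), \\
% Nash equilibrium policy of player i
u_i^{*}(x) &= -\tfrac{1}{2}\, R_{ii}^{-1}\, g_i^{\top}(x)\, \nabla V_i^{*}(x), \\
% Integral (interval) form of the Bellman equation used in integral reinforcement learning
V_i(x(t)) &= \int_t^{t+T} \Big( Q_i(x) + \sum_{j=1}^{N} u_j^{\top} R_{ij}\,u_j \Big)\, d\tau
  + V_i\big(x(t+T)\big).
\end{align}
```

The integral form replaces the drift term f(x) with data measured along the trajectory over [t, t+T], which is the usual reason integral reinforcement learning can avoid a model of the drift dynamics. The equilibrium policy above still involves g_i(x); according to the abstract, the paper's exploration-based equations remove all dependence on the system dynamics.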
Pages: 223 - 239
Number of pages: 17
Related papers
50 in total
  • [21] Reinforcement Learning and Adaptive Optimal Control for Continuous-Time Nonlinear Systems: A Value Iteration Approach
    Bian, Tao
    Jiang, Zhong-Ping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) : 2781 - 2790
  • [22] Event-Triggered Optimal Tracking Control for Multiplayer Non-Zero-Sum Games of Nonlinear Systems via Concurrent Learning
    Qin, Yi
    Wang, Lijie
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 479 - 484
  • [23] Online reinforcement learning multiplayer non-zero sum games of continuous-time Markov jump linear systems
    Xin, Xilin
    Tu, Yidong
    Stojanovic, Vladimir
    Wang, Hai
    Shi, Kaibo
    He, Shuping
    Pan, Tianhong
    APPLIED MATHEMATICS AND COMPUTATION, 2022, 412
  • [24] Model-free distributed optimal control for continuous-time linear systems
    Feng, Xinjun
    Zhao, Zhiyun
    IET CONTROL THEORY AND APPLICATIONS, 2022, 16 (16) : 1685 - 1695
  • [25] Equilibrium in two-player non-zero-sum Dynkin games in continuous time
    Laraki, Rida
    Solan, Eilon
    STOCHASTICS-AN INTERNATIONAL JOURNAL OF PROBABILITY AND STOCHASTIC PROCESSES, 2013, 85 (06) : 997 - 1014
  • [26] Model-Free Optimal Vibration Control of a Nonlinear System Based on Deep Reinforcement Learning
    Jiang, Jiyuan
    Tang, Jie
    Zhao, Kun
    Li, Meng
    Li, Yinghui
    Cao, Dengqing
    INTERNATIONAL JOURNAL OF STRUCTURAL STABILITY AND DYNAMICS, 2024,
  • [27] Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics
    Li, Hongliang
    Liu, Derong
    Wang, Ding
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 11 (03) : 706 - 714
  • [28] Model-free adaptive optimal control for nonlinear multiplayer games with input disturbances
    Shi, Jing
    Peng, Chen
    Zhang, Jin
    Zhang, Zhihao
    Xie, Xiangpeng
    NEUROCOMPUTING, 2024, 580
  • [29] Model-Free Adaptive Control for Unknown Nonlinear Zero-Sum Differential Game
    Zhong, Xiangnan
    He, Haibo
    Wang, Ding
    Ni, Zhen
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (05) : 1633 - 1646
  • [30] Reinforcement learning for adaptive optimal control of continuous-time linear periodic systems
    Pang, Bo
    Jiang, Zhong-Ping
    Mareels, Iven
    AUTOMATICA, 2020, 118