Model-free adaptive optimal control of continuous-time nonlinear non-zero-sum games based on reinforcement learning

Cited by: 5
Authors
Guo, Lei [1]
Zhao, Han [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
Source
IET CONTROL THEORY AND APPLICATIONS | 2023 / Vol. 17 / Issue 02
Funding
National Natural Science Foundation of China;
Keywords
APPROXIMATE OPTIMAL-CONTROL; LINEAR-SYSTEMS;
DOI
10.1049/cth2.12376
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper presents two novel algorithms for finding the Nash equilibrium solution of non-zero-sum games for continuous-time input-affine nonlinear systems. Based on the integral reinforcement learning method, integral-exploration-coupled Hamilton-Jacobi (HJ) equations are derived that do not contain any information about the system dynamics. Then, using neural network approximation, two different adaptive tuning laws for the weights are given to estimate the approximate solution of the coupled HJ equations. Both algorithms can estimate the value function and the policy without knowing or identifying the system dynamics. Closed-loop system stability and convergence of the weights are guaranteed by Lyapunov analysis. Finally, simulation results for a two-player non-zero-sum game demonstrate the effectiveness of the algorithms.
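As a rough illustration of the integral reinforcement learning idea the abstract refers to, the sketch below evaluates fixed feedback policies for a two-player game from trajectory data alone. It is not the paper's algorithm: the scalar linear system, the gains `k1` and `k2`, the cost weights, and the function names are all assumptions made for illustration, and the single quadratic basis `x**2` stands in for a neural network. The estimator relies on the integral Bellman identity V_i(x(t)) - V_i(x(t+T)) = integral of the running cost over [t, t+T], which involves measured states and costs but not the drift or input gains.

```python
import numpy as np

# Hypothetical scalar two-player system: dx/dt = a*x + b1*u1 + b2*u2.
# These model constants are used ONLY to generate data, never by the learner.
a, b1, b2 = -1.0, 1.0, 0.5
k1, k2 = 0.8, 0.4                      # assumed fixed policies u_i = -k_i * x
q = (1.0, 2.0)                         # assumed state weights Q_i
r = ((1.0, 0.5), (0.3, 1.0))           # assumed control weights R_ij


def simulate(x0, dt, steps):
    """Closed-loop trajectory samples (stand-in for measured data)."""
    a_c = a - b1 * k1 - b2 * k2
    t = np.arange(steps + 1) * dt
    return x0 * np.exp(a_c * t)


def irl_value_weight(x, dt, window, player):
    """Least-squares estimate of p in V(x) = p * x**2 for one player.

    Uses p * (x(t)**2 - x(t+T)**2) = integral of the running cost over
    [t, t+T], with T = window * dt. No model constants appear here.
    """
    c = q[player] + r[player][0] * k1**2 + r[player][1] * k2**2
    cost = c * x**2                    # running cost along the trajectory
    phi, rho = [], []
    for s in range(0, len(x) - window, window):
        seg = cost[s:s + window + 1]
        # Trapezoidal rule for the integral reinforcement over one window
        rho.append(dt * (seg.sum() - 0.5 * (seg[0] + seg[-1])))
        phi.append(x[s]**2 - x[s + window]**2)
    phi, rho = np.array(phi), np.array(rho)
    return float(phi @ rho / (phi @ phi))


if __name__ == "__main__":
    x = simulate(x0=1.0, dt=1e-3, steps=2000)
    for i in (0, 1):
        print(f"player {i + 1}: p = {irl_value_weight(x, 1e-3, 50, i):.4f}")
```

For these assumed numbers the closed-loop pole is -2, so the analytic weights are 1.72/4 = 0.43 and 2.352/4 = 0.588, which the data-based estimates should approach as the sampling step shrinks.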
Pages: 223-239
Page count: 17
Related papers
50 in total
  • [1] An efficient model-free adaptive optimal control of continuous-time nonlinear non-zero-sum games based on integral reinforcement learning with exploration
    Guo, Lei
    Xiong, Wenbo
    Song, Yuan
    Gan, Dongming
    IET CONTROL THEORY AND APPLICATIONS, 2024, 18 (06): : 748 - 763
  • [2] Data-Driven Integral Reinforcement Learning for Continuous-Time Non-Zero-Sum Games
    Yang, Yongliang
    Wang, Liming
    Modares, Hamidreza
    Ding, Dawei
    Yin, Yixin
    Wunsch, Donald
    IEEE ACCESS, 2019, 7 : 82901 - 82912
  • [3] Model-Free Temporal Difference Learning for Non-Zero-Sum Games
    Wang, Liming
    Yang, Yongliang
    Ding, Dawei
    Yin, Yixin
    Guo, Zhishan
    Wunsch, Donald C.
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [4] Robust Tracking Control for Non-Zero-Sum Games of Continuous-Time Uncertain Nonlinear Systems
    Qin, Chunbin
    Shang, Ziyang
    Zhang, Zhongwei
    Zhang, Dehua
    Zhang, Jishi
    MATHEMATICS, 2022, 10 (11)
  • [5] Model-Free Adaptive Algorithm for Optimal Control of Continuous-Time Nonlinear System
    Zhu, Yuanheng
    Zhao, Dongbin
    2015 CHINESE AUTOMATION CONGRESS (CAC), 2015, : 1850 - 1855
  • [6] Model-Free Reinforcement Learning for Nonlinear Zero-Sum Games with Simultaneous Explorations
    Zhang, Qichao
    Zhao, Donghin
    Zhu, Yuanheng
    Chen, Xi
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 4533 - 4538
  • [7] Model-free optimal tracking policies for Markov jump systems by solving non-zero-sum games
    Zhou, Peixin
    Xue, Huiwen
    Wen, Jiwei
    Shi, Peng
    Luan, Xaoli
    INFORMATION SCIENCES, 2023, 647
  • [8] Integral reinforcement learning-based online adaptive event-triggered control for non-zero-sum games of partially unknown nonlinear systems
    Su, Hanguang
    Zhang, Huaguang
    Sun, Shaoxin
    Cai, Yuliang
    NEUROCOMPUTING, 2020, 377 : 243 - 255
  • [9] Off-Policy Model-Free Learning for Multi-Player Non-Zero-Sum Games With Constrained Inputs
    Huo, Yu
    Wang, Ding
    Qiao, Junfei
    Li, Menghua
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (02) : 910 - 920
  • [10] Adaptive Q-Learning Based Model-Free H∞ Control of Continuous-Time Nonlinear Systems: Theory and Application
    Zhao, Jun
    Lv, Yongfeng
    Wang, Zhangu
    Zhao, Ziliang
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,