Robust ADP-based control for uncertain nonlinear Stackelberg games

Cited by: 4
Authors
Yu, Lin [1, 2]
Lai, Jing [1 ]
Xiong, Junlin [1 ]
Xie, Min [2 ]
Affiliations
[1] Univ Sci & Technol China, Dept Automat, Hefei, Peoples R China
[2] City Univ Hong Kong, Dept Adv Design & Syst Engn, Kowloon, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Actor-critic structure; Adaptive dynamic programming; Identifier; Neural network; Stackelberg game; EXISTENCE; FEEDBACK; SYSTEMS;
DOI
10.1016/j.neucom.2023.126834
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Stackelberg games allow players to access system information differently and take actions asynchronously. This paper introduces a robust adaptive dynamic programming (ADP)-based method for solving a nonlinear two-player Stackelberg game subject to external disturbances. Combined with a neural network identifier, the method is implemented on an actor-critic-disturbance structure to approximate the optimal value function and, from it, the corresponding Stackelberg equilibrium. With the aid of a costate, the leader-follower optimization problem is transformed into two parametric equations and a costate equation. The coefficients of the critic approximators and the costate are updated simultaneously to reach the Stackelberg equilibrium. The proposed control method finds real-time approximations of the Stackelberg-saddle equilibrium while ensuring the stability of the closed-loop system. Finally, a simulation example demonstrates the effectiveness and advantages of the method.
Pages: 12
Related papers
50 in total
  • [1] ADP-based nonlinear optimal output regulation with nonlinear exosystem
    Jiang, Haoan
    Jin, Peng
    Ma, Qian
    Zhou, Guopeng
    Miao, Guoying
    NEURAL COMPUTING & APPLICATIONS, 2023,
  • [2] ADP-based robust consensus for multi-agent systems with unknown dynamics and random uncertain channels
    Xiong, Chunping
    Ma, Qian
    Zhou, Guopeng
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (06) : 4051 - 4063
  • [3] Distributed Finite-Time ADP-Based Optimal Control for Nonlinear Multiagent Systems
    Zhang, Longjie
    Chen, Yong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (12) : 4534 - 4538
  • [4] Optimal Placements of Actuators and Robust ADP-Based Vibration Control for Large Flexible Space Structures
    Guo, Jianguo
    Tian, Dalong
    Huang, He
    Guo, Zongyi
    Feng, Zhenxin
    JOURNAL OF VIBRATION ENGINEERING & TECHNOLOGIES, 2024, 12 (02) : 1291 - 1307
  • [5] Event-triggered robust hierarchical control for uncertain multiplayer Stackelberg games via adaptive dynamic programming
    Zhang, Yongwei
    Zhao, Bo
    Liu, Derong
    Polycarpou, Marios M.
    Peng, Shiguo
    Zhang, Shunchao
    NEUROCOMPUTING, 2025, 616
  • [6] Event-Triggered Robust Adaptive Dynamic Programming for Multiplayer Stackelberg-Nash Games of Uncertain Nonlinear Systems
    Lin, Mingduo
    Zhao, Bo
    Liu, Derong
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (01) : 273 - 286
  • [7] ADP-Based Model Reference Adaptive Control Design for Unknown Discrete-Time Nonlinear Systems
    Wang, Wei
    Chen, Xin
    Wang, Fang
    Fu, Hao
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 8049 - 8054
  • [8] ADP-based online compensation hierarchical sliding-mode control for partially unknown switched nonlinear systems with actuator failures
    Wang, Tengda
    Niu, Ben
    Xu, Ning
    Zhang, Liang
    ISA TRANSACTIONS, 2024, 155 : 69 - 81
  • [9] Robust ADP-based solution of a class of nonlinear multi-agent systems with input saturation and collision avoidance constraints
    Khankalantary, Saeed
    Izadi, Iman
    Sheikholeslam, Farid
    ISA TRANSACTIONS, 2020, 107 : 52 - 62
  • [10] Online policy iteration ADP-based attitude-tracking control for hypersonic vehicles
    Han, Xiao
    Zheng, Zongzhun
    Liu, Lei
    Wang, Bo
    Cheng, Zhongtao
    Fan, Huijin
    Wang, Yongji
    AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 106