Optimal Self-Learning Cooperative Control for Continuous-Time Heterogeneous Multi-Agent Systems

被引:0
|
作者
Wei Qinglai [1 ]
Liu Derong [1 ]
Song Ruizhuo [2 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
来源
2015 34TH CHINESE CONTROL CONFERENCE (CCC) | 2015年
关键词
Adaptive Critic Designs; Adaptive Dynamic Programming; Approximate Dynamic Programming; Heterogeneous Multi-Agents; Graphical Games; Policy Iteration; Synchronization; DYNAMIC-PROGRAMMING ALGORITHM; OPTIMAL TRACKING CONTROL; NONLINEAR-SYSTEMS; CONTROL SCHEME; REINFORCEMENT; GAMES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an optimal self-learning cooperative control for heterogeneous multi-agent systems by iterative adaptive dynamic programming (ADP) is developed. The main idea is to design an optimal control law by policy iteration based ADP technique which makes all the agents track a given dynamics and simultaneously makes the iterative performance index function reach the Nash equilibrium. The cooperative policy iteration algorithm for graphical differential games is developed to achieve the optimal control law for the agent of each node. Convergence properties are analyzed which make the performance index functions of heterogeneous multi-agent differential graphical games converge to the Nash equilibrium. Simulation example is given to show the effectiveness of the developed optimal self-learning control scheme.
引用
收藏
页码:3005 / 3010
页数:6
相关论文
共 50 条
  • [31] Off-policy integral reinforcement learning optimal tracking control for continuous-time chaotic systems
    Wei Qing-Lai
    Song Rui-Zhuo
    Sun Qiu-Ye
    Xiao Wen-Dong
    CHINESE PHYSICS B, 2015, 24 (09)
  • [32] Off-policy integral reinforcement learning optimal tracking control for continuous-time chaotic systems
    魏庆来
    宋睿卓
    孙秋野
    肖文栋
    Chinese Physics B, 2015, 24 (09) : 151 - 156
  • [33] Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning
    Zhao, Jingang
    Gan, Minggang
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2020, 51 (13) : 2429 - 2440
  • [34] Privacy-preserving bipartite output consensus for continuous-time heterogeneous multi-agent systems via event-triggered impulsive control
    Ma, Jiayue
    Hu, Jiangping
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 5127 - 5137
  • [35] Optimal containment control for a class of heterogeneous multi-agent systems with actuator faults
    Fan, Sijia
    Wang, Tong
    Wu, Ju
    Qiu, Jianbin
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (02) : 849 - 865
  • [36] Data-Driven Optimal Output Cluster Synchronization Control of Heterogeneous Multi-Agent Systems
    Li, Hongyang
    Wei, Qinglai
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (03) : 3910 - 3920
  • [37] Optimal couple-group tracking control for the heterogeneous multi-agent systems with cooperative-competitive interactions via reinforcement learning method
    Li, Jun
    Ji, Lianghao
    Zhang, Cuijuan
    Li, Huaqing
    INFORMATION SCIENCES, 2022, 610 : 401 - 424
  • [38] Finite-time distributed cooperative control for heterogeneous nonlinear multi-agent systems with unknown input constraints
    Liu, Dacai
    Liu, Zhi
    Chen, C. L. Philip
    Zhang, Yun
    NEUROCOMPUTING, 2020, 415 (415) : 123 - 134
  • [39] Reinforcement Learning Consensus Control for Discrete-Time Multi-Agent Systems
    Zhu, Xiaoxia
    Yuan, Xin
    Wang, Yuanda
    Sun, Changyin
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 6178 - 6182
  • [40] Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning
    Wang, Xin
    Zhao, Chen
    Huang, Tingwen
    Chakrabarti, Prasun
    Kurths, Juergen
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 13 - 23