Robust output group formation tracking control of heterogeneous multi-agent systems with multiple leaders using reinforcement learning☆ ☆

被引:1
作者
Shi, Yu [1 ]
Hua, Yongzhao [2 ]
Yu, Jianglong [1 ]
Dong, Xiwang [1 ,2 ,3 ]
Ren, Zhang [1 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
[3] Beihang Univ, Inst Unmanned Syst, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Output group formation; Distributed adaptive observer; Data-driven; Robust control; Reinforcement learning; TIME LINEAR-SYSTEMS; FORMATION-CONTAINMENT; SYNCHRONIZATION;
D O I
10.1016/j.sysconle.2024.105897
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the distributed output formation tracking problem of grouped heterogeneous multi-agent systems under multiple leaders and uncertainties using reinforcement learning (RL). The outputs of followers are supposed to achieve robust tracking to the respective convex point of group leaders while generating an expected time-varying formation configuration. First, a distributed adaptive observer is designed under a directed graph to coordinate the multiple group leaders while estimating the leaders' dynamics in finite-time. The adaptive mechanism avoids global information of the graph. Second, an optimal tracking problem with respect to the observer is formulated for each follower, while the feedback tracking controller is derived using an action-dependent RL algorithm. An extended learning process for essential dynamics is constructed using the same data, while the output regulation equations are solved equivalently. Third, the robust formation controller and feasibility condition are further proposed based on previous learning results. Stability of the synthetical data-driven controller is analyzed under internal uncertainties and external disturbances. Finally, simulation results are provided to demonstrate the effectiveness of the hierarchical control framework.
引用
收藏
页数:11
相关论文
共 42 条
[31]   Exponential stability for formation control systems with generalized controllers: A unified approach [J].
Sun, Zhiyong ;
Mou, Shaoshuai ;
Anderson, Brian D. O. ;
Cao, Ming .
SYSTEMS & CONTROL LETTERS, 2016, 93 :50-57
[32]  
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[33]   Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach [J].
Vamvoudakis, Kyriakos G. .
SYSTEMS & CONTROL LETTERS, 2017, 100 :14-20
[34]   Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem [J].
Vamvoudakis, Kyriakos G. ;
Lewis, Frank L. .
AUTOMATICA, 2010, 46 (05) :878-888
[35]   Data-Efficient Off-Policy Learning for Distributed Optimal Tracking Control of HMAS With Unidentified Exosystem Dynamics [J].
Xu, Yong ;
Wu, Zheng-Guang .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) :3181-3190
[36]   Optimal Containment Control of Unknown Heterogeneous Systems With Active Leaders [J].
Yang, Yongliang ;
Modares, Hamidreza ;
Wunsch, Donald C., II ;
Yin, Yixin .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2019, 27 (03) :1228-1236
[37]   Output containment control of heterogeneous multi-agent systems with leaders of bounded inputs: An adaptive finite-time observer approach [J].
Yuan, Chengzhi ;
Zeng, Wei .
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2019, 356 (06) :3419-3442
[38]   Formation Learning Control of Multiple Autonomous Underwater Vehicles With Heterogeneous Nonlinear Uncertain Dynamics [J].
Yuan, Chengzhi ;
Licht, Stephen ;
He, Haibo .
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (10) :2920-2934
[39]   Event-Triggered Robust Adaptive Dynamic Programming With Output Feedback for Large-Scale Systems [J].
Zhao, Fuyu ;
Gao, Weinan ;
Liu, Tengfei ;
Jiang, Zhong-Ping .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (01) :63-74
[40]   Data-Driven Fault-Tolerant Control for Attitude Synchronization of Nonlinear Quadrotors [J].
Zhao, Wanbing ;
Liu, Hao ;
Lewis, Frank L. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (11) :5584-5591