Robust output group formation tracking control of heterogeneous multi-agent systems with multiple leaders using reinforcement learning☆ ☆

被引：1

作者：

Shi, Yu ^{[1
]}

Hua, Yongzhao ^{[2
]}

Yu, Jianglong ^{[1
]}

Dong, Xiwang ^{[1
,2
,3
]}

Ren, Zhang ^{[1
]}

机构：

[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China

[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China

[3] Beihang Univ, Inst Unmanned Syst, Beijing 100191, Peoples R China

来源：

SYSTEMS & CONTROL LETTERS | 2024年 / 192卷

基金：

中国国家自然科学基金;

关键词：

Output group formation; Distributed adaptive observer; Data-driven; Robust control; Reinforcement learning; TIME LINEAR-SYSTEMS; FORMATION-CONTAINMENT; SYNCHRONIZATION;

D O I：

10.1016/j.sysconle.2024.105897

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper studies the distributed output formation tracking problem of grouped heterogeneous multi-agent systems under multiple leaders and uncertainties using reinforcement learning (RL). The outputs of followers are supposed to achieve robust tracking to the respective convex point of group leaders while generating an expected time-varying formation configuration. First, a distributed adaptive observer is designed under a directed graph to coordinate the multiple group leaders while estimating the leaders' dynamics in finite-time. The adaptive mechanism avoids global information of the graph. Second, an optimal tracking problem with respect to the observer is formulated for each follower, while the feedback tracking controller is derived using an action-dependent RL algorithm. An extended learning process for essential dynamics is constructed using the same data, while the output regulation equations are solved equivalently. Third, the robust formation controller and feasibility condition are further proposed based on previous learning results. Stability of the synthetical data-driven controller is analyzed under internal uncertainties and external disturbances. Finally, simulation results are provided to demonstrate the effectiveness of the hierarchical control framework.

引用

页数：11

共 42 条

[1] The adaptive distributed observer approach to the cooperative output regulation of linear multi-agent systems [J].

Cai, He ;

Lewis, Frank L. ;

Hu, Guoqiang ;

Huang, Jie .

AUTOMATICA, 2017, 75 :299-305

[2] Off-policy learning for adaptive optimal output synchronization of heterogeneous multi-agent systems [J].

Chen, Ci ;

Lewis, Frank L. ;

Xie, Kan ;

Xie, Shengli ;

Liu, Yilu .

AUTOMATICA, 2020, 119

[3] Reinforcement Learning-Based Adaptive Optimal Exponential Tracking Control of Linear Systems With Unknown Dynamics [J].

Chen, Ci ;

Modares, Hamidreza ;

Xie, Kan ;

Lewis, Frank L. ;

Wan, Yan ;

Xie, Shengli .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (11) :4423-4438

[4] Time-Varying Formation Tracking for Linear Multiagent Systems With Multiple Leaders [J].

Dong, Xiwang ;

Hu, Guoqiang .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (07) :3658-3664

[5] Distributed Time-Varying Formation Tracking Analysis and Design for Second-Order Multi-Agent Systems [J].

Dong, Xiwang ;

Xiang, Jie ;

Han, Liang ;

Li, Qingdong ;

Ren, Zhang .

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 86 (02) :277-289

[6] Fully Distributed Cooperative Output Regulation for Heterogeneous Linear Parameter-Varying Systems With Directed Graphs [J].

Fu, Chengcheng ;

Zhang, Hao ;

Huang, Chao ;

Wang, Zhuping ;

Yan, Huaicheng .

IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (03) :1350-1361

[7] Resilient reinforcement learning and robust output regulation under denial-of-service attacks [J].

Gao, Weinan ;

Deng, Chao ;

Jiang, Yi ;

Jiang, Zhong-Ping .

AUTOMATICA, 2022, 142

[8] Adaptive Dynamic Programming and Adaptive Optimal Output Regulation of Linear Systems [J].

Gao, Weinan ;

Jiang, Zhong-Ping .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2016, 61 (12) :4164-4169

[9] Distributed Adaptive Time-Varying Group Formation Tracking for Multiagent Systems With Multiple Leaders on Directed Graphs [J].

Hu, Junyan ;

Bhowmick, Parijat ;

Lanzon, Alexander .

IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2020, 7 (01) :140-150

[10] Distributed adaptive formation tracking for heterogeneous multiagent systems with multiple nonidentical leaders and without well-informed follower [J].

Hua, Yongzhao ;

Dong, Xiwang ;

Li, Qingdong ;

Ren, Zhang .

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (06) :2131-2151

← 1 2 3 4 5 →