Optimal Robust Output Containment of Unknown Heterogeneous Multiagent System Using Off-Policy Reinforcement Learning

被引:64
作者
Zuo, Shan [1 ,2 ]
Song, Yongduan [2 ]
Lewis, Frank L. [1 ]
Davoudi, Ali [1 ]
机构
[1] Univ Texas Arlington, UTA Res Inst, Ft Worth, TX 76118 USA
[2] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 611731, Sichuan, Peoples R China
基金
美国国家科学基金会;
关键词
Heterogeneous systems; integral reinforcement learning (RL); internal model principle; optimal robust output containment; output-feedback; ADAPTIVE OPTIMAL-CONTROL; CONTINUOUS-TIME SYSTEMS; H-INFINITY CONTROL; TRACKING CONTROL; DYNAMIC LEADERS; CONSENSUS; ITERATION;
D O I
10.1109/TCYB.2017.2761878
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates optimal robust output containment problem of general linear heterogeneous multiagent systems (MAS) with completely unknown dynamics. A model-based algorithm using offline policy iteration (PI) is first developed, where the p-copy internal model principle is utilized to address the system parameter variations. This offline PI algorithm requires the nominal model of each agent, which may not be available in most real-world applications. To address this issue, a discounted performance function is introduced to express the optimal robust output containment problem as an optimal output-feedback design problem with bounded L-2-gain. To solve this problem online in real time, a Bellman equation is first developed to evaluate a certain control policy and find the updated control policies, simultaneously, using only the state/output information measured online. Then, using this Bellman equation, a model-free off-policy integral reinforcement learning algorithm is proposed to solve the optimal robust output containment problem of heterogeneous MAS, in real time, without requiring any knowledge of the system dynamics. Simulation results are provided to verify the effectiveness of the proposed method.
引用
收藏
页码:3197 / 3207
页数:11
相关论文
共 49 条
[1]   LYAPUNOV AND RICCATI-EQUATIONS - PERIODIC INERTIA THEOREMS [J].
BITTANTI, S ;
COLANERI, P .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1986, 31 (07) :659-661
[2]   Distributed containment control with multiple stationary or dynamic leaders in fixed and switching directed networks [J].
Cao, Yongcan ;
Ren, Wei ;
Egerstedt, Magnus .
AUTOMATICA, 2012, 48 (08) :1586-1597
[3]   Distributed Containment Control for Multiple Autonomous Vehicles With Double-Integrator Dynamics: Algorithms and Experiments [J].
Cao, Yongcan ;
Stuart, Daniel ;
Ren, Wei ;
Meng, Ziyang .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2011, 19 (04) :929-938
[4]   Distributed adaptive containment control of heterogeneous linear multi-agent systems: an output regulation approach [J].
Chu, Hongjun ;
Gao, Lixin ;
Zhang, Weidong .
IET CONTROL THEORY AND APPLICATIONS, 2016, 10 (01) :95-102
[5]   Leader-follower cooperative attitude control of multiple rigid bodies [J].
Dimarogonas, Dimos V. ;
Tsiotras, Panagiotis ;
Kyriakopoulos, Kostas J. .
SYSTEMS & CONTROL LETTERS, 2009, 58 (06) :429-435
[6]   Information flow and cooperative control of vehicle formations [J].
Fax, JA ;
Murray, RM .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 49 (09) :1465-1476
[7]   Necessary and sufficient conditions for H-∞ static output-feedback control [J].
Gadewadikar, Jyotirmay ;
Lewis, Frank L. ;
Abu-Khalaf, Murad .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2006, 29 (04) :915-920
[8]   Containment control of heterogeneous linear multi-agent systems [J].
Haghshenas, Hamed ;
Badamchizadeh, Mohammad Ali ;
Baradarannia, Mandi .
AUTOMATICA, 2015, 54 :210-216
[9]   Cooperative Output Regulation of Heterogeneous Multi-Agent Systems: An H∞ Criterion [J].
Huang, Chao ;
Ye, Xudong .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (01) :267-273
[10]  
Huang J., 2004, Nonlinear Output Regulation: Theory and Applications