Leader-Follower Bipartite Output Synchronization on Signed Digraphs Under Adversarial Factors via Data-Based Reinforcement Learning

Cited by: 29
Authors
Li, Qin [1]
Xia, Lina [1]
Song, Ruizhuo [1]
Liu, Jian [1]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Adversarial inputs; bipartite output synchronization; heterogeneous multiagent systems (MASs); reinforcement learning (RL); resilient H-infinity controller; signed digraphs; MULTIAGENT SYSTEMS; CONTAINMENT CONTROL; FEEDBACK CONTROL; CONSENSUS;
DOI
10.1109/TNNLS.2019.2952611
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This article proposes an optimal solution to the leader-follower bipartite output synchronization problem for heterogeneous multiagent systems (MASs) over signed digraphs in the presence of adversarial inputs. The followers differ in both dynamics and dimensions. Distributed observers are first designed to estimate the leader's two-way state and output over signed digraphs. Then, using the information of the followers and observers, a state transformation converts the leader-follower bipartite output synchronization problem on signed graphs into a conventional distributed leader-follower output problem over nonnegative graphs. A resilient H-infinity controller is designed to mitigate the effect of adversarial inputs in the sensors or actuators of the agents. A data-based reinforcement learning (RL) algorithm is proposed to obtain the optimal control law, so the dynamics of the followers are not required. Finally, a simulation example verifies the effectiveness of the proposed algorithm.
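The abstract's step from a signed graph to a nonnegative graph can be illustrated with a gauge (sign-change) transformation, the standard device for structurally balanced signed graphs. The following is a minimal sketch of that general idea only, not the authors' observer or controller design. The function names `gauge_signs` and `gauge_transform`, the convention that `adj[i][j]` is the weight of the edge from agent j to agent i, and the example weights are all assumptions made for illustration.

```python
from collections import deque


def gauge_signs(adj):
    """Return signs s[i] in {+1, -1} with s[i] * adj[i][j] * s[j] >= 0 for all i, j,
    or None if the signed graph is not structurally balanced.

    Assumption: adj is a square list of lists without self-loops; an edge in
    either direction between i and j constrains their relative sign.
    """
    n = len(adj)
    signs = [0] * n  # 0 means "not assigned yet"
    for start in range(n):
        if signs[start] != 0:
            continue
        signs[start] = 1
        queue = deque([start])
        while queue:
            i = queue.popleft()
            for j in range(n):
                for w in (adj[i][j], adj[j][i]):
                    if w == 0:
                        continue
                    # Cooperative edge: same sign. Antagonistic edge: opposite sign.
                    expected = signs[i] if w > 0 else -signs[i]
                    if signs[j] == 0:
                        signs[j] = expected
                        queue.append(j)
                    elif signs[j] != expected:
                        return None  # contradictory constraints: not balanced
    return signs


def gauge_transform(adj, signs):
    """Apply D * adj * D with D = diag(signs), entry by entry."""
    n = len(adj)
    return [[signs[i] * adj[i][j] * signs[j] for j in range(n)] for i in range(n)]


# Hypothetical 4-agent signed digraph: two antagonistic edges split the
# agents into the groups {0, 1} and {2, 3}.
adj = [
    [0, 1, 0, 0],
    [0, 0, -2, 0],
    [0, 0, 0, 1],
    [-1, 0, 0, 0],
]
signs = gauge_signs(adj)
if signs is not None:
    print(signs)
    print(gauge_transform(adj, signs))
```

When `gauge_signs` returns a sign vector, every entry of the transformed matrix is nonnegative, so tools for ordinary (nonnegative) graphs apply to the sign-flipped coordinates. A return value of `None` indicates that no such transformation exists for the given graph.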
Pages: 4185-4195
Page count: 11