Observer-based adaptive optimal output containment control problem of linear heterogeneous Multiagent systems with relative output measurements

被引：13

作者：

Mazouchi, Majid ^{[1
]}

Naghibi-Sistani, Mohammad Bagher ^{[1
]}

Sani, Seyed Kamal Hosseini ^{[1
]}

Tatari, Farzaneh ^{[2
]}

Modares, Hamidreza ^{[3
]}

机构：

[1] Ferdowsi Univ Mashhad, Dept Elect Engn, Mashhad, Iran

[2] Univ Semnan, Dept Elect Engn, Semnan, Iran

[3] Michigan State Univ, Dept Mech Engn, E Lansing, MI 48824 USA

来源：

INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING | 2019年 / 33卷 / 02期

关键词：

adaptive distributed observer; cooperative output regulation; optimal control; output containment control; reinforcement learning; TRACKING CONTROL; DYNAMIC LEADERS; STATE-FEEDBACK; CONSENSUS; DESIGN; SYNCHRONIZATION; ALGORITHM; NETWORK;

D O I：

10.1002/acs.2950

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper develops a relative output-feedback-based solution to the containment control of linear heterogeneous multiagent systems. A distributed optimal control protocol is presented for the followers to not only assure that their outputs fall into the convex hull of the leaders' output but also optimizes their transient performance. The proposed optimal solution is composed of a feedback part, depending of the followers' state, and a feed-forward part, depending on the convex hull of the leaders' state. To comply with most real-world applications, the feedback and feed-forward states are assumed to be unavailable and are estimated using two distributed observers. That is, a distributed observer is designed to measure each agent's states using only its relative output measurements and the information that it receives by its neighbors. Another adaptive distributed observer is designed, which uses exchange of information between followers over a communication network to estimate the convex hull of the leaders' state. The proposed observer relaxes the restrictive requirement of having access to the complete knowledge of the leaders' dynamics by all the followers. An off-policy reinforcement learning algorithm on an actor-critic structure is next developed to solve the optimal containment control problem online, using relative output measurements and without requiring the leaders' dynamics. Finally, the theoretical results are verified by numerical simulations.

引用

页码：262 / 284

页数：23

共 64 条

[51] Distributed learning algorithm for non-linear differential graphical games
Tatari, Farzaneh
Naghibi-Sistani, Mohammad-Bagher
Vamvoudakis, Kyriakos G.
[J]. TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2017, 39 (02) : 173 - 182
[52] Conflict resolution for air traffic management: A study in multiagent hybrid systems
Tomlin, C
Pappas, GJ
Sastry, S
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1998, 43 (04) : 509 - 521
[53] Vamvoudakis KG, 2017, INT J ADAPT CONTROL
[54] A Distributed Control Approach to A Robust Output Regulation Problem for Multi-Agent Linear Systems
Wang, Xiaoli
Hong, Yiguang
Huang, Jie
Jiang, Zhong-Ping
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2010, 55 (12) : 2891 - 2895
[55] Containment control of multi-agent systems in a noisy communication environment
Wang, Yunpeng
Cheng, Long
Hou, Zeng-Guang
Tan, Min
Wang, Ming
[J]. AUTOMATICA, 2014, 50 (07) : 1922 - 1928
[56] Containment of Higher-Order Multi-Leader Multi-Agent Systems: A Dynamic Output Approach
Wen, Guanghui
Zhao, Yu
Duan, Zhisheng
Yu, Wenwu
Chen, Guanrong
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2016, 61 (04) : 1135 - 1140
[57] Wu J, 2016, PROC EUR CONF ANTENN
[58] Output regulation of heterogeneous linear multi-agent systems with differential graphical game
Yaghmaie, Farnaz Adib
Lewis, Frank L.
Su, Rong
[J]. INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2016, 26 (10) : 2256 - 2278
[59] Yang X, 2017, IEEE T SYST MAN CYBE
[60] Online concurrent reinforcement learning algorithm to solve two-player zero-sum games for partially unknown nonlinear continuous-time systems
Yasini, Sholeh
Karimpour, Ali
Sistani, Mohammad-Bagher Naghibi
Modares, Hamidreza
[J]. INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2015, 29 (04) : 473 - 493

← 1 2 3 4 5 6 7 →