Multi-agent differential game based cooperative synchronization control using a data-driven method

被引：3

作者：

SHI, Yu ^{[1
]}

HUA, Yongzhao ^{[2
]}

YU, Jianglong ^{[1
]}

DONG, Xiwang ^{[1
,2
]}

REN, Zhang ^{[1
]}

机构：

[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China

[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China

来源：

FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING | 2022年 / 23卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Multi-agent system; Differential game; Synchronization control; Data-driven; Reinforcement learning; TP273; ADAPTIVE LEARNING SOLUTION; INFINITY TRACKING CONTROL; CONTINUOUS-TIME SYSTEMS; ZERO-SUM GAMES; SWITCHING TOPOLOGY; GRAPHICAL GAMES; CONSENSUS; SEEKING; ROBUSTNESS; NETWORKS;

D O I：

10.1631/FITEE.2200001

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper studies the multi-agent differential game based problem and its application to cooperative synchronization control. A systematized formulation and analysis method for the multi-agent differential game is proposed and a data-driven methodology based on the reinforcement learning (RL) technique is given. First, it is pointed out that typical distributed controllers may not necessarily lead to global Nash equilibrium of the differential game in general cases because of the coupling of networked interactions. Second, to this end, an alternative local Nash solution is derived by defining the best response concept, while the problem is decomposed into local differential games. An off-policy RL algorithm using neighboring interactive data is constructed to update the controller without requiring a system model, while the stability and robustness properties are proved. Third, to further tackle the dilemma, another differential game configuration is investigated based on modified coupling index functions. The distributed solution can achieve global Nash equilibrium in contrast to the previous case while guaranteeing the stability. An equivalent parallel RL method is constructed corresponding to this Nash solution. Finally, the effectiveness of the learning process and the stability of synchronization control are illustrated in simulation results.

引用

页码：1043 / 1056

页数：14

共 50 条

[21] Game-based coordination control of multi-agent systems [J].

Zhou, Liqi ;

Zheng, Yuanshi ;

Zhao, Qi ;

Xiao, Feng ;

Zhang, Yuling .

SYSTEMS & CONTROL LETTERS, 2022, 169

[22] Synchronization of Multi-Agent Systems With Time-Varying Control and Delayed Communications [J].

Jia, Qiang ;

Han, Zeyu ;

Tang, Wallace K. S. .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2019, 66 (11) :4429-4438

[23] Reset control for synchronization of multi-agent systems [J].

Meng, Xiangyu ;

Xie, Lihua ;

Soh, Yeng Chai .

AUTOMATICA, 2019, 104 :189-195

[24] Distributed data-driven consensus control of multi-agent systems under switched uncertainties [J].

Liu, Wenjie ;

Li, Yifei ;

Wang, Gang ;

Sun, Jian ;

Chen, Jie .

CONTROL THEORY AND TECHNOLOGY, 2023, 21 (03) :478-487

[25] Distributed data-driven consensus control of multi-agent systems under switched uncertainties [J].

Wenjie Liu ;

Yifei Li ;

Gang Wang ;

Jian Sun ;

Jie Chen .

Control Theory and Technology, 2023, 21 :478-487

[26] Data-driven tracking consensus for a class of unknown nonlinear multi-agent systems [J].

Wu, Jia ;

Liu, Ning ;

Tang, Wenyan .

JOURNAL OF VIBRATION AND CONTROL, 2022, 28 (23-24) :3559-3574

[27] Multi-Agent-Based Data-Driven Distributed Adaptive Cooperative Control in Urban Traffic Signal Timing [J].

Zhang, Haibo ;

Liu, Xiaoming ;

Ji, Honghai ;

Hou, Zhongsheng ;

Fan, Lingling .

ENERGIES, 2019, 12 (07)

[28] Optimal distributed synchronization control for continuous-time heterogeneous multi-agent differential graphical games [J].

Wei, Qinglai ;

Liu, Derong ;

Lewis, Frank L. .

INFORMATION SCIENCES, 2015, 317 :96-113

[29] Reinforcement learning based optimal synchronization control for multi-agent systems with input constraints using vanishing viscosity method [J].

Zhang, Dianfeng ;

Yao, Ying ;

Wu, Zhaojing .

INFORMATION SCIENCES, 2023, 637

[30] Coordination of low-power nonlinear multi-agent systems using cloud computing and a data-driven hybrid predictive control method [J].

Tan, Haoran ;

Wang, Yaonan ;

Zhong, Hang ;

Wu, Min ;

Jiang, Yiming .

CONTROL ENGINEERING PRACTICE, 2021, 108

← 1 2 3 4 5 →