Consensus of Nonlinear Multiagent Systems With Uncertainties Using Reinforcement Learning Based Sliding Mode Control

被引：27

作者：

Li, Jinna ^{[1
]}

Yuan, Lin ^{[1
]}

Chai, Tianyou ^{[2
]}

Lewis, Frank L. ^{[3
]}

机构：

[1] Liaoning Petrochem Univ, Sch Informat & Control Engn, Fushun 113001, Peoples R China

[2] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China

[3] Univ Texas Arlington, UTA Res Inst, Arlington, TX 76118 USA

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS | 2023年 / 70卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Uncertainty; Delays; Protocols; Sliding mode control; Multi-agent systems; Robustness; Reinforcement learning; distributed consensus control; sliding mode control; reinforcement learning; CONTINUOUS-TIME SYSTEMS; TRACKING CONTROL; GRAPHICAL GAMES; DESIGN; DISTURBANCES;

D O I：

10.1109/TCSI.2022.3206102

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper investigates distributed control protocols design for uncertain nonlinear multi-agent systems with the goal of achieving the optimal consensus. The critical challenges encountered when designing the optimal distributed control protocols are mainly caused by the internal coupling of agents, uncertainty and nonlinear dynamics. Communication delay among agents makes overcoming these challenges even more difficult. To this end, a novel sliding mode control design method is developed based on the sliding mode control principle and the reinforcement learning technique. The remarkable highlights of the developed method in this paper include the design of distributed sliding mode controllers and the integrated framework of sliding mode control and reinforcement learning, which bring the outcome of successfully learning the composite distributed control protocols for multi-agent systems. Thus, all agents can successfully eliminate the negative impacts brought by system uncertainties and communication delay among agents, and finally follow the leader with a nearly optimal approach. The reachability of sliding mode surfaces and the optimal consensus are rigorously proven and analyzed. Finally, simulation results illustrate the effectiveness of the developed method.

引用

页码：424 / 434

页数：11

共 36 条

[1] Multi-agent discrete-time graphical games and reinforcement learning solutions [J].

Abouheaf, Mohammed I. ;

Lewis, Frank L. ;

Vamvoudakis, Kyriakos G. ;

Haesaert, Sofie ;

Babuska, Robert .

AUTOMATICA, 2014, 50 (12) :3038-3053

[2] A Combined Reinforcement Learning and Sliding Mode Control Scheme for Grid Integration of a PV System [J].

Bag, Aurobinda ;

Subudhi, Bidyadhar ;

Ray, Pravat Kumar .

CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2019, 5 (04) :498-506

[3] Fuzzy adaptive dynamic programming-based optimal leader-following consensus for heterogeneous nonlinear multi-agent systems [J].

Cai, Yuliang ;

Zhang, Huaguang ;

Zhang, Kun ;

Liu, Chong .

NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13) :8763-8781

[4] Adaptive Reinforcement Learning Strategy with Sliding Mode Control for Unknown and Disturbed Wheeled Inverted Pendulum [J].

Dao, Phuong Nam ;

Liu, Yen-Chen .

INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2021, 19 (02) :1139-1150

[5] Novel Nonsingular Terminal Sliding Mode Control for Multi-Agent Tracking Systems With Application to Jerk Circuit [J].

Dong, Lijing ;

Yu, Deyin ;

Nguang, Sing Kiong .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2020, 67 (08) :1429-1433

[6] Adaptive Actor-Critic Design-Based Integral Sliding-Mode Control for Partially Unknown Nonlinear Systems With Input Disturbances [J].

Fan, Quan-Yong ;

Yang, Guang-Hong .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (01) :165-177

[7] Fault-Tolerant Consensus Control for Multiagent Systems: An Encryption-Decryption Scheme [J].

Gao, Chen ;

Wang, Zidong ;

He, Xiao ;

Dong, Hongli .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (05) :2560-2567

[8] Consensus Control of Linear Multiagent Systems Under Actuator Imperfection: When Saturation Meets Fault [J].

Gao, Chen ;

Wang, Zidong ;

He, Xiao ;

Han, Qing-Long .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (04) :2651-2663

[9] On Consensus of Second-Order Multiagent Systems With Actuator Saturations: A Generalized-Nyquist-Criterion-Based Approach [J].

Gao, Chen ;

Wang, Zidong ;

He, Xiao ;

Han, Qing-Long .

IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) :9048-9058

[10] Robust Finite-Time Consensus Tracking Algorithm for Multirobot Systems [J].

Khoo, Suiyang ;

Xie, Lihua ;

Man, Zhihong .

IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2009, 14 (02) :219-228

← 1 2 3 4 →