共 36 条
Consensus of Nonlinear Multiagent Systems With Uncertainties Using Reinforcement Learning Based Sliding Mode Control
被引:39
作者:
Li, Jinna
[1
]
Yuan, Lin
[1
]
Chai, Tianyou
[2
]
Lewis, Frank L.
[3
]
机构:
[1] Liaoning Petrochem Univ, Sch Informat & Control Engn, Fushun 113001, Peoples R China
[2] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[3] Univ Texas Arlington, UTA Res Inst, Arlington, TX 76118 USA
基金:
中国国家自然科学基金;
关键词:
Uncertainty;
Delays;
Protocols;
Sliding mode control;
Multi-agent systems;
Robustness;
Reinforcement learning;
distributed consensus control;
sliding mode control;
reinforcement learning;
CONTINUOUS-TIME SYSTEMS;
TRACKING CONTROL;
GRAPHICAL GAMES;
DESIGN;
DISTURBANCES;
D O I:
10.1109/TCSI.2022.3206102
中图分类号:
TM [电工技术];
TN [电子技术、通信技术];
学科分类号:
0808 ;
0809 ;
摘要:
This paper investigates distributed control protocols design for uncertain nonlinear multi-agent systems with the goal of achieving the optimal consensus. The critical challenges encountered when designing the optimal distributed control protocols are mainly caused by the internal coupling of agents, uncertainty and nonlinear dynamics. Communication delay among agents makes overcoming these challenges even more difficult. To this end, a novel sliding mode control design method is developed based on the sliding mode control principle and the reinforcement learning technique. The remarkable highlights of the developed method in this paper include the design of distributed sliding mode controllers and the integrated framework of sliding mode control and reinforcement learning, which bring the outcome of successfully learning the composite distributed control protocols for multi-agent systems. Thus, all agents can successfully eliminate the negative impacts brought by system uncertainties and communication delay among agents, and finally follow the leader with a nearly optimal approach. The reachability of sliding mode surfaces and the optimal consensus are rigorously proven and analyzed. Finally, simulation results illustrate the effectiveness of the developed method.
引用
收藏
页码:424 / 434
页数:11
相关论文