Two-order cooperative optimization of swarm control based on reinforcement learning

被引:0
作者
Yu, Dengxiu [1 ]
Qin, Zhenhao [1 ]
Chen, Kang [1 ,4 ]
Cheong, Kang Hao [2 ]
Chen, C. L. Philip [3 ]
机构
[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian, Peoples R China
[2] Singapore Univ Technol & Design, Sci Math & Technol Cluster, Singapore, Singapore
[3] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[4] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
adaptive control; control system analysis; TRACKING CONTROL; CONTROL DESIGN; SYSTEMS; SCHEME;
D O I
10.1049/cth2.12545
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a study of the cooperative optimal swarm control problem for two-order multi-agent systems with partially unknown nonlinear functions. Unlike traditional approaches that consider a single error, this paper proposes to use multi-order errors in the performance index function to achieve optimal control performance. Additionally, different proportional coefficients are assigned to illustrate the varying influences of each sequence error, and a two-order cooperative (TOC)performance index function is designed. To address the influence of unknown nonlinear functions, a swarm control system based on sliding mode control with an actor-critic network is constructed, which increases the applicability of the proposed method to a variety of dynamic models. Furthermore, to alleviate the computational pressure caused by the multi-order errors in the TOC performance index function, a new reinforcement learning (RL)-based sliding mode swarm controller is designed. The stability of the proposed controller is demonstrated using the Lyapunov function. Finally, the control model and control rate are applied to a quadrotor unmanned aerial vehicle system, and simulation results demonstrate that the multi-agent systems can effectively achieve swarm control.Impact Statement: This paper proposes a reinforcement learning-based sliding mode control strategy for the cooperative optimal swarm control problem, where the nonlinear functions of two-order multi-agent systems are only partially known. In addition, we also propose a cooperative performance index function, which takes into account multi-order errors for optimizing the performance. This contribution is significant for research in sliding mode control strategies and error co-optimization. In this paper, we propose a reinforcement learning based sliding mode control strategy for the cooperative optimal swarm control problem where the nonlinear functions of two-order multi-agent systems are partially unknown. In addition, we also propose a two-order cooperative performance index function, the performance function can be optimized according to the multi-order errors at the same time to achieve the purpose of cooperative optimization. This article is very helpful for the research of sliding mode control strategy and error co-optimization.image
引用
收藏
页码:125 / 136
页数:12
相关论文
共 46 条
[1]   NN Reinforcement Learning Adaptive Control for a Class of Nonstrict-Feedback Discrete-Time Systems [J].
Bai, Weiwei ;
Li, Tieshan ;
Tong, Shaocheng .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (11) :4573-4584
[2]   Adaptive Reinforcement Learning Neural Network Control for Uncertain Nonlinear System With Input Saturation [J].
Bai, Weiwei ;
Zhou, Qi ;
Li, Tieshan ;
Li, Hongyi .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (08) :3433-3443
[3]   Swarm Formation Control Utilizing Elliptical Surfaces and Limiting Functions [J].
Barnes, Laura E. ;
Fields, Mary Anne ;
Valavanis, Kimon P. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2009, 39 (06) :1434-1445
[4]   A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems [J].
Bhasin, S. ;
Kamalapurkar, R. ;
Johnson, M. ;
Vamvoudakis, K. G. ;
Lewis, F. L. ;
Dixon, W. E. .
AUTOMATICA, 2013, 49 (01) :82-92
[5]  
Chen Y., 2021, IEEE T ARTIF INTELL, V2, P352, DOI DOI 10.1109/TAI.2021.3093499
[6]  
Dengxiu Yu, 2021, IEEE T SYST MAN CY-S, DOI [10.1109/TSMC.2021.3102587, DOI 10.1109/TSMC.2021.3102587]
[7]   Time-varying formation control for unmanned aerial vehicles with switching interaction topologies [J].
Dong, Xiwang ;
Zhou, Yan ;
Ren, Zhang ;
Zhong, Yisheng .
CONTROL ENGINEERING PRACTICE, 2016, 46 :26-36
[8]   Distributed fixed-time consensus for nonlinear heterogeneous multi-agent systems [J].
Du, Haibo ;
Wen, Guanghui ;
Wu, Di ;
Cheng, Yingying ;
Lu, Jinhu .
AUTOMATICA, 2020, 113
[9]   Finite-Time Synchronization of a Class of Second-Order Nonlinear Multi-Agent Systems Using Output Feedback Control [J].
Du, Haibo ;
He, Yigang ;
Cheng, Yingying .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2014, 61 (06) :1778-1788
[10]   Robust Identification of Nonlinear Errors-in-Variables Systems With Parameter Uncertainties Using Variational Bayesian Approach [J].
Guo, Fan ;
Kodamana, Hariprasad ;
Zhao, Yujia ;
Huang, Biao ;
Ding, Yongsheng .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (06) :3047-3057