Two-order cooperative optimization of swarm control based on reinforcement learning

Cited by: 0

Authors:

Yu, Dengxiu [1]

Qin, Zhenhao [1]

Chen, Kang [1,4]

Cheong, Kang Hao [2]

Chen, C. L. Philip [3]

Affiliations:

[1] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian, Peoples R China

[2] Singapore Univ Technol & Design, Sci Math & Technol Cluster, Singapore, Singapore

[3] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China

[4] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China

Source:

IET CONTROL THEORY AND APPLICATIONS | 2024 / Volume 18 / Issue 01

Funding:

National Natural Science Foundation of China; China Postdoctoral Science Foundation;

Keywords:

adaptive control; control system analysis; TRACKING CONTROL; CONTROL DESIGN; SYSTEMS; SCHEME;

DOI:

10.1049/cth2.12545

Chinese Library Classification:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

This paper presents a study of the cooperative optimal swarm control problem for two-order multi-agent systems with partially unknown nonlinear functions. Unlike traditional approaches that consider a single error, this paper proposes to use multi-order errors in the performance index function to achieve optimal control performance. Additionally, different proportional coefficients are assigned to illustrate the varying influences of each sequence error, and a two-order cooperative (TOC) performance index function is designed. To address the influence of unknown nonlinear functions, a swarm control system based on sliding mode control with an actor-critic network is constructed, which increases the applicability of the proposed method to a variety of dynamic models. Furthermore, to alleviate the computational pressure caused by the multi-order errors in the TOC performance index function, a new reinforcement learning (RL)-based sliding mode swarm controller is designed. The stability of the proposed controller is demonstrated using the Lyapunov function. Finally, the control model and control rate are applied to a quadrotor unmanned aerial vehicle system, and simulation results demonstrate that the multi-agent systems can effectively achieve swarm control.

Impact Statement: This paper proposes a reinforcement learning-based sliding mode control strategy for the cooperative optimal swarm control problem, where the nonlinear functions of two-order multi-agent systems are only partially known. In addition, we also propose a cooperative performance index function, which takes into account multi-order errors for optimizing the performance. This contribution is significant for research in sliding mode control strategies and error co-optimization.


Pages: 125-136

Page count: 12
