Optimized Leader-Follower Consensus Control of Multi-QUAV Attitude System Using Reinforcement Learning and Backstepping

被引：0

作者：

Wen, Guoxing ^{[1
]}

Song, Yanfen ^{[2
,3
]}

Li, Zijun ^{[2
,3
]}

Li, Bin ^{[2
,3
]}

机构：

[1] Shandong Univ Aeronaut, Coll Sci, Binzhou 256600, Peoples R China

[2] Qilu Univ Technol, Shandong Acad Sci, Sch Math & Stat, Jinan 250103, Peoples R China

[3] Qilu Univ Technol, Shandong Acad Sci, Shandong Artificial Intelligence Inst, Jinan 250353, Peoples R China

来源：

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2025年 / 9卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Attitude control; Consensus control; Backstepping; Artificial neural networks; Mathematical models; Vehicle dynamics; Reinforcement learning; Optimal control; Nonlinear dynamical systems; Differential equations; Unmanned aerial vehicle; optimal consensus control; quadrotor attitude; reinforcement learning; backstepping control; TRACKING CONTROL; CONTROL DESIGN; QUADROTOR; FEEDBACK;

D O I：

10.1109/TETCI.2025.3537943

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This work is to explore the optimized leader-follower attitude consensus scheme for the multi-quadrotor unmanned aerial vehicle (QUAV) system. Since the QUAV attitude dynamic is modeled by a second-order nonlinear differential equation, the optimized backstepping (OB) technique can be competent for this control design. To derive the optimized leader-follower attitude consensus control, the critic-actor reinforcement learning (RL) is performed in the final backstepping step. Different with the attitude control of single QUAV, the case of multi-QUAV is composed of multiple intercommunicated QUAV attitude individuals, so its control design is more complex and thorny. Moreover, the traditional RL optimizing controls deduce the critic or actor updating law from the negative gradient of approximated Hamilton-Jacobi-Bellman (HJB) equation' square, thus it leads to these algorithms very complexity. Hence the traditional optimizing control methods are implemented to multi-QUAV attitude system difficultly. However, since this optimized scheme deduces the RL training laws from a simple positive function of equivalent with HJB equation, it can obviously simplify algorithm for the smooth application in the multi-QUAV attitude system. Finally, theory and simulation certify the feasibility of this optimized consensus control.

引用

页码：1469 / 1479

页数：11

共 39 条

[1] The Employment of Unmanned Aerial Vehicles for Analyzing and Mitigating Disaster Risks in Industrial Sites
Aiello, Giuseppe
Hopps, Fabrizio
Santisi, Domenico
Venticinque, Mario
[J]. IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT, 2020, 67 (03) : 519 - 530
[2] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
Bhasin, S.
Kamalapurkar, R.
Johnson, M.
Vamvoudakis, K. G.
Lewis, F. L.
Dixon, W. E.
[J]. AUTOMATICA, 2013, 49 (01) : 82 - 92
[3] Bolandi H., 2013, Intell. Control Automat., V04, P342, DOI [10.4236/ica.2013.43040, DOI 10.4236/ICA.2013.43039, 10.4236/ica.2013.43039]
[4] Caprari G, 2010, INT C APPL ROBOT POW
[5] Coaxial Helicopter Attitude Control System Design by Advanced Model Predictive Control under Disturbance
Chen, Zhi
Lin, Xiangyu
Jiang, Wanyue
[J]. AEROSPACE, 2024, 11 (06)
[6] Fixed-Time Prescribed Performance Adaptive Trajectory Tracking Control for a QUAV
Cui, Guozeng
Yang, Wei
Yu, Jinpeng
Li, Ze
Tao, Chongben
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (02) : 494 - 498
[7] Nonlinear control of active magnetic bearings: A backstepping approach
deQueiroz, MS
Dawson, DM
[J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 1996, 4 (05) : 545 - 552
[8] Event-triggered-based adaptive dynamic programming for distributed formation control of multi-UAV
Dou, Liqian
Cai, Siyuan
Zhang, Xiuyun
Su, Xiaotong
Zhang, Ruilong
[J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2022, 359 (08): : 3671 - 3691
[9] Direct adaptive NN control of a class of nonlinear systems
Ge, SS
Wang, C
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (01): : 214 - 221
[10] Robust tracking control for quadrotor with unknown nonlinear dynamics using adaptive neural network based fractional-order backstepping control
Guettal, Lemya
Chelihi, Abdelghani
Ajgou, Riadh
Touba, Mostefa Mohamed
[J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2022, 359 (14): : 7337 - 7364

← 1 2 3 4 →