Optimized Leader-Follower Consensus Control of Multi-QUAV Attitude System Using Reinforcement Learning and Backstepping

被引:0
作者
Wen, Guoxing [1 ]
Song, Yanfen [2 ,3 ]
Li, Zijun [2 ,3 ]
Li, Bin [2 ,3 ]
机构
[1] Shandong Univ Aeronaut, Coll Sci, Binzhou 256600, Peoples R China
[2] Qilu Univ Technol, Shandong Acad Sci, Sch Math & Stat, Jinan 250103, Peoples R China
[3] Qilu Univ Technol, Shandong Acad Sci, Shandong Artificial Intelligence Inst, Jinan 250353, Peoples R China
来源
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2025年 / 9卷 / 02期
基金
中国国家自然科学基金;
关键词
Attitude control; Consensus control; Backstepping; Artificial neural networks; Mathematical models; Vehicle dynamics; Reinforcement learning; Optimal control; Nonlinear dynamical systems; Differential equations; Unmanned aerial vehicle; optimal consensus control; quadrotor attitude; reinforcement learning; backstepping control; TRACKING CONTROL; CONTROL DESIGN; QUADROTOR; FEEDBACK;
D O I
10.1109/TETCI.2025.3537943
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work is to explore the optimized leader-follower attitude consensus scheme for the multi-quadrotor unmanned aerial vehicle (QUAV) system. Since the QUAV attitude dynamic is modeled by a second-order nonlinear differential equation, the optimized backstepping (OB) technique can be competent for this control design. To derive the optimized leader-follower attitude consensus control, the critic-actor reinforcement learning (RL) is performed in the final backstepping step. Different with the attitude control of single QUAV, the case of multi-QUAV is composed of multiple intercommunicated QUAV attitude individuals, so its control design is more complex and thorny. Moreover, the traditional RL optimizing controls deduce the critic or actor updating law from the negative gradient of approximated Hamilton-Jacobi-Bellman (HJB) equation' square, thus it leads to these algorithms very complexity. Hence the traditional optimizing control methods are implemented to multi-QUAV attitude system difficultly. However, since this optimized scheme deduces the RL training laws from a simple positive function of equivalent with HJB equation, it can obviously simplify algorithm for the smooth application in the multi-QUAV attitude system. Finally, theory and simulation certify the feasibility of this optimized consensus control.
引用
收藏
页码:1469 / 1479
页数:11
相关论文
共 39 条
  • [1] The Employment of Unmanned Aerial Vehicles for Analyzing and Mitigating Disaster Risks in Industrial Sites
    Aiello, Giuseppe
    Hopps, Fabrizio
    Santisi, Domenico
    Venticinque, Mario
    [J]. IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT, 2020, 67 (03) : 519 - 530
  • [2] A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
    Bhasin, S.
    Kamalapurkar, R.
    Johnson, M.
    Vamvoudakis, K. G.
    Lewis, F. L.
    Dixon, W. E.
    [J]. AUTOMATICA, 2013, 49 (01) : 82 - 92
  • [3] Bolandi H., 2013, Intell. Control Automat., V04, P342, DOI [10.4236/ica.2013.43040, DOI 10.4236/ICA.2013.43039, 10.4236/ica.2013.43039]
  • [4] Caprari G, 2010, INT C APPL ROBOT POW
  • [5] Coaxial Helicopter Attitude Control System Design by Advanced Model Predictive Control under Disturbance
    Chen, Zhi
    Lin, Xiangyu
    Jiang, Wanyue
    [J]. AEROSPACE, 2024, 11 (06)
  • [6] Fixed-Time Prescribed Performance Adaptive Trajectory Tracking Control for a QUAV
    Cui, Guozeng
    Yang, Wei
    Yu, Jinpeng
    Li, Ze
    Tao, Chongben
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (02) : 494 - 498
  • [7] Nonlinear control of active magnetic bearings: A backstepping approach
    deQueiroz, MS
    Dawson, DM
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 1996, 4 (05) : 545 - 552
  • [8] Event-triggered-based adaptive dynamic programming for distributed formation control of multi-UAV
    Dou, Liqian
    Cai, Siyuan
    Zhang, Xiuyun
    Su, Xiaotong
    Zhang, Ruilong
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2022, 359 (08): : 3671 - 3691
  • [9] Direct adaptive NN control of a class of nonlinear systems
    Ge, SS
    Wang, C
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (01): : 214 - 221
  • [10] Robust tracking control for quadrotor with unknown nonlinear dynamics using adaptive neural network based fractional-order backstepping control
    Guettal, Lemya
    Chelihi, Abdelghani
    Ajgou, Riadh
    Touba, Mostefa Mohamed
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2022, 359 (14): : 7337 - 7364