Optimal Containment Control of a Quadrotor Team With Active Leaders via Reinforcement Learning

被引:3
|
作者
Cheng, Ming [1 ]
Liu, Hao [2 ,3 ]
Gao, Qing [3 ,4 ]
Lu, Jinhu [3 ,4 ]
Xia, Xiaohua [5 ]
机构
[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
[3] Zhongguancun Lab, Beijing 100191, Peoples R China
[4] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[5] Univ Pretoria, Dept Elect Elect & Comp Engn, ZA-0002 Pretoria, South Africa
基金
美国国家科学基金会; 中国国家自然科学基金; 北京市自然科学基金;
关键词
Cooperative control; multiagent system; optimal control; quadrotor; reinforcement learning (RL); MULTIAGENT SYSTEMS; TRACKING CONTROL;
D O I
10.1109/TCYB.2023.3284648
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article proposes an optimal controller for a team of underactuated quadrotors with multiple active leaders in containment control tasks. The quadrotor dynamics are underactuated, nonlinear, uncertain, and subject to external disturbances. The active team leaders have control inputs to enhance the maneuverability of the containment system. The proposed controller consists of a position control law to guarantee the achievement of position containment and an attitude control law to regulate the rotational motion, which are learned via off-policy reinforcement learning using historical data from quadrotor trajectories. The closed-loop system stability can be guaranteed by theoretical analysis. Simulation results of cooperative transportation missions with multiple active leaders demonstrate the effectiveness of the proposed controller.
引用
收藏
页码:4502 / 4512
页数:11
相关论文
共 50 条
  • [41] Model-Free Linear Noncausal Optimal Control of Wave Energy Converters via Reinforcement Learning
    Zhan, Siyuan
    Ringwood, John V.
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2024, 32 (06) : 2164 - 2177
  • [42] Learning Stabilization Control of Quadrotor in Near-Ground Setting Using Reinforcement Learning
    Briliauskas, Mantas
    INFORMATION TECHNOLOGY AND CONTROL, 2024, 53 (01): : 237 - 242
  • [43] Optimized Backstepping Tracking Control Using Reinforcement Learning for Quadrotor Unmanned Aerial Vehicle System
    Wen, Guoxing
    Hao, Wei
    Feng, Weiwei
    Gao, Kaizhou
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (08): : 5004 - 5015
  • [44] Disturbance rejection and high dynamic quadrotor control based on reinforcement learning and supervised learning
    Mingjun Li
    Zhihao Cai
    Jiang Zhao
    Jinyan Wang
    Yingxun Wang
    Neural Computing and Applications, 2022, 34 : 11141 - 11161
  • [45] Disturbance rejection and high dynamic quadrotor control based on reinforcement learning and supervised learning
    Li, Mingjun
    Cai, Zhihao
    Zhao, Jiang
    Wang, Jinyan
    Wang, Yingxun
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13) : 11141 - 11161
  • [46] Reinforcement Learning and Optimal Control of PMSM Speed Servo System
    Zhao, Jianguo
    Yang, Chunyu
    Gao, Weinan
    Zhou, Linna
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (08) : 8305 - 8313
  • [47] The Challenges of Reinforcement Learning in Robotics and Optimal Control
    El-Telbany, Mohammed E.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 881 - 890
  • [48] Adaptive Optimal Surrounding Control of Multiple Unmanned Surface Vessels via Actor-Critic Reinforcement Learning
    Lu, Renzhi
    Wang, Xiaotao
    Ding, Yiyu
    Zhang, Hai-Tao
    Zhao, Feng
    Zhu, Lijun
    He, Yong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [49] Adaptive Optimal Tracking Control for Uncertain Unmanned Surface Vessel via Reinforcement Learning
    Chen, Lin
    Wang, Min
    Dai, Shi-Lu
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8398 - 8403
  • [50] Using process data to generate an optimal control policy via apprenticeship and reinforcement learning
    Mowbray, Max
    Smith, Robin
    Del Rio-Chanona, Ehecatl A.
    Zhang, Dongda
    AICHE JOURNAL, 2021, 67 (09)