Optimal Containment Control of a Quadrotor Team With Active Leaders via Reinforcement Learning

被引：3

作者：

Cheng, Ming ^{[1
]}

Liu, Hao ^{[2
,3
]}

Gao, Qing ^{[3
,4
]}

Lu, Jinhu ^{[3
,4
]}

Xia, Xiaohua ^{[5
]}

机构：

[1] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China

[2] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China

[3] Zhongguancun Lab, Beijing 100191, Peoples R China

[4] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China

[5] Univ Pretoria, Dept Elect Elect & Comp Engn, ZA-0002 Pretoria, South Africa

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2024年 / 54卷 / 08期

基金：

美国国家科学基金会; 中国国家自然科学基金; 北京市自然科学基金;

关键词：

Cooperative control; multiagent system; optimal control; quadrotor; reinforcement learning (RL); MULTIAGENT SYSTEMS; TRACKING CONTROL;

D O I：

10.1109/TCYB.2023.3284648

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article proposes an optimal controller for a team of underactuated quadrotors with multiple active leaders in containment control tasks. The quadrotor dynamics are underactuated, nonlinear, uncertain, and subject to external disturbances. The active team leaders have control inputs to enhance the maneuverability of the containment system. The proposed controller consists of a position control law to guarantee the achievement of position containment and an attitude control law to regulate the rotational motion, which are learned via off-policy reinforcement learning using historical data from quadrotor trajectories. The closed-loop system stability can be guaranteed by theoretical analysis. Simulation results of cooperative transportation missions with multiple active leaders demonstrate the effectiveness of the proposed controller.

引用

页码：4502 / 4512

页数：11

共 50 条

[41] Model-Free Linear Noncausal Optimal Control of Wave Energy Converters via Reinforcement Learning
Zhan, Siyuan
Ringwood, John V.
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2024, 32 (06) : 2164 - 2177
[42] Learning Stabilization Control of Quadrotor in Near-Ground Setting Using Reinforcement Learning
Briliauskas, Mantas
INFORMATION TECHNOLOGY AND CONTROL, 2024, 53 (01): : 237 - 242
[43] Optimized Backstepping Tracking Control Using Reinforcement Learning for Quadrotor Unmanned Aerial Vehicle System
Wen, Guoxing
Hao, Wei
Feng, Weiwei
Gao, Kaizhou
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (08): : 5004 - 5015
[44] Disturbance rejection and high dynamic quadrotor control based on reinforcement learning and supervised learning
Mingjun Li
Zhihao Cai
Jiang Zhao
Jinyan Wang
Yingxun Wang
Neural Computing and Applications, 2022, 34 : 11141 - 11161
[45] Disturbance rejection and high dynamic quadrotor control based on reinforcement learning and supervised learning
Li, Mingjun
Cai, Zhihao
Zhao, Jiang
Wang, Jinyan
Wang, Yingxun
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13) : 11141 - 11161
[46] Reinforcement Learning and Optimal Control of PMSM Speed Servo System
Zhao, Jianguo
Yang, Chunyu
Gao, Weinan
Zhou, Linna
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (08) : 8305 - 8313
[47] The Challenges of Reinforcement Learning in Robotics and Optimal Control
El-Telbany, Mohammed E.
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 881 - 890
[48] Adaptive Optimal Surrounding Control of Multiple Unmanned Surface Vessels via Actor-Critic Reinforcement Learning
Lu, Renzhi
Wang, Xiaotao
Ding, Yiyu
Zhang, Hai-Tao
Zhao, Feng
Zhu, Lijun
He, Yong
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[49] Adaptive Optimal Tracking Control for Uncertain Unmanned Surface Vessel via Reinforcement Learning
Chen, Lin
Wang, Min
Dai, Shi-Lu
2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8398 - 8403
[50] Using process data to generate an optimal control policy via apprenticeship and reinforcement learning
Mowbray, Max
Smith, Robin
Del Rio-Chanona, Ehecatl A.
Zhang, Dongda
AICHE JOURNAL, 2021, 67 (09)

← 1 2 3 4 5 →