A meta-reinforcement learning method for adaptive payload transportation with variations

被引：0

作者：

Chen, Jingyu ^{[1
]}

Ma, Ruidong ^{[2
]}

Xu, Meng ^{[3
]}

Candan, Fethi ^{[2
]}

Mihaylova, Lyudmila ^{[2
]}

Oyekan, John ^{[4
]}

机构：

[1] Chinese Acad Sci, Inst Software, Beijing, Peoples R China

[2] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield, England

[3] Univ Int Business & Econ, Sch Informat Technol & Management, Beijing, Peoples R China

[4] Univ York, Dept Comp Sci, York, England

来源：

NEUROCOMPUTING | 2025年 / 638卷

基金：

英国工程与自然科学研究理事会;

关键词：

Reinforcement learning; Meta-learning; Cooperative transportation; Trajectory tracking; Path planning; LEVEL CONTROL; QUADROTOR;

D O I：

10.1016/j.neucom.2025.130032

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The safe transport of cable-suspended payloads by a group of Unmanned Aerial Vehicles (UAVs) depends on their capacity to effectively respond to fluctuations in the dynamics caused by external variations, such as wind gusts. For group transportation with obstacles, internal variations, such as changes in formation, can also alter the space occupancy of the system related to collision detection. However, traditional adaptive learning methods are challenging to adapt to these two variations. In this paper, we present a learning-based method for collision-free dual-UAV-payload transportation in the presence of varied wind force and formation change. It consists of an adaptive trajectory tracking controller based on meta-model-based reinforcement learning with online adaptation and a novel correction policy, and a path planner that can sample collision-free goal states of the system for the controller based on the meta-collision predictor. The simulation results demonstrate that the proposed trajectory tracking controller outperforms state-of-the-art model-free, model-based, and variational inference methods in terms of payload tracking error reduction and robustness when dealing with the variations mentioned above. Specifically, the proposed controller reduces the average payload tracking error to less than 0.1 metres in most tasks without obstacles. Furthermore, by following the adapted paths generated by the path planner, the trajectory tracking controller can effectively track the payload while ensuring collision-free safety of the dual-UAV-payload system during navigation among obstacles. The success rate of the proposed method is more than 80% in all scenarios with obstacles. Our project website can be seen at https://sites.google.com/view/meta-payload-fly/ and the source code is available at https://github.com/wawachen/Meta-load-fly.

引用

页数：20

共 58 条

[21] Kahn G, 2017, Arxiv, DOI arXiv:1702.01182
[22] Kamthe S, 2018, PR MACH LEARN RES, V84
[23] Fast Online Adaptation in Robotics through Meta-Learning Embeddings of Simulated Priors
Kaushik, Rituraj
Anne, Timothee
Mouret, Jean-Baptiste
[J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5269 - 5276
[24] Probabilistic roadmaps for path planning in high-dimensional configuration spaces
Kavraki, LE
Svestka, P
Latombe, JC
Overmars, MH
[J]. IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1996, 12 (04): : 566 - 580
[25] Kim H., 2003, Adv. Neural Inf. Process. Syst., V16
[26] Lambert N, 2021, Arxiv, DOI arXiv:2002.04523
[27] Low-Level Control of a Quadrotor With Deep Model-Based Reinforcement Learning
Lambert, Nathan O.
Drewe, Daniel S.
Yaconelli, Joseph
Levine, Sergey
Calandra, Roberto
Pister, Kristofer S. J.
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) : 4224 - 4230
[28] Lavalle S., 1998, Research Report 9811
[29] Deep learning
LeCun, Yann
Bengio, Yoshua
Hinton, Geoffrey
[J]. NATURE, 2015, 521 (7553) : 436 - 444
[30] Lee TY, 2011, Arxiv, DOI arXiv:1003.2005

← 1 2 3 4 5 6 →