A meta-reinforcement learning method for adaptive payload transportation with variations

被引：0

作者：

Chen, Jingyu ^{[1
]}

Ma, Ruidong ^{[2
]}

Xu, Meng ^{[3
]}

Candan, Fethi ^{[2
]}

Mihaylova, Lyudmila ^{[2
]}

Oyekan, John ^{[4
]}

机构：

[1] Chinese Acad Sci, Inst Software, Beijing, Peoples R China

[2] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield, England

[3] Univ Int Business & Econ, Sch Informat Technol & Management, Beijing, Peoples R China

[4] Univ York, Dept Comp Sci, York, England

来源：

NEUROCOMPUTING | 2025年 / 638卷

基金：

英国工程与自然科学研究理事会;

关键词：

Reinforcement learning; Meta-learning; Cooperative transportation; Trajectory tracking; Path planning; LEVEL CONTROL; QUADROTOR;

D O I：

10.1016/j.neucom.2025.130032

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The safe transport of cable-suspended payloads by a group of Unmanned Aerial Vehicles (UAVs) depends on their capacity to effectively respond to fluctuations in the dynamics caused by external variations, such as wind gusts. For group transportation with obstacles, internal variations, such as changes in formation, can also alter the space occupancy of the system related to collision detection. However, traditional adaptive learning methods are challenging to adapt to these two variations. In this paper, we present a learning-based method for collision-free dual-UAV-payload transportation in the presence of varied wind force and formation change. It consists of an adaptive trajectory tracking controller based on meta-model-based reinforcement learning with online adaptation and a novel correction policy, and a path planner that can sample collision-free goal states of the system for the controller based on the meta-collision predictor. The simulation results demonstrate that the proposed trajectory tracking controller outperforms state-of-the-art model-free, model-based, and variational inference methods in terms of payload tracking error reduction and robustness when dealing with the variations mentioned above. Specifically, the proposed controller reduces the average payload tracking error to less than 0.1 metres in most tasks without obstacles. Furthermore, by following the adapted paths generated by the path planner, the trajectory tracking controller can effectively track the payload while ensuring collision-free safety of the dual-UAV-payload system during navigation among obstacles. The success rate of the proposed method is more than 80% in all scenarios with obstacles. Our project website can be seen at https://sites.google.com/view/meta-payload-fly/ and the source code is available at https://github.com/wawachen/Meta-load-fly.

引用

页数：20

共 58 条

[1] Autonomous Helicopter Aerobatics through Apprenticeship Learning
Abbeel, Pieter
Coates, Adam
Ng, Andrew Y.
[J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2010, 29 (13) : 1608 - 1639
[2] Bansal S, 2016, IEEE DECIS CONTR P, P4653, DOI 10.1109/CDC.2016.7798978
[3] A Review of Real-Time Implementable Cooperative Aerial Manipulation Systems
Barakou, Stamatina C.
Tzafestas, Costas S.
Valavanis, Kimon P.
[J]. DRONES, 2024, 8 (05)
[4] Model-Based Meta-Reinforcement Learning for Flight With Suspended Payloads
Belkhale, Suneel
Li, Rachel
Kahn, Gregory
McAllister, Rowan
Calandra, Roberto
Levine, Sergey
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 1471 - 1478
[5] Model Predictive Contouring Control for Collision Avoidance in Unstructured Dynamic Environments
Brito, Bruno
Floor, Boaz
Ferranti, Laura
Alonso-Mora, Javier
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) : 4459 - 4466
[6] Fully body visual self-modeling of robot morphologies
Chen, Boyuan
Kwiatkowski, Robert
Vondrick, Carl
Lipson, Hod
[J]. SCIENCE ROBOTICS, 2022, 7 (68)
[7] Chen J., 2023, 2023 IEEE C EV COMP, P1, DOI [10.1109/CEC53210.2023.10254023, DOI 10.1109/CEC53210.2023.10254023]
[8] A deep multi-agent reinforcement learning framework for autonomous aerial navigation to grasping points on loads
Chen, Jingyu
Ma, Ruidong
Oyekan, John
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 167
[9] RL-RRT: Kinodynamic Motion Planning via Learning Reachability Estimators From RL Policies
Chiang, Hao-Tien Lewis
Hsu, Jasmine
Fiser, Marek
Tapia, Lydia
Faust, Aleksandra
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04): : 4298 - 4305
[10] Chua K, 2018, ADV NEUR IN, V31

← 1 2 3 4 5 6 →