A meta-reinforcement learning method for adaptive payload transportation with variations

被引:0
作者
Chen, Jingyu [1 ]
Ma, Ruidong [2 ]
Xu, Meng [3 ]
Candan, Fethi [2 ]
Mihaylova, Lyudmila [2 ]
Oyekan, John [4 ]
机构
[1] Chinese Acad Sci, Inst Software, Beijing, Peoples R China
[2] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield, England
[3] Univ Int Business & Econ, Sch Informat Technol & Management, Beijing, Peoples R China
[4] Univ York, Dept Comp Sci, York, England
基金
英国工程与自然科学研究理事会;
关键词
Reinforcement learning; Meta-learning; Cooperative transportation; Trajectory tracking; Path planning; LEVEL CONTROL; QUADROTOR;
D O I
10.1016/j.neucom.2025.130032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The safe transport of cable-suspended payloads by a group of Unmanned Aerial Vehicles (UAVs) depends on their capacity to effectively respond to fluctuations in the dynamics caused by external variations, such as wind gusts. For group transportation with obstacles, internal variations, such as changes in formation, can also alter the space occupancy of the system related to collision detection. However, traditional adaptive learning methods are challenging to adapt to these two variations. In this paper, we present a learning-based method for collision-free dual-UAV-payload transportation in the presence of varied wind force and formation change. It consists of an adaptive trajectory tracking controller based on meta-model-based reinforcement learning with online adaptation and a novel correction policy, and a path planner that can sample collision-free goal states of the system for the controller based on the meta-collision predictor. The simulation results demonstrate that the proposed trajectory tracking controller outperforms state-of-the-art model-free, model-based, and variational inference methods in terms of payload tracking error reduction and robustness when dealing with the variations mentioned above. Specifically, the proposed controller reduces the average payload tracking error to less than 0.1 metres in most tasks without obstacles. Furthermore, by following the adapted paths generated by the path planner, the trajectory tracking controller can effectively track the payload while ensuring collision-free safety of the dual-UAV-payload system during navigation among obstacles. The success rate of the proposed method is more than 80% in all scenarios with obstacles. Our project website can be seen at https://sites.google.com/view/meta-payload-fly/ and the source code is available at https://github.com/wawachen/Meta-load-fly.
引用
收藏
页数:20
相关论文
共 58 条
  • [1] Autonomous Helicopter Aerobatics through Apprenticeship Learning
    Abbeel, Pieter
    Coates, Adam
    Ng, Andrew Y.
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2010, 29 (13) : 1608 - 1639
  • [2] Bansal S, 2016, IEEE DECIS CONTR P, P4653, DOI 10.1109/CDC.2016.7798978
  • [3] A Review of Real-Time Implementable Cooperative Aerial Manipulation Systems
    Barakou, Stamatina C.
    Tzafestas, Costas S.
    Valavanis, Kimon P.
    [J]. DRONES, 2024, 8 (05)
  • [4] Model-Based Meta-Reinforcement Learning for Flight With Suspended Payloads
    Belkhale, Suneel
    Li, Rachel
    Kahn, Gregory
    McAllister, Rowan
    Calandra, Roberto
    Levine, Sergey
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 1471 - 1478
  • [5] Model Predictive Contouring Control for Collision Avoidance in Unstructured Dynamic Environments
    Brito, Bruno
    Floor, Boaz
    Ferranti, Laura
    Alonso-Mora, Javier
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) : 4459 - 4466
  • [6] Fully body visual self-modeling of robot morphologies
    Chen, Boyuan
    Kwiatkowski, Robert
    Vondrick, Carl
    Lipson, Hod
    [J]. SCIENCE ROBOTICS, 2022, 7 (68)
  • [7] Chen J., 2023, 2023 IEEE C EV COMP, P1, DOI [10.1109/CEC53210.2023.10254023, DOI 10.1109/CEC53210.2023.10254023]
  • [8] A deep multi-agent reinforcement learning framework for autonomous aerial navigation to grasping points on loads
    Chen, Jingyu
    Ma, Ruidong
    Oyekan, John
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2023, 167
  • [9] RL-RRT: Kinodynamic Motion Planning via Learning Reachability Estimators From RL Policies
    Chiang, Hao-Tien Lewis
    Hsu, Jasmine
    Fiser, Marek
    Tapia, Lydia
    Faust, Aleksandra
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04): : 4298 - 4305
  • [10] Chua K, 2018, ADV NEUR IN, V31