A meta-reinforcement learning method for adaptive payload transportation with variations

被引:0
作者
Chen, Jingyu [1 ]
Ma, Ruidong [2 ]
Xu, Meng [3 ]
Candan, Fethi [2 ]
Mihaylova, Lyudmila [2 ]
Oyekan, John [4 ]
机构
[1] Chinese Acad Sci, Inst Software, Beijing, Peoples R China
[2] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield, England
[3] Univ Int Business & Econ, Sch Informat Technol & Management, Beijing, Peoples R China
[4] Univ York, Dept Comp Sci, York, England
基金
英国工程与自然科学研究理事会;
关键词
Reinforcement learning; Meta-learning; Cooperative transportation; Trajectory tracking; Path planning; LEVEL CONTROL; QUADROTOR;
D O I
10.1016/j.neucom.2025.130032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The safe transport of cable-suspended payloads by a group of Unmanned Aerial Vehicles (UAVs) depends on their capacity to effectively respond to fluctuations in the dynamics caused by external variations, such as wind gusts. For group transportation with obstacles, internal variations, such as changes in formation, can also alter the space occupancy of the system related to collision detection. However, traditional adaptive learning methods are challenging to adapt to these two variations. In this paper, we present a learning-based method for collision-free dual-UAV-payload transportation in the presence of varied wind force and formation change. It consists of an adaptive trajectory tracking controller based on meta-model-based reinforcement learning with online adaptation and a novel correction policy, and a path planner that can sample collision-free goal states of the system for the controller based on the meta-collision predictor. The simulation results demonstrate that the proposed trajectory tracking controller outperforms state-of-the-art model-free, model-based, and variational inference methods in terms of payload tracking error reduction and robustness when dealing with the variations mentioned above. Specifically, the proposed controller reduces the average payload tracking error to less than 0.1 metres in most tasks without obstacles. Furthermore, by following the adapted paths generated by the path planner, the trajectory tracking controller can effectively track the payload while ensuring collision-free safety of the dual-UAV-payload system during navigation among obstacles. The success rate of the proposed method is more than 80% in all scenarios with obstacles. Our project website can be seen at https://sites.google.com/view/meta-payload-fly/ and the source code is available at https://github.com/wawachen/Meta-load-fly.
引用
收藏
页数:20
相关论文
共 58 条
  • [21] Kahn G, 2017, Arxiv, DOI arXiv:1702.01182
  • [22] Kamthe S, 2018, PR MACH LEARN RES, V84
  • [23] Fast Online Adaptation in Robotics through Meta-Learning Embeddings of Simulated Priors
    Kaushik, Rituraj
    Anne, Timothee
    Mouret, Jean-Baptiste
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5269 - 5276
  • [24] Probabilistic roadmaps for path planning in high-dimensional configuration spaces
    Kavraki, LE
    Svestka, P
    Latombe, JC
    Overmars, MH
    [J]. IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1996, 12 (04): : 566 - 580
  • [25] Kim H., 2003, Adv. Neural Inf. Process. Syst., V16
  • [26] Lambert N, 2021, Arxiv, DOI arXiv:2002.04523
  • [27] Low-Level Control of a Quadrotor With Deep Model-Based Reinforcement Learning
    Lambert, Nathan O.
    Drewe, Daniel S.
    Yaconelli, Joseph
    Levine, Sergey
    Calandra, Roberto
    Pister, Kristofer S. J.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) : 4224 - 4230
  • [28] Lavalle S., 1998, Research Report 9811
  • [29] Deep learning
    LeCun, Yann
    Bengio, Yoshua
    Hinton, Geoffrey
    [J]. NATURE, 2015, 521 (7553) : 436 - 444
  • [30] Lee TY, 2011, Arxiv, DOI arXiv:1003.2005