Offline Reinforcement Learning for Quadrotor Control: Overcoming the Ground Effect

被引:0
作者
Sacchetto, Luca [1 ]
Korte, Mathias [1 ]
Gronauer, Sven [1 ]
Diepold, Klaus [1 ]
机构
[1] Tech Univ Munich, Sch Computat Informat & Technol, Arcisstr 21, D-80333 Munich, Germany
来源
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年
关键词
D O I
10.1109/IROS55552.2023.10341599
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Applying Reinforcement Learning to solve real-world optimization problems presents significant challenges because of the large amount of data normally required. A popular solution is to train the algorithms in a simulation and transfer the weights to the real system. However, sim-to-real approaches are prone to fail when the Reality Gap is too big, e.g. in robotic systems with complex and non-linear dynamics. In this work, we propose the use of Offline Reinforcement Learning as a viable alternative to sim-to-real policy transfer to address such instances. On the example of a small quadrotor, we show that the ground effect causes problems in an otherwise functioning zero-shot sim-to-real framework. Our sim-to-real experiments show that, even with the explicit modelling of the ground effect and the employing of popular transfer techniques, the trained policies fail to capture the physical nuances necessary to perform a real-world take-off maneuver. Contrariwise, we show that state-of-the-art Offline Reinforcement Learning algorithms represent a feasible, reliable and sample efficient alternative in this use case.
引用
收藏
页码:7539 / 7544
页数:6
相关论文
共 26 条
  • [1] Cheeseman I.C., 1955, Aeronautical Research Council RM No, V3021
  • [2] Conyers SA, 2018, IEEE INT CONF ROBOT, P1244
  • [3] Dulac-Arnold G., 2019, CHALLENGES REAL WORL
  • [4] Furrer F, 2016, STUD COMPUT INTELL, V625, P595, DOI 10.1007/978-3-319-26054-9_23
  • [5] Golemo F., 2018, Conference on Robot Learning, P817
  • [6] Using Simulation Optimization to Improve Zero-shot Policy Transfer of Quadrotors
    Gronauer, Sven
    Kissel, Matthias
    Sacchetto, Luca
    Korte, Mathias
    Diepold, Klaus
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10170 - 10176
  • [7] Energy-Efficient Online Path Planning of Multiple Drones Using Reinforcement Learning
    Hong, Dooyoung
    Lee, Seonhoon
    Cho, Young Hoo
    Baek, Donkyu
    Kim, Jaemin
    Chang, Naehyuck
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (10) : 9725 - 9740
  • [8] Kostrikov I., 2022, INT C LEARN REPR
  • [9] Kumar A., 2020, P INT C ADV NEUR INF, V33, P1179
  • [10] Kumar A, 2019, ADV NEUR IN, V32