Offline Reinforcement Learning for Quadrotor Control: Overcoming the Ground Effect

被引：0

作者：

Sacchetto, Luca ^{[1
]}

Korte, Mathias ^{[1
]}

Gronauer, Sven ^{[1
]}

Diepold, Klaus ^{[1
]}

机构：

[1] Tech Univ Munich, Sch Computat Informat & Technol, Arcisstr 21, D-80333 Munich, Germany

来源：

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年

关键词：

D O I：

10.1109/IROS55552.2023.10341599

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Applying Reinforcement Learning to solve real-world optimization problems presents significant challenges because of the large amount of data normally required. A popular solution is to train the algorithms in a simulation and transfer the weights to the real system. However, sim-to-real approaches are prone to fail when the Reality Gap is too big, e.g. in robotic systems with complex and non-linear dynamics. In this work, we propose the use of Offline Reinforcement Learning as a viable alternative to sim-to-real policy transfer to address such instances. On the example of a small quadrotor, we show that the ground effect causes problems in an otherwise functioning zero-shot sim-to-real framework. Our sim-to-real experiments show that, even with the explicit modelling of the ground effect and the employing of popular transfer techniques, the trained policies fail to capture the physical nuances necessary to perform a real-world take-off maneuver. Contrariwise, we show that state-of-the-art Offline Reinforcement Learning algorithms represent a feasible, reliable and sample efficient alternative in this use case.

引用

页码：7539 / 7544

页数：6

共 26 条

[1] Cheeseman I.C., 1955, Aeronautical Research Council RM No, V3021
[2] Conyers SA, 2018, IEEE INT CONF ROBOT, P1244
[3] Dulac-Arnold G., 2019, CHALLENGES REAL WORL
[4] Furrer F, 2016, STUD COMPUT INTELL, V625, P595, DOI 10.1007/978-3-319-26054-9_23
[5] Golemo F., 2018, Conference on Robot Learning, P817
[6] Using Simulation Optimization to Improve Zero-shot Policy Transfer of Quadrotors
Gronauer, Sven
Kissel, Matthias
Sacchetto, Luca
Korte, Mathias
Diepold, Klaus
[J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10170 - 10176
[7] Energy-Efficient Online Path Planning of Multiple Drones Using Reinforcement Learning
Hong, Dooyoung
Lee, Seonhoon
Cho, Young Hoo
Baek, Donkyu
Kim, Jaemin
Chang, Naehyuck
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (10) : 9725 - 9740
[8] Kostrikov I., 2022, INT C LEARN REPR
[9] Kumar A., 2020, P INT C ADV NEUR INF, V33, P1179
[10] Kumar A, 2019, ADV NEUR IN, V32

← 1 2 3 →