A Benchmark Comparison of Learned Control Policies for Agile Quadrotor Flight

被引:38
作者
Kaufmann, Elia [1 ,2 ,3 ]
Bauersfeld, Leonard [1 ,2 ,3 ]
Scaramuzza, Davide [1 ,2 ,3 ]
机构
[1] Univ Zurich, Dept Informat, Robot & Percept Grp, Zurich, Switzerland
[2] Univ Zurich, Dept Neuroinformat, Zurich, Switzerland
[3] Swiss Fed Inst Technol, Zurich, Switzerland
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022 | 2022年
基金
瑞士国家科学基金会; 欧洲研究理事会;
关键词
D O I
10.1109/ICRA46639.2022.9811564
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Quadrotors are highly nonlinear dynamical systems that require carefully tuned controllers to be pushed to their physical limits. Recently, learning-based control policies have been proposed for quadrotors, as they would potentially allow learning direct mappings from high-dimensional raw sensory observations to actions. Due to sample inefficiency, training such learned controllers on the real platform is impractical or even impossible. Training in simulation is attractive but requires to transfer policies between domains, which demands trained policies to be robust to such domain gap. In this work, we make two contributions: (i) we perform the first benchmark comparison of existing learned control policies for agile quadrotor flight and show that training a control policy that commands body-rates and thrust results in more robust sim-to-real transfer compared to a policy that directly specifies individual rotor thrusts, (ii) we demonstrate for the first time that such a control policy trained via deep reinforcement learning can control a quadrotor in real-world experiments at speeds over 45 km/h.
引用
收藏
页码:10504 / 10510
页数:7
相关论文
共 30 条
[1]   Learning dexterous in-hand manipulation [J].
Andrychowicz, Marcin ;
Baker, Bowen ;
Chociej, Maciek ;
Jozefowicz, Rafal ;
McGrew, Bob ;
Pachocki, Jakub ;
Petron, Arthur ;
Plappert, Matthias ;
Powell, Glenn ;
Ray, Alex ;
Schneider, Jonas ;
Sidor, Szymon ;
Tobin, Josh ;
Welinder, Peter ;
Weng, Lilian ;
Zaremba, Wojciech .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (01) :3-20
[2]  
[Anonymous], 2016, ROBOT OPERATING SYST
[3]  
[Anonymous], P ROB SCI SYST
[4]  
Bauersfeld L, 2021, ROBOT SCI SYS
[5]  
Belkhale S., 2021, IEEE ROBOT AUTOM LET
[6]  
Dosovitskiy A., 2018, C ROB LEARN CORL
[7]  
Faessler Matthias., 2017, IEEE Robotics and Automation Letters
[8]   A Machine Learning Approach to Visual Perception of Forest Trails for Mobile Robots [J].
Giusti, Alessandro ;
Guzzi, Jerome ;
Ciresan, Dan C. ;
He, Fang-Lin ;
Rodriguez, Juan P. ;
Fontana, Flavio ;
Faessler, Matthias ;
Forster, Christian ;
Schmidhuber, Jurgen ;
Di Caro, Gianni ;
Scaramuzza, Davide ;
Gambardella, Luca M. .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2016, 1 (02) :661-667
[9]  
Hwangbo J., 2017, IEEE ROBOT AUTOM LET
[10]  
Kaufmann E, 2020, ROBOTICS: SCIENCE AND SYSTEMS XVI