Reinforcement learning and model predictive control for robust embedded quadrotor guidance and control

被引：0

作者：

Colin Greatwood

Arthur G. Richards

机构：

[1] University of Bristol,Department of Aerospace Engineering

[2] Bristol Robotics Laboratory,undefined

来源：

Autonomous Robots | 2019年 / 43卷

关键词：

Model predictive control; Reinforcement learning; Exploration; Micro air vehicle;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

A new method for enabling a quadrotor micro air vehicle (MAV) to navigate unknown environments using reinforcement learning (RL) and model predictive control (MPC) is developed. An efficient implementation of MPC provides vehicle control and obstacle avoidance. RL is used to guide the MAV through complex environments where dead-end corridors may be encountered and backtracking is necessary. All of the presented algorithms were deployed on embedded hardware using automatic code generation from Simulink. Results are given for flight tests, demonstrating that the algorithms perform well with modest computing requirements and robust navigation.

引用

页码：1681 / 1693

页数：12

共 49 条

[1] Aswani A(2013)Provably safe and robust learning-based model predictive control Automatica 49 1216-1226
[2] Gonzalez H(1986)A robust layered control system for a mobile robot IEEE Journal of Robotics and Automation 2 14-23
[3] Sastry SS(2010)Direct method based control system for an autonomous quadrotor Journal of Intelligent & Robotic Systems 60 285-316
[4] Tomlin C(2014)Predictive control using an fpga with application to aircraft control IEEE Transactions on Control Systems Technology 22 1006-1017
[5] Brooks R(2008)Real-time indoor autonomous vehicle test environment Control Systems, IEEE 28 51-64
[6] Cowling I(2008)MPC for tracking piecewise constant references for constrained linear systems Automatica 44 2382-2387
[7] Yakimenko O(1997)Globally consistent range scan alignment for environment mapping Autonomous Robots 4 333-349
[8] Whidborne J(2009)Linear offset-free model predictive control Automatica 45 2214-2222
[9] Cooke A(1992)Integration of representation into goal-driven behavior-based robots IEEE Transactions on Robotics and Automation 8 304-312
[10] Hartley EN(2010)The grasp multiple micro-uav testbed Robotics & Automation Magazine, IEEE 17 56-65

← 1 2 3 4 5 →