Vision-Based Autonomous Car Racing Using Deep Imitative Reinforcement Learning

被引：52

作者：

Cai, Peide ^{[1
]}

Wang, Hengli ^{[1
]}

Huang, Huaiyang ^{[1
]}

Liu, Yuxuan ^{[1
]}

Liu, Ming ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2021年 / 6卷 / 04期

关键词：

Reinforcement learning; imitation learning; model learning for control; autonomous racing; uncertainty awareness;

D O I：

10.1109/LRA.2021.3097345

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Autonomous car racing is a challenging task in the robotic control area. Traditional modular methods require accurate mapping, localization and planning, which makes them computationally inefficient and sensitive to environmental changes. Recently, deep-learning-based end-to-end systems have shown promising results for autonomous driving/racing. However, they are commonly implemented by supervised imitation learning (IL), which suffers from the distribution mismatch problem, or by reinforcement learning (RL), which requires a huge amount of risky interaction data. In this work, we present a general deep imitative reinforcement learning approach (DIRL), which successfully achieves agile autonomous racing using visual inputs. The driving knowledge is acquired from both IL and model-based RL, where the agent can learn from human teachers as well as perform self-improvement by safely interacting with an offline world model. We validate our algorithm both in a high-fidelity driving simulation and on a real-world 1/20-scale RC-car with limited onboard computation. The evaluation results demonstrate that our method outperforms previous IL and RL methods in terms of sample efficiency and task performance. Demonstration videos are available at https://caipeide.github.io/autorace-dirl/.

引用

页码：7262 / 7269

页数：8

共 29 条

[1]

Amini A., 2020, P ADV NEUR INF PROC, V33, P14927

[2]

[Anonymous], 2011, Thinking fast and thinking slow

[3]

[Anonymous], 2016, P NIPS

[4]

Baheri A, 2020, P AMER CONTR CONF, P2520, DOI [10.23919/ACC45564.2020.9147510, 10.23919/acc45564.2020.9147510]

[5]

Brunner M, 2017, IEEE DECIS CONTR P, DOI 10.1109/CDC.2017.8264027

[6] VTGNet: A Vision-Based Trajectory Generation Network for Autonomous Vehicles in Urban Environments [J].

Cai, Peide ;

Sun, Yuxiang ;

Wang, Hengli ;

Liu, Ming .

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2021, 6 (03) :419-429

[7] Probabilistic End-to-End Vehicle Navigation in Complex Dynamic Environments With Multimodal Sensor Fusion [J].

Cai, Peide ;

Wang, Sukai ;

Sun, Yuxiang ;

Liu, Ming .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (03) :4218-4224

[8] High-Speed Autonomous Drifting With Deep Reinforcement Learning [J].

Cai, Peide ;

Mei, Xiaodong ;

Tai, Lei ;

Sun, Yuxiang ;

Liu, Ming .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :1247-1254

[9]

Chua K, 2018, ADV NEUR IN, V31

[10]

Codevilla F, 2018, IEEE INT CONF ROBOT, P4693

← 1 2 3 →