Vision-Based Autonomous Car Racing Using Deep Imitative Reinforcement Learning

Cited by: 52
Authors
Cai, Peide [1]
Wang, Hengli [1]
Huang, Huaiyang [1]
Liu, Yuxuan [1]
Liu, Ming [1]
Affiliations
[1] Hong Kong University of Science and Technology, Hong Kong, People's Republic of China
Keywords
Reinforcement learning; imitation learning; model learning for control; autonomous racing; uncertainty awareness
DOI
10.1109/LRA.2021.3097345
Chinese Library Classification (CLC) number
TP24 [Robotics]
Discipline classification codes
080202; 1405
Abstract
Autonomous car racing is a challenging task in the robotic control area. Traditional modular methods require accurate mapping, localization and planning, which makes them computationally inefficient and sensitive to environmental changes. Recently, deep-learning-based end-to-end systems have shown promising results for autonomous driving/racing. However, they are commonly implemented by supervised imitation learning (IL), which suffers from the distribution mismatch problem, or by reinforcement learning (RL), which requires a huge amount of risky interaction data. In this work, we present a general deep imitative reinforcement learning approach (DIRL), which successfully achieves agile autonomous racing using visual inputs. The driving knowledge is acquired from both IL and model-based RL, where the agent can learn from human teachers as well as perform self-improvement by safely interacting with an offline world model. We validate our algorithm both in a high-fidelity driving simulation and on a real-world 1/20-scale RC-car with limited onboard computation. The evaluation results demonstrate that our method outperforms previous IL and RL methods in terms of sample efficiency and task performance. Demonstration videos are available at https://caipeide.github.io/autorace-dirl/.
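The idea in the abstract of combining imitation with self-improvement inside a learned world model can be illustrated with a deliberately tiny sketch. It is not the authors' implementation: the scalar lane-keeping task, the linear policy and model, the constants `TRUE_B` and `TEACHER_GAIN`, all function names, and the hill-climbing search (swapped in for a real RL algorithm) are assumptions made purely for illustration. A policy is first fitted to teacher demonstrations, a dynamics model is fitted to the same data, and the policy is then refined using imagined rollouts only, with no further interaction with the real system.

```python
# Toy illustration only: scalar lane keeping, not the paper's vision-based networks.
# All constants and names below are hypothetical.

TRUE_B = 0.5        # true steering effect, hidden from the learner
TEACHER_GAIN = 0.8  # a deliberately cautious human teacher


def real_step(x, a):
    """Real (risky) environment: lateral offset x after steering action a."""
    return x + TRUE_B * a


def collect_demos(starts, steps=5):
    """Record (x, a, x_next) tuples while the teacher drives."""
    data = []
    for x in starts:
        for _ in range(steps):
            a = -TEACHER_GAIN * x
            x_next = real_step(x, a)
            data.append((x, a, x_next))
            x = x_next
    return data


def fit_policy_gain(data):
    """Imitation learning: least-squares fit of a = -k * x."""
    return -sum(x * a for x, a, _ in data) / sum(x * x for x, _, _ in data)


def fit_world_model(data):
    """Model learning: least-squares fit of x_next = x + b * a (assumed form)."""
    return sum((xn - x) * a for x, a, xn in data) / sum(a * a for _, a, _ in data)


def rollout_cost(k, b, x0=1.0, steps=20):
    """Sum of squared offsets when policy a = -k * x runs under dynamics gain b."""
    cost, x = 0.0, x0
    for _ in range(steps):
        cost += x * x
        x = x + b * (-k * x)
    return cost


def improve_in_model(k, b_hat, step=0.5, iters=40):
    """Self-improvement using imagined rollouts only (simple hill climbing)."""
    best = rollout_cost(k, b_hat)
    for _ in range(iters):
        for cand in (k + step, k - step):
            c = rollout_cost(cand, b_hat)
            if c < best:
                k, best = cand, c
                break
        else:
            step *= 0.5  # neither direction helped: search more finely
    return k


demos = collect_demos([1.0, -0.5, 0.8])
k_il = fit_policy_gain(demos)                # policy learned from the teacher
b_hat = fit_world_model(demos)               # offline world model
k_improved = improve_in_model(k_il, b_hat)   # refined without real interaction

cost_il = rollout_cost(k_il, TRUE_B)
cost_improved = rollout_cost(k_improved, TRUE_B)
print(f"imitation only: gain={k_il:.3f}, real cost={cost_il:.4f}")
print(f"after model-based improvement: gain={k_improved:.3f}, real cost={cost_improved:.4f}")
```

In this toy setting the imitated policy can only be as good as the teacher, while the refinement step searches for a better gain using the fitted model alone, so the real system is touched only during the demonstrations and the final evaluation.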
Pages: 7262-7269
Number of pages: 8