ReinforcementDriving: Exploring Trajectories and Navigation for Autonomous Vehicles

被引:24
作者
Liu, Meng [1 ]
Zhao, Fei [2 ]
Niu, Jianwei [3 ]
Liu, Yu [1 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Transportat Sci & Engn, Beijing 100191, Peoples R China
[3] Beihang Univ, Beijing Adv Innovat Ctr Big Data & Brain Comp, State Key Lab Virtual Real Technol & Syst, Sch Comp Sci & Engn,Hangzhou Innovat Res Inst, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Autonomous vehicles; Navigation; Roads; Trajectory; Robots; Real-time systems; DDPG; lane keeping; navigation; trajectory exploration; autonomous driving; ARCHITECTURE; GAME; GO;
D O I
10.1109/TITS.2019.2960872
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Autonomous vehicles need to solve the road keeping problem and the existing solutions based on reinforcement learning are mainly implemented in the simulators. The key of transferring the well-trained models to the real world is bridging the gaps between the simulator scenarios and the real scenarios. In this paper, we propose a method called ReinforcementDriving which explores navigation skills and trajectories from simulator for full-sized road keeping. Based on the real scenario, a driving simulator is firstly established to train an intelligent driving agent. The well-trained ReinforcementDriving agent is evaluated in a real-world scenario. We compare our work with human driving, optimal control-based tracking methods and other reinforcement learning-based lane following methods. The results demonstrate that the ReinforcementDriving system can effectively achieve lane keeping in a realistic scenario with satisfactory running time and lateral accuracy.
引用
收藏
页码:808 / 820
页数:13
相关论文
共 38 条
[1]  
[Anonymous], 2018, ARXIV180700412
[2]  
[Anonymous], 2016, SCI ROBOT
[3]  
Bansal M., 2018, arXiv preprint arXiv:1812.03079
[4]   Natural actor-critic algorithms [J].
Bhatnagar, Shalabh ;
Sutton, Richard S. ;
Ghavamzadeh, Mohammad ;
Lee, Mark .
AUTOMATICA, 2009, 45 (11) :2471-2482
[5]   Terrain-Adaptive Locomotion Skills Using Deep Reinforcement Learning [J].
Bin Peng, Xue ;
Berseth, Glen ;
van de Panne, Michiel .
ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (04)
[6]  
Brockman Greg, 2016, ARXIV160601540
[7]  
Gu SX, 2016, PR MACH LEARN RES, V48
[8]   A LEARNING ARCHITECTURE BASED ON REINFORCEMENT LEARNING FOR ADAPTIVE-CONTROL OF THE WALKING MACHINE LAURON [J].
ILG, W ;
BERNS, K .
ROBOTICS AND AUTONOMOUS SYSTEMS, 1995, 15 (04) :321-334
[9]  
Ioffe S., 2015, PMLR, P448, DOI DOI 10.48550/ARXIV.1502.03167
[10]  
Jung A., 2017, Self-driving truck