ReinforcementDriving: Exploring Trajectories and Navigation for Autonomous Vehicles

被引：24

作者：

Liu, Meng ^{[1
]}

Zhao, Fei ^{[2
]}

Niu, Jianwei ^{[3
]}

Liu, Yu ^{[1
]}

机构：

[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China

[2] Beihang Univ, Sch Transportat Sci & Engn, Beijing 100191, Peoples R China

[3] Beihang Univ, Beijing Adv Innovat Ctr Big Data & Brain Comp, State Key Lab Virtual Real Technol & Syst, Sch Comp Sci & Engn,Hangzhou Innovat Res Inst, Beijing 100191, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2021年 / 22卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Autonomous vehicles; Navigation; Roads; Trajectory; Robots; Real-time systems; DDPG; lane keeping; navigation; trajectory exploration; autonomous driving; ARCHITECTURE; GAME; GO;

D O I：

10.1109/TITS.2019.2960872

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Autonomous vehicles need to solve the road keeping problem and the existing solutions based on reinforcement learning are mainly implemented in the simulators. The key of transferring the well-trained models to the real world is bridging the gaps between the simulator scenarios and the real scenarios. In this paper, we propose a method called ReinforcementDriving which explores navigation skills and trajectories from simulator for full-sized road keeping. Based on the real scenario, a driving simulator is firstly established to train an intelligent driving agent. The well-trained ReinforcementDriving agent is evaluated in a real-world scenario. We compare our work with human driving, optimal control-based tracking methods and other reinforcement learning-based lane following methods. The results demonstrate that the ReinforcementDriving system can effectively achieve lane keeping in a realistic scenario with satisfactory running time and lateral accuracy.

引用

页码：808 / 820

页数：13

共 38 条

[1]

[Anonymous], 2018, ARXIV180700412

[2]

[Anonymous], 2016, SCI ROBOT

[3]

Bansal M., 2018, arXiv preprint arXiv:1812.03079

[4] Natural actor-critic algorithms [J].

Bhatnagar, Shalabh ;

Sutton, Richard S. ;

Ghavamzadeh, Mohammad ;

Lee, Mark .

AUTOMATICA, 2009, 45 (11) :2471-2482

[5] Terrain-Adaptive Locomotion Skills Using Deep Reinforcement Learning [J].

Bin Peng, Xue ;

Berseth, Glen ;

van de Panne, Michiel .

ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (04)

[6]

Brockman Greg, 2016, ARXIV160601540

[7]

Gu SX, 2016, PR MACH LEARN RES, V48

[8] A LEARNING ARCHITECTURE BASED ON REINFORCEMENT LEARNING FOR ADAPTIVE-CONTROL OF THE WALKING MACHINE LAURON [J].

ILG, W ;

BERNS, K .

ROBOTICS AND AUTONOMOUS SYSTEMS, 1995, 15 (04) :321-334

[9]

Ioffe S., 2015, PMLR, P448, DOI DOI 10.48550/ARXIV.1502.03167

[10]

Jung A., 2017, Self-driving truck

← 1 2 3 4 →