Highway Exiting Planner for Automated Vehicles Using Reinforcement Learning

被引:48
作者
Cao, Zhong [1 ,2 ]
Yang, Diange [1 ]
Xu, Shaobing [2 ]
Peng, Huei [2 ]
Li, Boqi [2 ]
Feng, Shuo [1 ]
Zhao, Ding [3 ]
机构
[1] Tsinghua Univ, Dept Automot Engn, Beijing 100084, Peoples R China
[2] Univ Michigan, Dept Mech Engn, Ann Arbor, MI 48105 USA
[3] Carnegie Mellon Univ, Dept Mech Engn, Pittsburgh, PA 15213 USA
关键词
Road transportation; Safety; Reinforcement learning; Vehicle dynamics; Vehicles; Dynamics; Trajectory; Autonomous vehicle; motion planning; decision making; reinforcement learning; LANE; MODEL;
D O I
10.1109/TITS.2019.2961739
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Exiting from highways in crowded dynamic traffic is an important path planning task for autonomous vehicles (AVs). This task can be challenging because of the uncertain motion of surrounding vehicles and limited sensing/observing window. Conventional path planning methods usually compute a mandatory lane change (MLC) command, but the lane change behavior (e.g., vehicle speed and gap acceptance) should also adapt to traffic conditions and the urgency for exiting. In this paper, we propose a reinforcement learning-enhanced highway-exit planner. The learning-based strategy learns from past failures and adjusts the vehicle motion when the AV fails to exit. The reinforcement learning is based on the Monte Carlo tree search (MCTS) approach. The proposed learning-enhanced highway-exit planner is tested 6000 times in stochastic simulations. The results indicate that the proposed planner achieves a higher probability of successful highway exiting than a benchmark MLC planner.
引用
收藏
页码:990 / 1000
页数:11
相关论文
共 32 条
[1]  
[Anonymous], 1997, P TRAFF CONG TRAFF S
[2]  
[Anonymous], 2015, DECISION MAKING UNCE
[3]  
Brechtel S, 2014, 2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), P392, DOI 10.1109/ITSC.2014.6957722
[4]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[5]  
Buehler M, 2009, SPRINGER TRAC ADV RO, V56, P1, DOI 10.1007/978-3-642-03991-1
[6]   An optimal mandatory lane change decision model for autonomous vehicles in urban arterials [J].
Cao, Peng ;
Hu, Yubai ;
Miwa, Tomio ;
Wakita, Yukiko ;
Morikawa, Takayuki ;
Liu, Xiaobo .
JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 21 (04) :271-284
[7]   A geometry-driven car-following distance estimation algorithm robust to road slopes [J].
Cao, Zhong ;
Yang, Diange ;
Jiang, Kun ;
Xu, Shaobing ;
Wang, Sijia ;
Zhu, Minghan ;
Xiao, Zhongyang .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2019, 102 :274-288
[8]  
Cao Zhong., 2017, Society of Automotive Engineers (SAE)-China Congress, P839
[9]   Wireless Toward the Era of Intelligent Vehicles [J].
Cheng, Xiang ;
Zhang, Rongqing ;
Yang, Liuqing .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (01) :188-202
[10]   5G-Enabled Cooperative Intelligent Vehicular (5GenCIV) Framework: When Benz Meets Marconi [J].
Cheng, Xiang ;
Chen, Chen ;
Zhang, Wuxiong ;
Yang, Yang .
IEEE INTELLIGENT SYSTEMS, 2017, 32 (03) :53-59