A DECISION-MAKING METHOD FOR AUTONOMOUS VEHICLES BASED ON SIMULATION AND REINFORCEMENT LEARNING

Cited by: 0
Authors
Zheng, Rui [1]
Liu, Chunming [1]
Guo, Qi [1]
Affiliations
[1] Natl Univ Def Technol, Coll Mechatron & Automat, Changsha 410073, Hunan, Peoples R China
Source
PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4 | 2013
Keywords
Autonomous Vehicles; Reinforcement learning; Markov Decision Process; Autonomous driving; Decision-making; SYSTEM;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Despite many achievements in the field of autonomous driving, several problems remain unsolved. One of them is the difficulty of designing a decision-making system for complex traffic conditions. In recent years, reinforcement learning (RL) has shown potential for solving sequential decision optimization problems, which can be modeled as Markov decision processes (MDPs). In this paper, we establish a 14-DOF dynamic model of an autonomous vehicle and use RL to build a simulation-based decision-making system for autonomous driving. The decision-making process of the vehicle is modeled as an MDP, and its performance is improved using an approximate RL method. Finally, we show the efficiency of the proposed method by simulation in a highway environment.
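As a rough illustration of what modeling a driving decision as an MDP and improving it with approximate RL can look like, the following is a minimal sketch and not the authors' method. It does not include the 14-DOF vehicle model. The two-lane toy environment, the reward values, the three hand-picked features, and all hyperparameters are assumptions made only for this example. The learner is semi-gradient Q-learning with a linear value function.

```python
import random

ACTIONS = (0, 1)  # 0 = keep lane, 1 = change lane (hypothetical action set)
GAMMA, ALPHA, EPSILON = 0.9, 0.05, 0.2  # assumed hyperparameters


def step(state, action):
    """Toy transition: state is (ego_lane, slow_vehicle_lane) on a two-lane road."""
    ego_lane, slow_lane = state
    ego_lane ^= action  # a lane change flips the ego lane between 0 and 1
    blocked = ego_lane == slow_lane
    # Assumed reward: +1 for free flow, -1 when stuck behind the slow vehicle,
    # minus a small cost for every lane change.
    reward = (-1.0 if blocked else 1.0) - 0.1 * action
    return (ego_lane, slow_lane), reward


def features(state, action):
    """Hand-picked features: bias, 'blocked after acting', and the action itself."""
    ego_lane, slow_lane = state
    blocked_after = 1.0 if (ego_lane ^ action) == slow_lane else 0.0
    return (1.0, blocked_after, float(action))


def q_value(weights, state, action):
    """Linear approximation Q(s, a) = w . phi(s, a)."""
    return sum(w * f for w, f in zip(weights, features(state, action)))


def greedy_action(weights, state):
    return max(ACTIONS, key=lambda a: q_value(weights, state, a))


def train(episodes=200, horizon=20, seed=0):
    rng = random.Random(seed)
    weights = [0.0, 0.0, 0.0]
    for _ in range(episodes):
        state = (rng.randint(0, 1), rng.randint(0, 1))
        # Episodes are truncated after `horizon` steps, not terminated,
        # so the update bootstraps from the next state on every step.
        for _ in range(horizon):
            if rng.random() < EPSILON:
                action = rng.choice(ACTIONS)  # explore
            else:
                action = greedy_action(weights, state)  # exploit
            next_state, reward = step(state, action)
            target = reward + GAMMA * max(
                q_value(weights, next_state, a) for a in ACTIONS
            )
            td_error = target - q_value(weights, state, action)
            # Semi-gradient update: the gradient of a linear Q is the feature vector.
            for i, f in enumerate(features(state, action)):
                weights[i] += ALPHA * td_error * f
            state = next_state
    return weights


weights = train()
# Greedy decisions when blocked (same lane as the slow vehicle) and when free.
print(greedy_action(weights, (0, 0)), greedy_action(weights, (1, 0)))
```

In this toy setting the learned policy should prefer changing lanes when blocked and keeping the lane otherwise. A realistic system would replace `step` with a vehicle dynamics simulator and use a richer state description.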
Pages: 362-369
Page count: 8