Path tracking control based on Deep reinforcement learning in Autonomous driving

被引：4

作者：

Jiang, Le ^{[1
]}

Wang, Yafei ^{[1
]}

Wang, Lin ^{[2
]}

Wu, Jingkai ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai, Peoples R China

[2] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China

来源：

2019 3RD CONFERENCE ON VEHICLE CONTROL AND INTELLIGENCE (CVCI) | 2019年

关键词：

Reinforcement learning; Autonomous Driving; Lane Keep Assist (LKA); Adaptive Cruise Control (ACC); PID Control; Vehicle Control;

D O I：

10.1109/cvci47823.2019.8951665

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Lane keep assist (LKA) and Adaptive Cruise Control (ACC) are two fundamental yet critical functions for autonomous driving, and conventional methods using PID controllers may not perform well in certain extreme driving conditions. In this paper, we propose a reinforcement learning based approach to train the agent to learn LKA and ACC and hence adapt to diverse scenarios. Particularly, we employ deep deterministic policy gradient (DDPG) algorithm to train the agent and consider both state space and action space as continuous, and designed two neural network critic-network and actor-network to simulate the strategy function and Q-function. Then, we train the two neural networks by deep learning method. Finally, Simulations are conducted with both reinforcement learning and traditional PID controller, and the results of reinforcement learning is more adaptive to extreme road conditions in comparison with a traditional PID controller.

引用

页码：414 / 419

页数：6

共 18 条

[1] Experience Replay for Real-Time Reinforcement Learning Control [J].

Adam, Sander ;

Busoniu, Lucian ;

Babuska, Robert .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (02) :201-212

[2]

[Anonymous], INFORM TECHNOLOGY

[3]

GUEZ A, 2016, C AAAI

[4]

Hinton Geoffrey E., 2006, NEURAL COMPUTATION

[5]

Howard Ronald., 1966, Dynamic programming and Markov processes

[6]

Kingma DP, 2014, ARXIV

[7]

Lange S, 2012, IEEE IJCNN

[8]

li shihao, RES AUTOADAPTIVE CRU, DOI [10.16638/j.cnki.1671-7988.2018.23.064, DOI 10.16638/J.CNKI.1671-7988.2018.23.064]

[9]

Li Y, 2017, P ADV NEUR INF PROC, V30, P3812

[10]

Lillicrap TP, 2015, ARXIV150902971

← 1 2 →