Efficient-enhanced Reinforcement Learning for Autonomous Driving in Urban Traffic Scenarios

被引：0

作者：

Yin, Jianwen ^{[1
,2
]}

Jiang, Zhengmin ^{[1
,2
]}

Liang, Qingyi ^{[1
,3
]}

Li, Wenfei ^{[4
]}

Pan, Zhongming ^{[1
]}

Li, Huiyun ^{[1
]}

Liu, Jia ^{[1
]}

机构：

[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Key Lab Human Machine Intelligence Synergy Syst, Shenzhen 518055, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[3] Southern Univ Sci & Technol, Shenzhen 518055, Peoples R China

[4] Zhejiang Lab, Res Inst Interdisciplinary, Res Ctr Intelligent Transportat, Hangzhou 311121, Peoples R China

来源：

2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC | 2023年

关键词：

D O I：

10.1109/ITSC57777.2023.10422557

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Decision intelligence based on reinforcement learning has gained considerable attention. Compared to the rule-based approach, the reinforcement learning-based approach shows great potential in addressing the challenge of high interaction. However, in urban traffic scenarios, decision-making of autonomous vehicles remains a great challenge, including sample efficiency and stability. In this paper, we develop an efficient reinforcement learning approach with advanced features toward end-to-end navigation in urban traffic scenarios. Firstly, bird's-eye-view (BEV) semantic segmentation is served as the concise representation of traffic scenarios, which can boost the training process. Then the expert demonstration is applied to guide the exploration of the policy at the initial training stage, thereby improving the sample efficiency. At last, we employ the quantile regression to estimate the value distribution in order to improve the stability of the policy. We validate our approach using different simulated traffic scenarios. Experimental results show that our approach has better performance in terms of convergence and stability compared to other baselines.

引用

页码：887 / 893

页数：7

共 23 条

[1] Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles [J].

Aradi, Szilard .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) :740-759

[2]

Bellemare MG, 2017, PR MACH LEARN RES, V70

[3]

Brantley K., 2020, INT C LEARNING REPRE

[4]

Chae H, 2017, IEEE INT C INTELL TR

[5]

Chekroun R., 2021, arXiv

[6]

Chen JY, 2019, IEEE INT C INTELL TR, P2765, DOI [10.1109/ITSC.2019.8917306, 10.1109/itsc.2019.8917306]

[7]

Dabney W, 2018, PR MACH LEARN RES, V80

[8]

Dabney W, 2018, AAAI CONF ARTIF INTE, P2892

[9]

Dosovitskiy A, 2017, PR MACH LEARN RES, V78

[10]

Haarnoja T, 2019, Arxiv, DOI arXiv:1812.05905

← 1 2 3 →