Driver Intent-Based Intersection Autonomous Driving Collision Avoidance Reinforcement Learning Algorithm

被引：2

作者：

Chen, Ting ^{[1
]}

Chen, Youjing ^{[1
]}

Li, Hao ^{[2
]}

Gao, Tao ^{[1
]}

Tu, Huizhao ^{[2
]}

Li, Siyu ^{[1
]}

机构：

[1] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China

[2] Tongji Univ, Coll Transportat Engn, Key Lab Rd, Traff Engn Minist Educ, Shanghai 201804, Peoples R China

来源：

SENSORS | 2022年 / 22卷 / 24期

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

self-driving vehicles; latent states; variational autoencoder; deep reinforcement learning; INFORMATION; MODEL;

D O I：

10.3390/s22249943

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

With the rapid development of artificial intelligent technology, the deep learning method is widely applied to predict human driving intentions due to its relative accuracy of prediction, which is one of critical links for security guarantee in the distributed, mixed driving scenario. In order to sense the intention of human-driven vehicles and reduce the self-driving collision avoidance rate, an improved intention prediction method for human-driving vehicles based on unsupervised, deep inverse reinforcement learning is proposed. Firstly, a contrast discriminator module was proposed to extract richer features. Then, the residual module was created to overcome the drawbacks of gradient disappearance and network degradation with the increase in network layers. Furthermore, the dropout layer was generated to prevent the over-fitting phenomenon in the whole training process of the GRU network, so as to improve the generalization ability of the network model. Finally, abundant experiments were conducted on datasets to evaluate our proposed method. The pass rate of self-driving vehicles with conservative driver probabilities of p = 0.25, p = 0.4, and p = 0.6 improved by a maximum of 8%, 10%, and 3%, compared with the classical method LSTM and VAE + RNN. It indicates that the prediction results of our proposed method fit more with the basic structure of the given traffic scenario in a long-term prediction range, which verifies the effectiveness of our proposed method.

引用

页数：14

共 29 条

[1]

[Anonymous], 2015, arXiv

[2]

Bai HY, 2015, IEEE INT CONF ROBOT, P454, DOI 10.1109/ICRA.2015.7139219

[3]

Brown K., 2020, ARXIV

[4]

Dong CY, 2017, IEEE INT VEH SYM, P1584, DOI 10.1109/IVS.2017.7995935

[5]

Feng XD, 2019, IEEE INT C INTELL TR, P3514, DOI [10.1109/itsc.2019.8917482, 10.1109/ITSC.2019.8917482]

[6] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[7] Multimodal Deep Generative Models for Trajectory Prediction: A Conditional Variational Autoencoder Approach [J].

Ivanovic, Boris ;

Leung, Karen ;

Schmerling, Edward ;

Pavone, Marco .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) :295-302

[8] Enhanced intelligent driver model to access the impact of driving strategies on traffic capacity [J].

Kesting, Arne ;

Treiber, Martin ;

Helbing, Dirk .

PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2010, 368 (1928) :4585-4605

[9]

Kingma D.P., 2013, arXiv

[10]

Kostrikov Ilya, 2018, PyTorch Implementations of Asynchronous Advantage Actor Critic

← 1 2 3 →