Personalized Car-Following Control Based on a Hybrid of Reinforcement Learning and Supervised Learning

Cited by: 21

Authors:

Song, Dongjian [1]

Zhu, Bing [1]

Zhao, Jian [1]

Han, Jiayi [1]

Chen, Zhicheng [1]

Affiliations:

[1] Jilin Univ, State Key Lab Automot Simulat & Control, Changchun 130022, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2023 / Vol. 24 / No. 06

Funding:

National Natural Science Foundation of China;

Keywords:

Vehicles; Uncertainty; Safety; Vehicle dynamics; Behavioral sciences; Optimization; Mathematical models; Car-following control; intelligent vehicle; personalized; reinforcement learning; supervised learning; BEHAVIOR PREDICTION; MODEL; VEHICLE; DYNAMICS; MEMORY;

DOI:

10.1109/TITS.2023.3245362

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813 ;

Abstract:

With the development of intelligent vehicles, more research has focused on achieving human-like driving. As an important component of intelligent vehicle control, car-following control should ensure safety, tracking, and comfort while considering the acceptance of human drivers. In this paper, we propose a car-following control strategy, pHybrid, based on a hybrid of reinforcement learning (RL) and supervised learning (SL). RL is used to achieve multi-objective collaborative optimization in car-following control, and SL is used to achieve human-like car following. Through the complementary advantages of the two learning methods, pHybrid can achieve high-performance car following while matching the personalized car-following characteristics of human drivers. RL is used as the main framework of pHybrid. In addition, the personalized car-following reference model (PCRM) of human drivers, based on Gaussian mixture regression, and the motion uncertainty model of the preceding vehicle (MUMPV), based on a sequence-to-sequence network, are established and incorporated into the RL framework. PCRM can lead pHybrid to learn the different characteristics of human drivers and improve the anthropomorphism of pHybrid; MUMPV enables pHybrid to consider the dynamic changes of the traffic environment and to become more robust. pHybrid is trained and tested on the HighD dataset, and the generalizability verification is based on a self-built real-vehicle data collection platform. The results show that pHybrid can match human drivers' personalized car-following characteristics and can outperform human drivers in safety, comfort, and tracking of the preceding vehicle.

Citation

Pages: 6014-6029

Number of pages: 16

References (62 in total)

[1] Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles [J].

Aradi, Szilard .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) :740-759

[2] A binary decision model for discretionary lane changing move based on fuzzy inference system [J].

Balal, Esmaeil ;

Cheu, Ruey Long ;

Sarkodie-Gyan, Thompson .

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2016, 67 :47-61

[3] TRAFFIC DYNAMICS - STUDIES IN CAR FOLLOWING [J].

CHANDLER, RE ;

HERMAN, R ;

MONTROLL, EW .

OPERATIONS RESEARCH, 1958, 6 (02) :165-184

[4] Brain-Inspired Cognitive Model With Attention for Self-Driving Cars [J].

Chen, Shitao ;

Zhang, Songyi ;

Shang, Jinghao ;

Chen, Badong ;

Zheng, Nanning .

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2019, 11 (01) :13-25

[5] Self-Learning Optimal Cruise Control Based on Individual Car-Following Style [J].

Chu, Hongqing ;

Guo, Lulu ;

Yan, Yongjun ;

Gao, Bingzhao ;

Chen, Hong .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (10) :6622-6633

[6] Cooperative Adaptive Cruise Control: A Reinforcement Learning Approach [J].

Desjardins, Charles ;

Chaib-draa, Brahim .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (04) :1248-1260

[7] Dhillon I. S., 2003, Journal of Machine Learning Research, V3, P1265, DOI 10.1162/153244303322753661

[8] Fadhloun Karim, 2020, International Journal of Transportation Science and Technology, V9, P14, DOI 10.1016/j.ijtst.2019.05.004

[9] Vehicle Dynamics Model for Estimating Typical Vehicle Accelerations [J].

Fadhloun, Karim ;

Rakha, Hesham ;

Loulizi, Amara ;

Abdelkefi, Abdessattar .

TRANSPORTATION RESEARCH RECORD, 2015, (2491) :61-71

[10] Deep Inverse Reinforcement Learning for Behavior Prediction in Autonomous Driving: Accurate Forecasts of Vehicle Motion [J].

Fernando, Tharindu ;

Denman, Simon ;

Sridharan, Sridha ;

Fookes, Clinton .

IEEE SIGNAL PROCESSING MAGAZINE, 2021, 38 (01) :87-96
