Personalized Car-Following Control Based on a Hybrid of Reinforcement Learning and Supervised Learning

被引:21
作者
Song, Dongjian [1 ]
Zhu, Bing [1 ]
Zhao, Jian [1 ]
Han, Jiayi [1 ]
Chen, Zhicheng [1 ]
机构
[1] Jilin Univ, State Key Lab Automot Simulat & Control, Changchun 130022, Peoples R China
基金
中国国家自然科学基金;
关键词
Vehicles; Uncertainty; Safety; Vehicle dynamics; Behavioral sciences; Optimization; Mathematical models; Car-following control; intelligent vehicle; personalized; reinforcement learning; supervised learning; BEHAVIOR PREDICTION; MODEL; VEHICLE; DYNAMICS; MEMORY;
D O I
10.1109/TITS.2023.3245362
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
With the development of intelligent vehicles, more research has focused on achieving human-like driving. As an important component of intelligent vehicle control, car-following control should ensure safety, tracking, comfort while considering the acceptance of human drivers. In this paper, we propose a car-following control strategy p Hybrid based on a hybrid of reinforcement learning (RL) and supervised learning (SL). RL is used to achieve multi-objective collaborative optimization in car following control, and SL is used to achieve human like car following. Through the complementary advantages of the two learning methods, p Hybrid can achieve high performance car following while matching the personalized car-following characteristics of human drivers. RL is used as the main framework of pHybrid. In addition, the personalized car-following reference model (PCRM) of human drivers based on Gaussian mixture regression, and the motion uncertainty model of preceding vehicle (MUMPV) based on the sequence-to-sequence network are established and incorporated into the RL framework. PCRM can lead pHybrid to learn the different characteristics of human drivers, and improve the anthropomorphism of p Hybrid; MUMPV enables p Hybrid to consider the dynamic changes of the traffic environment and to become more robust. p Hybrid is trained and tested on High D dataset, and the generalizability verification is based on the self-built real vehicle data collection platform. The results show that p Hybrid can match human drivers' personalized car-following characteristics and can outperform human drivers in safety, comfort, and tracking of the preceding vehicle.
引用
收藏
页码:6014 / 6029
页数:16
相关论文
共 62 条
[1]   Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles [J].
Aradi, Szilard .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) :740-759
[2]   A binary decision model for discretionary lane changing move based on fuzzy inference system [J].
Balal, Esmaeil ;
Cheu, Ruey Long ;
Sarkodie-Gyan, Thompson .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2016, 67 :47-61
[3]   TRAFFIC DYNAMICS - STUDIES IN CAR FOLLOWING [J].
CHANDLER, RE ;
HERMAN, R ;
MONTROLL, EW .
OPERATIONS RESEARCH, 1958, 6 (02) :165-184
[4]   Brain-Inspired Cognitive Model With Attention for Self-Driving Cars [J].
Chen, Shitao ;
Zhang, Songyi ;
Shang, Jinghao ;
Chen, Badong ;
Zheng, Nanning .
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2019, 11 (01) :13-25
[5]   Self-Learning Optimal Cruise Control Based on Individual Car-Following Style [J].
Chu, Hongqing ;
Guo, Lulu ;
Yan, Yongjun ;
Gao, Bingzhao ;
Chen, Hong .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (10) :6622-6633
[6]   Cooperative Adaptive Cruise Control: A Reinforcement Learning Approach [J].
Desjardins, Charles ;
Chaib-draa, Brahim .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 12 (04) :1248-1260
[7]  
Dhillon I. S., 2003, Journal of Machine Learning Research, V3, P1265, DOI 10.1162/153244303322753661
[8]  
Fadhloun Karim, 2020, International Journal of Transportation Science and Technology, V9, P14, DOI 10.1016/j.ijtst.2019.05.004
[9]   Vehicle Dynamics Model for Estimating Typical Vehicle Accelerations [J].
Fadhloun, Karim ;
Rakha, Hesham ;
Loulizi, Amara ;
Abdelkefi, Abdessattar .
TRANSPORTATION RESEARCH RECORD, 2015, (2491) :61-71
[10]   Deep Inverse Reinforcement Learning for Behavior Prediction in Autonomous Driving: Accurate Forecasts of Vehicle Motion [J].
Fernando, Tharindu ;
Denman, Simon ;
Sridharan, Sridha ;
Fookes, Clinton .
IEEE SIGNAL PROCESSING MAGAZINE, 2021, 38 (01) :87-96