Driver Behavior Modeling via Inverse Reinforcement Learning Based on Particle Swarm Optimization

Cited by: 1
Authors
Liu, Zeng-Jie [1]
Wu, Huai-Ning [1]
Affiliations
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China
Source
2020 CHINESE AUTOMATION CONGRESS (CAC 2020) | 2020
Keywords
Driver behavior modeling; T-S fuzzy model; inverse reinforcement learning; particle swarm optimization;
DOI
10.1109/CAC51589.2020.9327174
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, an inverse reinforcement learning method based on particle swarm optimization (PSO) is proposed to model a driver's steering behavior. First, the vehicle dynamics are represented by a Takagi-Sugeno (T-S) fuzzy model, which provides a means of approximating the Q-function. The driver behavior model is then described as an optimal control policy together with a decision-making model that characterizes the driving style. Next, the Q-function is approximated in a quadratic polynomial-in-memberships form, and the PSO algorithm is used to recover the decision-making model from driving data. The corresponding optimal control policy is obtained with the Q-learning policy iteration method. Finally, a numerical simulation is carried out to show the effectiveness of the proposed method.
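To make the role of PSO in the abstract concrete, the following is a minimal, generic PSO sketch in Python. It is not the authors' implementation: the abstract does not give the cost function, the parameterization of the decision-making model, or any PSO settings. The function name `pso_minimize`, the toy objective `mismatch`, the target weights `[1.0, 0.5]`, and all hyperparameters are hypothetical stand-ins, chosen only to show how a swarm searches for parameters that minimize a data-fitting error.

```python
import random


def pso_minimize(objective, dim, bounds, n_particles=20, n_iters=100,
                 inertia=0.7, c1=1.5, c2=1.5, seed=0):
    """Generic particle swarm minimizer (illustrative, not the paper's code)."""
    rng = random.Random(seed)
    lo, hi = bounds
    # Random initial positions inside the box, zero initial velocities.
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    # Personal bests start at the initial positions.
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity update: inertia + pull toward personal and global bests.
                vel[i][d] = (inertia * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Move the particle and clamp it to the search box.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


# Hypothetical stand-in for a data-fitting cost: squared distance between
# candidate weights and assumed "true" weights.
TARGET = [1.0, 0.5]


def mismatch(weights):
    return sum((w - t) ** 2 for w, t in zip(weights, TARGET))


best_weights, best_cost = pso_minimize(mismatch, dim=2, bounds=(-5.0, 5.0))
```

In an inverse reinforcement learning setting such as the one described, the objective would presumably measure the mismatch between recorded driver actions and the actions produced by the policy induced by a candidate parameter vector, so each objective evaluation would itself involve solving for that policy.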
Pages: 7232-7237
Page count: 6
Related papers
19 in total
[1]  
Aghasadeghi N, 2011, IEEE INT C INT ROBOT, P1561, DOI 10.1109/IROS.2011.6048804
[2]   A Survey of Particle Swarm Optimization Applications in Electric Power Systems [J].
AlRashidi, M. R. ;
El-Hawary, M. E. .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2009, 13 (04) :913-918
[3]  
[Anonymous], 2015, THESIS
[4]  
[Anonymous], 2004, P 21 INT C MACH LEAR
[5]  
[Anonymous], 2004, Fuzzy control systems design and analysis: a linear matrix inequality approach
[6]  
Boularias A., 2011, P 14 INT C ART INT S, P182
[7]   Improvement of LMI controllers of Takagi-Sugeno models via Q-learning [J].
Diaz, Henry ;
Armesto, Leopoldo ;
Sala, Antonio .
IFAC PAPERSONLINE, 2016, 49 (05) :67-72
[8]   Adaptive learning of human motor behaviors: An evolving inverse optimal control approach [J].
El-Hussieny, Haitham ;
Abouelsoud, A. A. ;
Assal, Samy F. M. ;
Megahed, Said M. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 50 :115-124
[9]  
Guo H. Y., 2013, IFAC P, V46, P133
[10]   MODELING OF DRIVER VEHICLE DIRECTIONAL CONTROL-SYSTEM [J].
GUO, K ;
GUAN, H .
VEHICLE SYSTEM DYNAMICS, 1993, 22 (3-4) :141-184