Driver Behavior Modeling via Inverse Reinforcement Learning Based on Particle Swarm Optimization

被引：1

作者：

Liu, Zeng-Jie ^{[1
]}

Wu, Huai-Ning ^{[1
]}

机构：

[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China

来源：

2020 CHINESE AUTOMATION CONGRESS (CAC 2020) | 2020年

关键词：

Driver behavior modeling; T-S fuzzy model; inverse reinforcement learning; particle swarm optimization;

D O I：

10.1109/CAC51589.2020.9327174

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, an inverse reinforcement learning method based on particle swarm optimization (PSO) is proposed to model driver's steering behavior. Initially, the vehicle dynamics is represented by a Takagi-Sugeno (T-S) fuzzy model which provides a method of approximating Q-function. Then the driver behavior model is described as an optimal control policy with decision-making model which illustrates the driving style. Subsequently, the Q-function is approximated by a quadratic polynomial-in-memberships form and the PSO algorithm is used to obtain the decision-making model from the driving data. And the corresponding optimal control policy is obtained by using the Q-learning policy iteration method. Finally, a numerical simulation is carried to show the effectiveness of the proposed method.

引用

页码：7232 / 7237

页数：6

共 19 条

[1]

Aghasadeghi N, 2011, IEEE INT C INT ROBOT, P1561, DOI 10.1109/IROS.2011.6048804

[2] A Survey of Particle Swarm Optimization Applications in Electric Power Systems [J].

AlRashidi, M. R. ;

El-Hawary, M. E. .

IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2009, 13 (04) :913-918

[3]

[Anonymous], 2015, THESIS

[4]

[Anonymous], 2004, P 21 INT C MACH LEAR

[5]

[Anonymous], 2004, Fuzzy control systems design and analysis: a linear matrix inequality approach

[6]

Boularias A., 2011, P 14 INT C ART INT S, P182

[7] Improvement of LMI controllers of Takagi-Sugeno models via Q-learning [J].

Diaz, Henry ;

Armesto, Leopoldo ;

Sala, Antonio .

IFAC PAPERSONLINE, 2016, 49 (05) :67-72

[8] Adaptive learning of human motor behaviors: An evolving inverse optimal control approach [J].

El-Hussieny, Haitham ;

Abouelsoud, A. A. ;

Assal, Samy F. M. ;

Megahed, Said M. .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 50 :115-124

[9]

Guo H. Y., 2013, IFAC P, V46, P133

[10] MODELING OF DRIVER VEHICLE DIRECTIONAL CONTROL-SYSTEM [J].

GUO, K ;

GUAN, H .

VEHICLE SYSTEM DYNAMICS, 1993, 22 (3-4) :141-184

← 1 2 →