Driver Behavior Modeling via Inverse Reinforcement Learning Based on Particle Swarm Optimization

被引：1

作者：

Liu, Zeng-Jie ^{[1
]}

Wu, Huai-Ning ^{[1
]}

机构：

[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China

来源：

2020 CHINESE AUTOMATION CONGRESS (CAC 2020) | 2020年

关键词：

Driver behavior modeling; T-S fuzzy model; inverse reinforcement learning; particle swarm optimization;

D O I：

10.1109/CAC51589.2020.9327174

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, an inverse reinforcement learning method based on particle swarm optimization (PSO) is proposed to model driver's steering behavior. Initially, the vehicle dynamics is represented by a Takagi-Sugeno (T-S) fuzzy model which provides a method of approximating Q-function. Then the driver behavior model is described as an optimal control policy with decision-making model which illustrates the driving style. Subsequently, the Q-function is approximated by a quadratic polynomial-in-memberships form and the PSO algorithm is used to obtain the decision-making model from the driving data. And the corresponding optimal control policy is obtained by using the Q-learning policy iteration method. Finally, a numerical simulation is carried to show the effectiveness of the proposed method.

引用

页码：7232 / 7237

页数：6

共 50 条

[31] Biogeography-based learning particle swarm optimization
Xu Chen
Huaglory Tianfield
Congli Mei
Wenli Du
Guohai Liu
Soft Computing, 2017, 21 : 7519 - 7541
[32] Particle Swarm Optimization based on Vector Gaussian Learning
Zhao, Jia
Lv, Li
Wang, Hui
Sun, Hui
Wu, Runxiu
Nie, Jugen
Xie, Zhifeng
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (04): : 2038 - 2057
[33] A behavior fusion method based on inverse reinforcement learning
Shi, Haobin
Li, Jingchen
Chen, Shicong
Hwang, Kao-Shing
INFORMATION SCIENCES, 2022, 609 : 429 - 444
[34] Decision Boundary Learning Based on Particle Swarm Optimization
Watarai, Kyohei
Zhao, Qiangfu
Kaneda, Yuya
4TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2012), 2012, : 41 - 45
[35] Inverse Design of Metamaterial Absorber Sensor Based on Particle Swarm Optimization
Han D.
Ma Z.
Wang J.
Wang X.
Liu S.
Zhongguo Jiguang/Chinese Journal of Lasers, 2022, 49 (17):
[36] A Bayesian Approach forQuantifying Data Scarcity when Modeling Human Behavior via Inverse Reinforcement Learning
Hossain, Tahera
Shen, Wanggang
Antar, Anindya
Prabhudesai, Snehal
Inoue, Sozo
Huan, Xun
Banovic, Nikola
ACM TRANSACTIONS ON COMPUTER-HUMAN INTERACTION, 2023, 30 (01)
[37] Multi-swarm particle swarm optimization based on autonomic learning and elite swarm
Jiang, Hai-Yan
Wang, Fang-Fang
Guo, Xiao-Qing
Zhuang, Jia-Xiang
Kongzhi yu Juece/Control and Decision, 2014, 29 (11): : 2034 - 2040
[38] Multi-swarm Particle Swarm Optimization Based on Mixed Search Behavior
Jie, Jing
Wang, Wanliang
Liu, Chunsheng
Hou, Beiping
ICIEA 2010: PROCEEDINGS OF THE 5TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOL 2, 2010, : 32 - +
[39] Modeling framework of human driving behavior based on Deep Maximum Entropy Inverse Reinforcement Learning
Wang, Yongjie
Niu, Yuchen
Xiao, Mei
Zhu, Wenying
You, Xinshang
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2024, 652
[40] A new Reinforcement Learning-based Memetic Particle Swarm Optimizer
Samma, Hussein
Lim, Chee Peng
Saleh, Junita Mohamad
APPLIED SOFT COMPUTING, 2016, 43 : 276 - 297

← 1 2 3 4 5 →