Driver Behavior Modeling via Inverse Reinforcement Learning Based on Particle Swarm Optimization

被引：1

作者：

Liu, Zeng-Jie ^{[1
]}

Wu, Huai-Ning ^{[1
]}

机构：

[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China

来源：

2020 CHINESE AUTOMATION CONGRESS (CAC 2020) | 2020年

关键词：

Driver behavior modeling; T-S fuzzy model; inverse reinforcement learning; particle swarm optimization;

D O I：

10.1109/CAC51589.2020.9327174

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, an inverse reinforcement learning method based on particle swarm optimization (PSO) is proposed to model driver's steering behavior. Initially, the vehicle dynamics is represented by a Takagi-Sugeno (T-S) fuzzy model which provides a method of approximating Q-function. Then the driver behavior model is described as an optimal control policy with decision-making model which illustrates the driving style. Subsequently, the Q-function is approximated by a quadratic polynomial-in-memberships form and the PSO algorithm is used to obtain the decision-making model from the driving data. And the corresponding optimal control policy is obtained by using the Q-learning policy iteration method. Finally, a numerical simulation is carried to show the effectiveness of the proposed method.

引用

页码：7232 / 7237

页数：6

共 50 条

[41] Enhanced Particle Swarm Optimization Based on Reference Direction and Inverse Model for Optimization Problems
Li, Wei
Fan, Yaochi
Xu, Qingzheng
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 98 - 129
[42] Enhanced Particle Swarm Optimization Based on Reference Direction and Inverse Model for Optimization Problems
Wei Li
Yaochi Fan
Qingzheng Xu
International Journal of Computational Intelligence Systems, 2020, 13 : 98 - 129
[43] Example-based learning particle swarm optimization for continuous optimization
Huang, Han
Qin, Hu
Hao, Zhifeng
Lim, Andrew
INFORMATION SCIENCES, 2012, 182 (01) : 125 - 138
[44] Analyzing Sensor-Based Individual and Population Behavior Patterns via Inverse Reinforcement Learning
Lin, Beiyu
Cook, Diane J.
SENSORS, 2020, 20 (18) : 1 - 21
[45] OPTIMAL BANDWIDTH DESIGN FOR LAZY LEARNING VIA PARTICLE SWARM OPTIMIZATION
Pan, Tian Hong
Li, Shaoyuan
Li, Ning
INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2009, 15 (01) : 1 - 11
[46] Learning Behavior Styles with Inverse Reinforcement Learning
Lee, Seong Jae
popovic, Zoran
ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04):
[47] Contribution to Inverse Kinematic Modeling of a Planar Continuum Robot Using a Particle Swarm Optimization
Amouri, Ammar
Mahfoudi, Chawki
Zaatri, Abdelouahab
MULTIPHYSICS MODELLING AND SIMULATION FOR SYSTEMS DESIGN AND MONITORING, 2015, 2 : 141 - 150
[48] Dynamic Multi-swarm Particle Swarm Optimization Based on Mite Learning
Tang, Yichao
Wei, Bo
Xia, Xuewen
Gui, Ling
2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2311 - 2318
[49] Dynamic Multi-Swarm Particle Swarm Optimization Based on Elite Learning
Xia, Xuewen
Tang, Yichao
Wei, Bo
Gui, Ling
IEEE ACCESS, 2019, 7 : 184849 - 184865
[50] Sparse Unmixing for Hyperspectral Imagery via Comprehensive-Learning-Based Particle Swarm Optimization
Miao, Yapeng
Yang, Bin
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 9727 - 9742

← 1 2 3 4 5 →