Driver Behavior Modeling via Inverse Reinforcement Learning Based on Particle Swarm Optimization

被引:1
|
作者
Liu, Zeng-Jie [1 ]
Wu, Huai-Ning [1 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China
来源
2020 CHINESE AUTOMATION CONGRESS (CAC 2020) | 2020年
关键词
Driver behavior modeling; T-S fuzzy model; inverse reinforcement learning; particle swarm optimization;
D O I
10.1109/CAC51589.2020.9327174
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an inverse reinforcement learning method based on particle swarm optimization (PSO) is proposed to model driver's steering behavior. Initially, the vehicle dynamics is represented by a Takagi-Sugeno (T-S) fuzzy model which provides a method of approximating Q-function. Then the driver behavior model is described as an optimal control policy with decision-making model which illustrates the driving style. Subsequently, the Q-function is approximated by a quadratic polynomial-in-memberships form and the PSO algorithm is used to obtain the decision-making model from the driving data. And the corresponding optimal control policy is obtained by using the Q-learning policy iteration method. Finally, a numerical simulation is carried to show the effectiveness of the proposed method.
引用
收藏
页码:7232 / 7237
页数:6
相关论文
共 50 条
  • [31] Biogeography-based learning particle swarm optimization
    Xu Chen
    Huaglory Tianfield
    Congli Mei
    Wenli Du
    Guohai Liu
    Soft Computing, 2017, 21 : 7519 - 7541
  • [32] Particle Swarm Optimization based on Vector Gaussian Learning
    Zhao, Jia
    Lv, Li
    Wang, Hui
    Sun, Hui
    Wu, Runxiu
    Nie, Jugen
    Xie, Zhifeng
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (04): : 2038 - 2057
  • [33] A behavior fusion method based on inverse reinforcement learning
    Shi, Haobin
    Li, Jingchen
    Chen, Shicong
    Hwang, Kao-Shing
    INFORMATION SCIENCES, 2022, 609 : 429 - 444
  • [34] Decision Boundary Learning Based on Particle Swarm Optimization
    Watarai, Kyohei
    Zhao, Qiangfu
    Kaneda, Yuya
    4TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2012), 2012, : 41 - 45
  • [35] Inverse Design of Metamaterial Absorber Sensor Based on Particle Swarm Optimization
    Han D.
    Ma Z.
    Wang J.
    Wang X.
    Liu S.
    Zhongguo Jiguang/Chinese Journal of Lasers, 2022, 49 (17):
  • [36] A Bayesian Approach forQuantifying Data Scarcity when Modeling Human Behavior via Inverse Reinforcement Learning
    Hossain, Tahera
    Shen, Wanggang
    Antar, Anindya
    Prabhudesai, Snehal
    Inoue, Sozo
    Huan, Xun
    Banovic, Nikola
    ACM TRANSACTIONS ON COMPUTER-HUMAN INTERACTION, 2023, 30 (01)
  • [37] Multi-swarm particle swarm optimization based on autonomic learning and elite swarm
    Jiang, Hai-Yan
    Wang, Fang-Fang
    Guo, Xiao-Qing
    Zhuang, Jia-Xiang
    Kongzhi yu Juece/Control and Decision, 2014, 29 (11): : 2034 - 2040
  • [38] Multi-swarm Particle Swarm Optimization Based on Mixed Search Behavior
    Jie, Jing
    Wang, Wanliang
    Liu, Chunsheng
    Hou, Beiping
    ICIEA 2010: PROCEEDINGS OF THE 5TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOL 2, 2010, : 32 - +
  • [39] Modeling framework of human driving behavior based on Deep Maximum Entropy Inverse Reinforcement Learning
    Wang, Yongjie
    Niu, Yuchen
    Xiao, Mei
    Zhu, Wenying
    You, Xinshang
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2024, 652
  • [40] A new Reinforcement Learning-based Memetic Particle Swarm Optimizer
    Samma, Hussein
    Lim, Chee Peng
    Saleh, Junita Mohamad
    APPLIED SOFT COMPUTING, 2016, 43 : 276 - 297