Driver Behavior Modeling via Inverse Reinforcement Learning Based on Particle Swarm Optimization

Cited by: 1
Authors
Liu, Zeng-Jie [1]
Wu, Huai-Ning [1]
Affiliations
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China
Source
2020 CHINESE AUTOMATION CONGRESS (CAC 2020) | 2020
Keywords
Driver behavior modeling; T-S fuzzy model; inverse reinforcement learning; particle swarm optimization
DOI
10.1109/CAC51589.2020.9327174
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, an inverse reinforcement learning method based on particle swarm optimization (PSO) is proposed to model a driver's steering behavior. First, the vehicle dynamics are represented by a Takagi-Sugeno (T-S) fuzzy model, which provides a way to approximate the Q-function. The driver behavior model is then described as an optimal control policy together with a decision-making model that characterizes the driving style. Next, the Q-function is approximated in a quadratic polynomial-in-memberships form, and the PSO algorithm is used to obtain the decision-making model from the driving data. The corresponding optimal control policy is obtained with the Q-learning policy iteration method. Finally, a numerical simulation is carried out to show the effectiveness of the proposed method.
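The abstract does not give the objective function, the parameterization of the decision-making model, or the PSO settings, so the following is only a minimal sketch of a generic PSO loop, not the authors' implementation. Everything in it is a hypothetical stand-in: the function `pso_minimize`, the toy `mismatch` objective, the sample `STATES`, and the "true" weights. The toy objective measures how far a candidate linear steering law is from a handful of demonstrated actions; in the paper's setting the objective would instead score candidate decision-making models against recorded driving data, with the policy computed by Q-learning policy iteration.

```python
import random


def pso_minimize(objective, bounds, n_particles=30, n_iters=200,
                 inertia=0.7, c_personal=1.5, c_global=1.5, seed=0):
    """Generic particle swarm minimizer (illustrative, not from the paper)."""
    rng = random.Random(seed)
    dim = len(bounds)
    # Random initial positions inside the bounds, zero initial velocities.
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]              # personal best positions
    best_val = [objective(p) for p in pos]      # personal best values
    g = min(range(n_particles), key=lambda i: best_val[i])
    g_pos, g_val = best_pos[g][:], best_val[g]  # global best so far

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity update: inertia + pull to personal best + pull to global best.
                vel[i][d] = (inertia * vel[i][d]
                             + c_personal * r1 * (best_pos[i][d] - pos[i][d])
                             + c_global * r2 * (g_pos[d] - pos[i][d]))
                lo, hi = bounds[d]
                # Move the particle and clamp it to the search box.
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            val = objective(pos[i])
            if val < best_val[i]:
                best_val[i], best_pos[i] = val, pos[i][:]
                if val < g_val:
                    g_val, g_pos = val, pos[i][:]
    return g_pos, g_val


# Hypothetical demonstration data: actions generated by u = -(w1*x1 + w2*x2).
TRUE_WEIGHTS = [2.0, 0.5]
STATES = [(-1.0, 0.5), (0.3, -0.8), (1.2, 0.1), (-0.4, -0.6)]
DEMO = [-(TRUE_WEIGHTS[0] * x1 + TRUE_WEIGHTS[1] * x2) for x1, x2 in STATES]


def mismatch(w):
    """Squared error between candidate actions and demonstrated actions."""
    return sum((-(w[0] * x1 + w[1] * x2) - u) ** 2
               for (x1, x2), u in zip(STATES, DEMO))


weights, error = pso_minimize(mismatch, bounds=[(0.0, 5.0), (0.0, 5.0)])
```

PSO needs only objective evaluations, not gradients, which is one plausible reason to use it when each evaluation involves an inner policy-iteration step; the abstract itself does not state the motivation.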
Pages: 7232-7237
Page count: 6
Related papers
50 in total
  • [21] Particle swarm optimization for generating interpretable fuzzy reinforcement learning policies
    Hein, Daniel
    Hentschel, Alexander
    Runkler, Thomas
    Udluft, Steffen
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 65 : 87 - 98
  • [22] Fault Tolerant Control Using Reinforcement Learning and Particle Swarm Optimization
    Zhang, Dapeng
    Gao, Zhiwei
    IEEE ACCESS, 2020, 8 : 168802 - 168811
  • [23] Inverse design of semiconductor laser parameters based on deep learning and particle swarm optimization method
    Ma, Zihao
    Feng, Pei
    Li, Yu
    ELEVENTH INTERNATIONAL CONFERENCE ON INFORMATION OPTICS AND PHOTONICS (CIOP 2019), 2019, 11209
  • [24] Trajectory modeling via random utility inverse reinforcement learning
    Pitombeira-Neto, Anselmo R.
    Santos, Helano P.
    da Silva, Ticiana L. Coelho
    de Macedo, Jose Antonio F.
    INFORMATION SCIENCES, 2024, 660
  • [26] A Review of Geophysical Modeling Based on Particle Swarm Optimization
    Pace, Francesca
    Santilano, Alessandro
    Godio, Alberto
    SURVEYS IN GEOPHYSICS, 2021, 42 (03) : 505 - 549
  • [27] Inverse Modeling in Geoenvironmental Engineering Using a Novel Particle Swarm Optimization Algorithm
    Bharat, Tadikonda Venkata
    Sharma, Jitendra
    SWARM INTELLIGENCE, 2010, 6234 : 448 - 455
  • [28] Power System Load Frequency Active Disturbance Rejection Control via Reinforcement Learning-Based Memetic Particle Swarm Optimization
    Zheng, Yuemin
    Huang, Zhaoyang
    Tao, Jin
    Sun, Hao
    Sun, Qinglin
    Dehmer, Matthias
    Sun, Mingwei
    Chen, Zengqiang
    IEEE ACCESS, 2021, 9 : 116194 - 116206
  • [29] Biogeography-based learning particle swarm optimization
    Chen, Xu
    Tianfield, Huaglory
    Mei, Congli
    Du, Wenli
    Liu, Guohai
    SOFT COMPUTING, 2017, 21 (24) : 7519 - 7541
  • [30] Iterative Learning Controller Based on Particle Swarm Optimization
    Wen, Xiulan
    Li, Hongsheng
    Wang, Dongxia
    Huang, Jiacai
    MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 593 - 600