Driver Behavior Modeling via Inverse Reinforcement Learning Based on Particle Swarm Optimization

被引:1
|
作者
Liu, Zeng-Jie [1 ]
Wu, Huai-Ning [1 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China
来源
2020 CHINESE AUTOMATION CONGRESS (CAC 2020) | 2020年
关键词
Driver behavior modeling; T-S fuzzy model; inverse reinforcement learning; particle swarm optimization;
D O I
10.1109/CAC51589.2020.9327174
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an inverse reinforcement learning method based on particle swarm optimization (PSO) is proposed to model driver's steering behavior. Initially, the vehicle dynamics is represented by a Takagi-Sugeno (T-S) fuzzy model which provides a method of approximating Q-function. Then the driver behavior model is described as an optimal control policy with decision-making model which illustrates the driving style. Subsequently, the Q-function is approximated by a quadratic polynomial-in-memberships form and the PSO algorithm is used to obtain the decision-making model from the driving data. And the corresponding optimal control policy is obtained by using the Q-learning policy iteration method. Finally, a numerical simulation is carried to show the effectiveness of the proposed method.
引用
收藏
页码:7232 / 7237
页数:6
相关论文
共 50 条
  • [41] Enhanced Particle Swarm Optimization Based on Reference Direction and Inverse Model for Optimization Problems
    Li, Wei
    Fan, Yaochi
    Xu, Qingzheng
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2020, 13 (01) : 98 - 129
  • [42] Enhanced Particle Swarm Optimization Based on Reference Direction and Inverse Model for Optimization Problems
    Wei Li
    Yaochi Fan
    Qingzheng Xu
    International Journal of Computational Intelligence Systems, 2020, 13 : 98 - 129
  • [43] Example-based learning particle swarm optimization for continuous optimization
    Huang, Han
    Qin, Hu
    Hao, Zhifeng
    Lim, Andrew
    INFORMATION SCIENCES, 2012, 182 (01) : 125 - 138
  • [44] Analyzing Sensor-Based Individual and Population Behavior Patterns via Inverse Reinforcement Learning
    Lin, Beiyu
    Cook, Diane J.
    SENSORS, 2020, 20 (18) : 1 - 21
  • [45] OPTIMAL BANDWIDTH DESIGN FOR LAZY LEARNING VIA PARTICLE SWARM OPTIMIZATION
    Pan, Tian Hong
    Li, Shaoyuan
    Li, Ning
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2009, 15 (01) : 1 - 11
  • [46] Learning Behavior Styles with Inverse Reinforcement Learning
    Lee, Seong Jae
    popovic, Zoran
    ACM TRANSACTIONS ON GRAPHICS, 2010, 29 (04):
  • [47] Contribution to Inverse Kinematic Modeling of a Planar Continuum Robot Using a Particle Swarm Optimization
    Amouri, Ammar
    Mahfoudi, Chawki
    Zaatri, Abdelouahab
    MULTIPHYSICS MODELLING AND SIMULATION FOR SYSTEMS DESIGN AND MONITORING, 2015, 2 : 141 - 150
  • [48] Dynamic Multi-swarm Particle Swarm Optimization Based on Mite Learning
    Tang, Yichao
    Wei, Bo
    Xia, Xuewen
    Gui, Ling
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2311 - 2318
  • [49] Dynamic Multi-Swarm Particle Swarm Optimization Based on Elite Learning
    Xia, Xuewen
    Tang, Yichao
    Wei, Bo
    Gui, Ling
    IEEE ACCESS, 2019, 7 : 184849 - 184865
  • [50] Sparse Unmixing for Hyperspectral Imagery via Comprehensive-Learning-Based Particle Swarm Optimization
    Miao, Yapeng
    Yang, Bin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 9727 - 9742