Continuous interval type-2 fuzzy Q-learning algorithm for trajectory tracking tasks for vehicles

被引:2
作者
Xuan, Chengbin [1 ]
Lam, Hak-Keung [1 ]
Shi, Qian [1 ]
Chen, Ming [1 ]
机构
[1] Kings Coll London, Dept Engn, London WC2R 2LS, England
关键词
reinforcement learning; interval type-2 fuzzy system; vehicle automation; fuzzy Q-learning; fuzzy control;
D O I
10.1002/rnc.6056
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Trajectory tracking is a fundamental but challenging task for vehicle automation. In addition to the system nonlinearity, the main difficulties in the trajectory tracking task are due to the environmental noise and the model uncertainties under different driving scenarios. Considering the uncertainties in the environment, the reinforcement learning method with continuous action and noise-resistance capability could be a promising way to overcome these issues. In this article, a novel continuous interval type-2 fuzzy Q-learning (CIT2FQL) algorithm is proposed to deal with the trajectory tracking task. By introducing the n-dimensional interval type-2 fuzzy inference system (n-D IT2FIS) in fuzzy Q-learning, our proposed method achieves the continuous Q-learning by combining the action interpolation with IT2FIS for the first time. We also proposed a simplified type-reduction method for n-D IT2FIS to improve the computing efficiency of the proposed method. Moreover, a radial basis function (RBF) layer is chosen as the basis function to achieve the q-value interpolation. Finally, a trajectory tracking task in a simulation environment is conducted to verify the effectiveness and robustness of the proposed method under different scenarios. The results demonstrate that the proposed method has better robustness and noise-resistance capability while maintaining good tracking performance compared with the state-of-the-art baseline algorithms including double deep Q network (DDQN), proximal policy optimization (PPO), and interval type-2 dynamic fuzzy Q-learning (IT2DFQL).
引用
收藏
页码:4788 / 4815
页数:28
相关论文
共 50 条
  • [1] Self-Organizing Interval Type-2 Fuzzy Q-Learning For Reinforcement Fuzzy Control
    Hsu, Chia-Hung
    Juang, Chia-Feng
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 2033 - 2038
  • [2] ENHANCEMENTS OF FUZZY Q-LEARNING ALGORITHM
    Glowaty, Grzegorz
    COMPUTER SCIENCE-AGH, 2005, 7 : 77 - 87
  • [3] A Q-learning Based Continuous Tuning of Fuzzy Wall Tracking without Exploration
    Valiollahi, S.
    Ghaderi, R.
    Ebrahimzadeh, A.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2012, 25 (04): : 355 - 366
  • [4] Adaptive hierarchical interval type-2 fuzzy control for trajectory tracking of an underactuated AUV with uncertain dynamics
    Wang, Yuxuan
    Hou, Yaochun
    Ye, Mengjia
    Lai, Zhounian
    Cao, Linlin
    Wu, Dazhuan
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2024,
  • [5] Anomaly Detection using Fuzzy Q-learning Algorithm
    Shamshirband, Shahaboddin
    Anuar, Nor Badrul
    Kiah, Miss Laiha Mat
    Misra, Sanjay
    ACTA POLYTECHNICA HUNGARICA, 2014, 11 (08) : 5 - 28
  • [6] Fish growth trajectory tracking using Q-learning in precision aquaculture
    Chahid, Abderrazak
    N'Doye, Ibrahima
    Majoris, John E.
    Berumen, Michael L.
    Laleg-Kirati, Taous-Meriem
    AQUACULTURE, 2022, 550
  • [7] Stochastic Genetic Algorithm-Assisted Fuzzy Q-Learning for Robotic Manipulators
    Kukker, Amit
    Sharma, Rajneesh
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2021, 46 (10) : 9527 - 9539
  • [8] A Fuzzy Q-Learning Algorithm for Storage Optimization in Islanding Microgrid
    Yu, Yunjun
    Qin, Yang
    Gong, Hancheng
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2021, 16 (05) : 2343 - 2353
  • [9] A Fuzzy Q-Learning Algorithm for Storage Optimization in Islanding Microgrid
    Yunjun Yu
    Yang Qin
    Hancheng Gong
    Journal of Electrical Engineering & Technology, 2021, 16 : 2343 - 2353
  • [10] Reinforcement distribution in continuous state action space fuzzy Q-learning: A novel approach
    Bonarini, A
    Montrone, F
    Restelli, M
    FUZZY LOGIC AND APPLICATIONS, 2006, 3849 : 40 - 45