Continuous interval type-2 fuzzy Q-learning algorithm for trajectory tracking tasks for vehicles

Cited by: 2
Authors
Xuan, Chengbin [1]
Lam, Hak-Keung [1]
Shi, Qian [1]
Chen, Ming [1]
Affiliations
[1] Kings Coll London, Dept Engn, London WC2R 2LS, England
Keywords
reinforcement learning; interval type-2 fuzzy system; vehicle automation; fuzzy Q-learning; fuzzy control;
DOI
10.1002/rnc.6056
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Trajectory tracking is a fundamental but challenging task for vehicle automation. In addition to system nonlinearity, the main difficulties in trajectory tracking stem from environmental noise and model uncertainties under different driving scenarios. Given these uncertainties, a reinforcement learning method with continuous actions and noise-resistance capability is a promising way to overcome these issues. In this article, a novel continuous interval type-2 fuzzy Q-learning (CIT2FQL) algorithm is proposed for the trajectory tracking task. By introducing the n-dimensional interval type-2 fuzzy inference system (n-D IT2FIS) into fuzzy Q-learning, the proposed method achieves continuous Q-learning by combining action interpolation with IT2FIS for the first time. We also propose a simplified type-reduction method for n-D IT2FIS to improve the computational efficiency of the proposed method. Moreover, a radial basis function (RBF) layer is chosen as the basis function to achieve q-value interpolation. Finally, a trajectory tracking task is carried out in a simulation environment to verify the effectiveness and robustness of the proposed method under different scenarios. The results demonstrate that the proposed method has better robustness and noise-resistance capability while maintaining good tracking performance compared with state-of-the-art baseline algorithms, including double deep Q network (DDQN), proximal policy optimization (PPO), and interval type-2 dynamic fuzzy Q-learning (IT2DFQL).
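The q-value interpolation idea mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' CIT2FQL implementation: the abstract does not give the formulation, so the normalized Gaussian RBF weighting, the grid search for the greedy action, and all names (rbf_weights, interpolate_q, greedy_continuous_action, width, n_grid) and sample numbers are assumptions made only for illustration. The sketch shows how q-values stored for a few discrete candidate actions might be interpolated so that an action can be chosen from a continuous range.

```python
import numpy as np


def rbf_weights(action, centers, width):
    """Normalized Gaussian RBF weights of a scalar action w.r.t. the candidate actions."""
    w = np.exp(-((action - centers) ** 2) / (2.0 * width ** 2))
    return w / w.sum()


def interpolate_q(action, centers, q_values, width):
    """Interpolated q-value: a convex combination of the stored q-values."""
    return float(rbf_weights(action, centers, width) @ q_values)


def greedy_continuous_action(centers, q_values, width, n_grid=201):
    """Pick the action on a fine grid that maximizes the interpolated q-value.

    The grid search is an assumption of this sketch, not a step taken from the paper.
    """
    grid = np.linspace(centers.min(), centers.max(), n_grid)
    q_grid = np.array([interpolate_q(a, centers, q_values, width) for a in grid])
    best = int(np.argmax(q_grid))
    return float(grid[best]), float(q_grid[best])


# Hypothetical example: three candidate steering actions with made-up q-values.
centers = np.array([-1.0, 0.0, 1.0])
q_values = np.array([0.2, 1.0, 0.6])
action, q = greedy_continuous_action(centers, q_values, width=0.5)
print(action, q)
```

Because the weights are normalized, the interpolated q-value never exceeds the largest stored q-value, and the selected action can fall between the discrete candidates rather than on one of them.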
Pages: 4788-4815
Number of pages: 28