Continuous interval type-2 fuzzy Q-learning algorithm for trajectory tracking tasks for vehicles

被引:2
作者
Xuan, Chengbin [1 ]
Lam, Hak-Keung [1 ]
Shi, Qian [1 ]
Chen, Ming [1 ]
机构
[1] Kings Coll London, Dept Engn, London WC2R 2LS, England
关键词
reinforcement learning; interval type-2 fuzzy system; vehicle automation; fuzzy Q-learning; fuzzy control;
D O I
10.1002/rnc.6056
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Trajectory tracking is a fundamental but challenging task for vehicle automation. In addition to the system nonlinearity, the main difficulties in the trajectory tracking task are due to the environmental noise and the model uncertainties under different driving scenarios. Considering the uncertainties in the environment, the reinforcement learning method with continuous action and noise-resistance capability could be a promising way to overcome these issues. In this article, a novel continuous interval type-2 fuzzy Q-learning (CIT2FQL) algorithm is proposed to deal with the trajectory tracking task. By introducing the n-dimensional interval type-2 fuzzy inference system (n-D IT2FIS) in fuzzy Q-learning, our proposed method achieves the continuous Q-learning by combining the action interpolation with IT2FIS for the first time. We also proposed a simplified type-reduction method for n-D IT2FIS to improve the computing efficiency of the proposed method. Moreover, a radial basis function (RBF) layer is chosen as the basis function to achieve the q-value interpolation. Finally, a trajectory tracking task in a simulation environment is conducted to verify the effectiveness and robustness of the proposed method under different scenarios. The results demonstrate that the proposed method has better robustness and noise-resistance capability while maintaining good tracking performance compared with the state-of-the-art baseline algorithms including double deep Q network (DDQN), proximal policy optimization (PPO), and interval type-2 dynamic fuzzy Q-learning (IT2DFQL).
引用
收藏
页码:4788 / 4815
页数:28
相关论文
共 50 条
  • [31] Reinforcement Self-Organizing Interval Type-2 Fuzzy System with Ant Colony Optimization
    Juang, Chia-Feng
    Hsu, Chia-Hung
    Chuang, Chia-Feng
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 771 - 776
  • [32] FQBDDA: fuzzy Q-learning based DDoS attack detection algorithm for cloud computing environment
    Kumar A.
    Dutta S.
    Pranav P.
    International Journal of Information Technology, 2024, 16 (2) : 891 - 900
  • [33] Interval-Boundary-Dependent Control for Interval Type-2 T–S Fuzzy Systems
    Park B.Y.
    Shin J.W.
    Park, Bum Yong (bumyong.park@kumoh.ac.kr), 2018, Springer Science and Business Media, LLC (29) : 677 - 683
  • [34] A new mobile robot navigation method using fuzzy logic and a modified Q-learning algorithm
    Boubertakh, H.
    Tadjine, M.
    Glorennec, P. -Y.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2010, 21 (1-2) : 113 - 119
  • [35] A Novel Structure of Actor-Critic Learning Based on an Interval Type-2 TSK Fuzzy Neural Network
    Khater, A. Aziz
    El-Nagar, Ahmad M.
    El-Bardini, Mohammad
    El-Rabaie, Nabila
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (11) : 3047 - 3061
  • [36] An Online Energy Management Strategy for Fuel Cell Vehicles Based on Fuzzy Q-Learning and Road Condition Recognition
    Yang, Duo
    Wang, Siyu
    Liao, Yuefeng
    Pan, Rui
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (09) : 12120 - 12130
  • [37] Design of Interval Type-2 Fuzzy Controllers for Active Magnetic Bearing Systems
    Ren, Gui-Ping
    Chen, Zhiyong
    Zhang, Hai-Tao
    Wu, Yue
    Meng, Haofei
    Wu, Dongrui
    Ding, Han
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2020, 25 (05) : 2449 - 2459
  • [38] Optimization of interval type-2 fuzzy logic controllers using evolutionary algorithms
    O. Castillo
    P. Melin
    A. Alanis
    O. Montiel
    R. Sepulveda
    Soft Computing, 2011, 15 : 1145 - 1160
  • [39] Stability Analysis and Controller Design of Discrete Interval Type-2 Fuzzy Systems
    Sheng, Long
    Li, Chunguang
    2011 INTERNATIONAL CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND AUTOMATION (CCCA 2011), VOL III, 2010, : 132 - 135
  • [40] Optimization of interval type-2 fuzzy logic controllers using evolutionary algorithms
    Castillo, O.
    Melin, P.
    Alanis, A.
    Montiel, O.
    Sepulveda, R.
    SOFT COMPUTING, 2011, 15 (06) : 1145 - 1160