Safe Reinforcement Learning for Model-Reference Trajectory Tracking of Uncertain Autonomous Vehicles With Model-Based Acceleration

被引:23
作者
Hu, Yifan [1 ]
Fu, Junjie [1 ,2 ]
Wen, Guanghui [1 ]
机构
[1] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China
[2] Purple Mt Labs, Nanjing 211111, Peoples R China
来源
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2023年 / 8卷 / 03期
基金
中国国家自然科学基金;
关键词
Safety; Predictive models; Trajectory tracking; Training; Reinforcement learning; Heuristic algorithms; Uncertainty; Model-reference control; autonomous vehicle; safe reinforcement learning; model-based reinforcement learning; Gaussian process; control barrier function;
D O I
10.1109/TIV.2022.3233592
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Applying reinforcement learning (RL) algorithms to control systems design remains a challenging task due to the potential unsafe exploration and the low sample efficiency. In this paper, we propose a novel safe model-based RL algorithm to solve the collision-free model-reference trajectory tracking problem of uncertain autonomous vehicles (AVs). Firstly, a new type of robust control barrier function (CBF) condition for collision-avoidance is derived for the uncertain AVs by incorporating the estimation of the system uncertainty with Gaussian process (GP) regression. Then, a robust CBF-based RL control structure is proposed, where the nominal control input is composed of the RL policy and a model-based reference control policy. The actual control input obtained from the quadratic programming problem can satisfy the constraints of collision-avoidance, input saturation and velocity boundedness simultaneously with a relatively high probability. Finally, within this control structure, a Dyna-style safe model-based RL algorithm is proposed, where the safe exploration is achieved through executing the robust CBF-based actions and the sample efficiency is improved by leveraging the GP models. The superior learning performance of the proposed RL control structure is demonstrated through simulation experiments.
引用
收藏
页码:2332 / 2344
页数:13
相关论文
共 50 条
  • [1] Model-Reference Reinforcement Learning for Collision-Free Tracking Control of Autonomous Surface Vehicles
    Zhang, Qingrui
    Pan, Wei
    Reppa, Vasso
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 8770 - 8781
  • [2] Model-Based Reinforcement Learning for Trajectory Tracking of Musculoskeletal Robots
    Xu, Haoran
    Fan, Jianyin
    Wang, Qiang
    2023 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, I2MTC, 2023,
  • [3] Model-Based Safe Reinforcement Learning With Time-Varying Constraints: Applications to Intelligent Vehicles
    Zhang, Xinglong
    Peng, Yaoqian
    Luo, Biao
    Pan, Wei
    Xu, Xin
    Xie, Haibin
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (10) : 12744 - 12753
  • [4] Model-Based Offline Reinforcement Learning for Autonomous Delivery of Guidewire
    Li, Hao
    Zhou, Xiao-Hu
    Xie, Xiao-Liang
    Liu, Shi-Qi
    Feng, Zhen-Qiu
    Gui, Mei-Jiang
    Xiang, Tian-Yu
    Huang, De-Xing
    Hou, Zeng-Guang
    IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2024, 6 (03): : 1054 - 1062
  • [5] Supervised reinforcement learning based trajectory tracking control for autonomous vehicles
    Mihaly, Andras
    Van Tan Vu
    Trong Tu Do
    Gaspar, Peter
    IFAC PAPERSONLINE, 2024, 58 (10): : 140 - 145
  • [6] SAMBA: safe model-based & active reinforcement learning
    Cowen-Rivers, Alexander, I
    Palenicek, Daniel
    Moens, Vincent
    Abdullah, Mohammed Amin
    Sootla, Aivar
    Wang, Jun
    Bou-Ammar, Haitham
    MACHINE LEARNING, 2022, 111 (01) : 173 - 203
  • [7] SAMBA: safe model-based & active reinforcement learning
    Alexander I. Cowen-Rivers
    Daniel Palenicek
    Vincent Moens
    Mohammed Amin Abdullah
    Aivar Sootla
    Jun Wang
    Haitham Bou-Ammar
    Machine Learning, 2022, 111 : 173 - 203
  • [8] Trajectory tracking algorithm for autonomous vehicles using adaptive reinforcement learning
    De Paula, Mariano
    Acosta, Gerardo G.
    OCEANS 2015 - MTS/IEEE WASHINGTON, 2015,
  • [9] Model-reference adaptive sliding mode control of longitudinal speed tracking for autonomous vehicles
    Jo, Ara
    Lee, Hyunsung
    Seo, Dabin
    Yi, Kyongsu
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2023, 237 (2-3) : 493 - 515
  • [10] Multi-ASV Coordinated Tracking With Unknown Dynamics and Input Underactuation via Model-Reference Reinforcement Learning Control
    Hu, Wenbo
    Chen, Fei
    Xiang, Linying
    Chen, Guanrong
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (10) : 6588 - 6597