Safe Reinforcement Learning for Model-Reference Trajectory Tracking of Uncertain Autonomous Vehicles With Model-Based Acceleration

被引：23

作者：

Hu, Yifan ^{[1
]}

Fu, Junjie ^{[1
,2
]}

Wen, Guanghui ^{[1
]}

机构：

[1] Southeast Univ, Sch Math, Nanjing 210096, Peoples R China

[2] Purple Mt Labs, Nanjing 211111, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2023年 / 8卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Safety; Predictive models; Trajectory tracking; Training; Reinforcement learning; Heuristic algorithms; Uncertainty; Model-reference control; autonomous vehicle; safe reinforcement learning; model-based reinforcement learning; Gaussian process; control barrier function;

D O I：

10.1109/TIV.2022.3233592

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Applying reinforcement learning (RL) algorithms to control systems design remains a challenging task due to the potential unsafe exploration and the low sample efficiency. In this paper, we propose a novel safe model-based RL algorithm to solve the collision-free model-reference trajectory tracking problem of uncertain autonomous vehicles (AVs). Firstly, a new type of robust control barrier function (CBF) condition for collision-avoidance is derived for the uncertain AVs by incorporating the estimation of the system uncertainty with Gaussian process (GP) regression. Then, a robust CBF-based RL control structure is proposed, where the nominal control input is composed of the RL policy and a model-based reference control policy. The actual control input obtained from the quadratic programming problem can satisfy the constraints of collision-avoidance, input saturation and velocity boundedness simultaneously with a relatively high probability. Finally, within this control structure, a Dyna-style safe model-based RL algorithm is proposed, where the safe exploration is achieved through executing the robust CBF-based actions and the sample efficiency is improved by leveraging the GP models. The superior learning performance of the proposed RL control structure is demonstrated through simulation experiments.

引用

页码：2332 / 2344

页数：13

共 50 条

[1] Model-Reference Reinforcement Learning for Collision-Free Tracking Control of Autonomous Surface Vehicles
Zhang, Qingrui
Pan, Wei
Reppa, Vasso
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 8770 - 8781
[2] Model-Based Reinforcement Learning for Trajectory Tracking of Musculoskeletal Robots
Xu, Haoran
Fan, Jianyin
Wang, Qiang
2023 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, I2MTC, 2023,
[3] Model-Based Safe Reinforcement Learning With Time-Varying Constraints: Applications to Intelligent Vehicles
Zhang, Xinglong
Peng, Yaoqian
Luo, Biao
Pan, Wei
Xu, Xin
Xie, Haibin
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (10) : 12744 - 12753
[4] Model-Based Offline Reinforcement Learning for Autonomous Delivery of Guidewire
Li, Hao
Zhou, Xiao-Hu
Xie, Xiao-Liang
Liu, Shi-Qi
Feng, Zhen-Qiu
Gui, Mei-Jiang
Xiang, Tian-Yu
Huang, De-Xing
Hou, Zeng-Guang
IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2024, 6 (03): : 1054 - 1062
[5] Supervised reinforcement learning based trajectory tracking control for autonomous vehicles
Mihaly, Andras
Van Tan Vu
Trong Tu Do
Gaspar, Peter
IFAC PAPERSONLINE, 2024, 58 (10): : 140 - 145
[6] SAMBA: safe model-based & active reinforcement learning
Cowen-Rivers, Alexander, I
Palenicek, Daniel
Moens, Vincent
Abdullah, Mohammed Amin
Sootla, Aivar
Wang, Jun
Bou-Ammar, Haitham
MACHINE LEARNING, 2022, 111 (01) : 173 - 203
[7] SAMBA: safe model-based & active reinforcement learning
Alexander I. Cowen-Rivers
Daniel Palenicek
Vincent Moens
Mohammed Amin Abdullah
Aivar Sootla
Jun Wang
Haitham Bou-Ammar
Machine Learning, 2022, 111 : 173 - 203
[8] Trajectory tracking algorithm for autonomous vehicles using adaptive reinforcement learning
De Paula, Mariano
Acosta, Gerardo G.
OCEANS 2015 - MTS/IEEE WASHINGTON, 2015,
[9] Model-reference adaptive sliding mode control of longitudinal speed tracking for autonomous vehicles
Jo, Ara
Lee, Hyunsung
Seo, Dabin
Yi, Kyongsu
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2023, 237 (2-3) : 493 - 515
[10] Multi-ASV Coordinated Tracking With Unknown Dynamics and Input Underactuation via Model-Reference Reinforcement Learning Control
Hu, Wenbo
Chen, Fei
Xiang, Linying
Chen, Guanrong
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (10) : 6588 - 6597

← 1 2 3 4 5 →